Evaluation of classification techniques for identifying fake reviews about products and services on the internet
Autor(a) principal: | |
---|---|
Data de Publicação: | 2020 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Gestão & Produção |
Texto Completo: | http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0104-530X2020000400202 |
Resumo: | Abstract: With the e-commerce growth, more people are buying products over the internet. To increase customer satisfaction, merchants provide spaces for product and service reviews. Products with positive reviews attract customers, while products with negative reviews lose customers. Following this idea, some individuals and corporations write fake reviews to promote their products and services or defame their competitors. The difficulty for finding these reviews was in the large amount of information available. One solution is to use data mining techniques and tools, such as the classification function. Exploring this situation, the present work evaluates classification techniques to identify fake reviews about products and services on the Internet. The research also presents a literature systematic review on fake reviews. The research used 8 classification algorithms. The algorithms were trained and tested with a hotels database. The CONCENSO algorithm presented the best result, with 88% in the precision indicator. After the first test, the algorithms classified reviews on another hotels database. To compare the results of this new classification, the Review Skeptic algorithm was used. The SVM and GLMNET algorithms presented the highest convergence with the Review Skeptic algorithm, classifying 83% of reviews with the same result. The research contributes by demonstrating the algorithms ability to understand consumers’ real reviews to products and services on the Internet. Another contribution is to be the pioneer in the investigation of fake reviews in Brazil and in production engineering. |
id |
UFSCAR-3_0f64984d6ede7489b663cbbb942183b7 |
---|---|
oai_identifier_str |
oai:scielo:S0104-530X2020000400202 |
network_acronym_str |
UFSCAR-3 |
network_name_str |
Gestão & Produção |
repository_id_str |
|
spelling |
Evaluation of classification techniques for identifying fake reviews about products and services on the internetFake reviewsText classificationKnowledge discovery in databasesText miningAbstract: With the e-commerce growth, more people are buying products over the internet. To increase customer satisfaction, merchants provide spaces for product and service reviews. Products with positive reviews attract customers, while products with negative reviews lose customers. Following this idea, some individuals and corporations write fake reviews to promote their products and services or defame their competitors. The difficulty for finding these reviews was in the large amount of information available. One solution is to use data mining techniques and tools, such as the classification function. Exploring this situation, the present work evaluates classification techniques to identify fake reviews about products and services on the Internet. The research also presents a literature systematic review on fake reviews. The research used 8 classification algorithms. The algorithms were trained and tested with a hotels database. The CONCENSO algorithm presented the best result, with 88% in the precision indicator. After the first test, the algorithms classified reviews on another hotels database. To compare the results of this new classification, the Review Skeptic algorithm was used. The SVM and GLMNET algorithms presented the highest convergence with the Review Skeptic algorithm, classifying 83% of reviews with the same result. The research contributes by demonstrating the algorithms ability to understand consumers’ real reviews to products and services on the Internet. Another contribution is to be the pioneer in the investigation of fake reviews in Brazil and in production engineering.Universidade Federal de São Carlos2020-01-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S0104-530X2020000400202Gestão & Produção v.27 n.4 2020reponame:Gestão & Produçãoinstname:Universidade Federal de São Carlos (UFSCAR)instacron:UFSCAR10.1590/0104-530x4672-20info:eu-repo/semantics/openAccessSantos,Andrey Schmidt dosCamargo,Luis Felipe RiehsLacerda,Daniel Pachecoeng2020-07-24T00:00:00Zoai:scielo:S0104-530X2020000400202Revistahttps://www.gestaoeproducao.com/PUBhttps://old.scielo.br/oai/scielo-oai.phpgp@dep.ufscar.br||revistagestaoemanalise@unichristus.edu.br1806-96490104-530Xopendoar:2020-07-24T00:00Gestão & Produção - Universidade Federal de São Carlos (UFSCAR)false |
dc.title.none.fl_str_mv |
Evaluation of classification techniques for identifying fake reviews about products and services on the internet |
title |
Evaluation of classification techniques for identifying fake reviews about products and services on the internet |
spellingShingle |
Evaluation of classification techniques for identifying fake reviews about products and services on the internet Santos,Andrey Schmidt dos Fake reviews Text classification Knowledge discovery in databases Text mining |
title_short |
Evaluation of classification techniques for identifying fake reviews about products and services on the internet |
title_full |
Evaluation of classification techniques for identifying fake reviews about products and services on the internet |
title_fullStr |
Evaluation of classification techniques for identifying fake reviews about products and services on the internet |
title_full_unstemmed |
Evaluation of classification techniques for identifying fake reviews about products and services on the internet |
title_sort |
Evaluation of classification techniques for identifying fake reviews about products and services on the internet |
author |
Santos,Andrey Schmidt dos |
author_facet |
Santos,Andrey Schmidt dos Camargo,Luis Felipe Riehs Lacerda,Daniel Pacheco |
author_role |
author |
author2 |
Camargo,Luis Felipe Riehs Lacerda,Daniel Pacheco |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Santos,Andrey Schmidt dos Camargo,Luis Felipe Riehs Lacerda,Daniel Pacheco |
dc.subject.por.fl_str_mv |
Fake reviews Text classification Knowledge discovery in databases Text mining |
topic |
Fake reviews Text classification Knowledge discovery in databases Text mining |
description |
Abstract: With the e-commerce growth, more people are buying products over the internet. To increase customer satisfaction, merchants provide spaces for product and service reviews. Products with positive reviews attract customers, while products with negative reviews lose customers. Following this idea, some individuals and corporations write fake reviews to promote their products and services or defame their competitors. The difficulty for finding these reviews was in the large amount of information available. One solution is to use data mining techniques and tools, such as the classification function. Exploring this situation, the present work evaluates classification techniques to identify fake reviews about products and services on the Internet. The research also presents a literature systematic review on fake reviews. The research used 8 classification algorithms. The algorithms were trained and tested with a hotels database. The CONCENSO algorithm presented the best result, with 88% in the precision indicator. After the first test, the algorithms classified reviews on another hotels database. To compare the results of this new classification, the Review Skeptic algorithm was used. The SVM and GLMNET algorithms presented the highest convergence with the Review Skeptic algorithm, classifying 83% of reviews with the same result. The research contributes by demonstrating the algorithms ability to understand consumers’ real reviews to products and services on the Internet. Another contribution is to be the pioneer in the investigation of fake reviews in Brazil and in production engineering. |
publishDate |
2020 |
dc.date.none.fl_str_mv |
2020-01-01 |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0104-530X2020000400202 |
url |
http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0104-530X2020000400202 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
10.1590/0104-530x4672-20 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
text/html |
dc.publisher.none.fl_str_mv |
Universidade Federal de São Carlos |
publisher.none.fl_str_mv |
Universidade Federal de São Carlos |
dc.source.none.fl_str_mv |
Gestão & Produção v.27 n.4 2020 reponame:Gestão & Produção instname:Universidade Federal de São Carlos (UFSCAR) instacron:UFSCAR |
instname_str |
Universidade Federal de São Carlos (UFSCAR) |
instacron_str |
UFSCAR |
institution |
UFSCAR |
reponame_str |
Gestão & Produção |
collection |
Gestão & Produção |
repository.name.fl_str_mv |
Gestão & Produção - Universidade Federal de São Carlos (UFSCAR) |
repository.mail.fl_str_mv |
gp@dep.ufscar.br||revistagestaoemanalise@unichristus.edu.br |
_version_ |
1750118207840059392 |