Evaluation of classification techniques for identifying fake reviews about products and services on the internet

Detalhes bibliográficos
Autor(a) principal: Santos,Andrey Schmidt dos
Data de Publicação: 2020
Outros Autores: Camargo,Luis Felipe Riehs, Lacerda,Daniel Pacheco
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Gestão & Produção
Texto Completo: http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0104-530X2020000400202
Resumo: Abstract: With the e-commerce growth, more people are buying products over the internet. To increase customer satisfaction, merchants provide spaces for product and service reviews. Products with positive reviews attract customers, while products with negative reviews lose customers. Following this idea, some individuals and corporations write fake reviews to promote their products and services or defame their competitors. The difficulty for finding these reviews was in the large amount of information available. One solution is to use data mining techniques and tools, such as the classification function. Exploring this situation, the present work evaluates classification techniques to identify fake reviews about products and services on the Internet. The research also presents a literature systematic review on fake reviews. The research used 8 classification algorithms. The algorithms were trained and tested with a hotels database. The CONCENSO algorithm presented the best result, with 88% in the precision indicator. After the first test, the algorithms classified reviews on another hotels database. To compare the results of this new classification, the Review Skeptic algorithm was used. The SVM and GLMNET algorithms presented the highest convergence with the Review Skeptic algorithm, classifying 83% of reviews with the same result. The research contributes by demonstrating the algorithms ability to understand consumers’ real reviews to products and services on the Internet. Another contribution is to be the pioneer in the investigation of fake reviews in Brazil and in production engineering.
id UFSCAR-3_0f64984d6ede7489b663cbbb942183b7
oai_identifier_str oai:scielo:S0104-530X2020000400202
network_acronym_str UFSCAR-3
network_name_str Gestão & Produção
repository_id_str
spelling Evaluation of classification techniques for identifying fake reviews about products and services on the internetFake reviewsText classificationKnowledge discovery in databasesText miningAbstract: With the e-commerce growth, more people are buying products over the internet. To increase customer satisfaction, merchants provide spaces for product and service reviews. Products with positive reviews attract customers, while products with negative reviews lose customers. Following this idea, some individuals and corporations write fake reviews to promote their products and services or defame their competitors. The difficulty for finding these reviews was in the large amount of information available. One solution is to use data mining techniques and tools, such as the classification function. Exploring this situation, the present work evaluates classification techniques to identify fake reviews about products and services on the Internet. The research also presents a literature systematic review on fake reviews. The research used 8 classification algorithms. The algorithms were trained and tested with a hotels database. The CONCENSO algorithm presented the best result, with 88% in the precision indicator. After the first test, the algorithms classified reviews on another hotels database. To compare the results of this new classification, the Review Skeptic algorithm was used. The SVM and GLMNET algorithms presented the highest convergence with the Review Skeptic algorithm, classifying 83% of reviews with the same result. The research contributes by demonstrating the algorithms ability to understand consumers’ real reviews to products and services on the Internet. Another contribution is to be the pioneer in the investigation of fake reviews in Brazil and in production engineering.Universidade Federal de São Carlos2020-01-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S0104-530X2020000400202Gestão & Produção v.27 n.4 2020reponame:Gestão & Produçãoinstname:Universidade Federal de São Carlos (UFSCAR)instacron:UFSCAR10.1590/0104-530x4672-20info:eu-repo/semantics/openAccessSantos,Andrey Schmidt dosCamargo,Luis Felipe RiehsLacerda,Daniel Pachecoeng2020-07-24T00:00:00Zoai:scielo:S0104-530X2020000400202Revistahttps://www.gestaoeproducao.com/PUBhttps://old.scielo.br/oai/scielo-oai.phpgp@dep.ufscar.br||revistagestaoemanalise@unichristus.edu.br1806-96490104-530Xopendoar:2020-07-24T00:00Gestão & Produção - Universidade Federal de São Carlos (UFSCAR)false
dc.title.none.fl_str_mv Evaluation of classification techniques for identifying fake reviews about products and services on the internet
title Evaluation of classification techniques for identifying fake reviews about products and services on the internet
spellingShingle Evaluation of classification techniques for identifying fake reviews about products and services on the internet
Santos,Andrey Schmidt dos
Fake reviews
Text classification
Knowledge discovery in databases
Text mining
title_short Evaluation of classification techniques for identifying fake reviews about products and services on the internet
title_full Evaluation of classification techniques for identifying fake reviews about products and services on the internet
title_fullStr Evaluation of classification techniques for identifying fake reviews about products and services on the internet
title_full_unstemmed Evaluation of classification techniques for identifying fake reviews about products and services on the internet
title_sort Evaluation of classification techniques for identifying fake reviews about products and services on the internet
author Santos,Andrey Schmidt dos
author_facet Santos,Andrey Schmidt dos
Camargo,Luis Felipe Riehs
Lacerda,Daniel Pacheco
author_role author
author2 Camargo,Luis Felipe Riehs
Lacerda,Daniel Pacheco
author2_role author
author
dc.contributor.author.fl_str_mv Santos,Andrey Schmidt dos
Camargo,Luis Felipe Riehs
Lacerda,Daniel Pacheco
dc.subject.por.fl_str_mv Fake reviews
Text classification
Knowledge discovery in databases
Text mining
topic Fake reviews
Text classification
Knowledge discovery in databases
Text mining
description Abstract: With the e-commerce growth, more people are buying products over the internet. To increase customer satisfaction, merchants provide spaces for product and service reviews. Products with positive reviews attract customers, while products with negative reviews lose customers. Following this idea, some individuals and corporations write fake reviews to promote their products and services or defame their competitors. The difficulty for finding these reviews was in the large amount of information available. One solution is to use data mining techniques and tools, such as the classification function. Exploring this situation, the present work evaluates classification techniques to identify fake reviews about products and services on the Internet. The research also presents a literature systematic review on fake reviews. The research used 8 classification algorithms. The algorithms were trained and tested with a hotels database. The CONCENSO algorithm presented the best result, with 88% in the precision indicator. After the first test, the algorithms classified reviews on another hotels database. To compare the results of this new classification, the Review Skeptic algorithm was used. The SVM and GLMNET algorithms presented the highest convergence with the Review Skeptic algorithm, classifying 83% of reviews with the same result. The research contributes by demonstrating the algorithms ability to understand consumers’ real reviews to products and services on the Internet. Another contribution is to be the pioneer in the investigation of fake reviews in Brazil and in production engineering.
publishDate 2020
dc.date.none.fl_str_mv 2020-01-01
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0104-530X2020000400202
url http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0104-530X2020000400202
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 10.1590/0104-530x4672-20
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv text/html
dc.publisher.none.fl_str_mv Universidade Federal de São Carlos
publisher.none.fl_str_mv Universidade Federal de São Carlos
dc.source.none.fl_str_mv Gestão & Produção v.27 n.4 2020
reponame:Gestão & Produção
instname:Universidade Federal de São Carlos (UFSCAR)
instacron:UFSCAR
instname_str Universidade Federal de São Carlos (UFSCAR)
instacron_str UFSCAR
institution UFSCAR
reponame_str Gestão & Produção
collection Gestão & Produção
repository.name.fl_str_mv Gestão & Produção - Universidade Federal de São Carlos (UFSCAR)
repository.mail.fl_str_mv gp@dep.ufscar.br||revistagestaoemanalise@unichristus.edu.br
_version_ 1750118207840059392