Leveraging national tourist offices through data analytics

Detalhes bibliográficos
Autor(a) principal: Moro, S.
Data de Publicação: 2018
Outros Autores: Rita, P., Oliveira, C., Batista, F., Ribeiro, R.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10071/16702
Resumo: Purpose This study aims to propose a data-driven approach, based on open-source tools, that makes it possible to understand customer satisfaction of the accommodation offer of a whole country. Design/methodology/approach The method starts by extracting information from all hotels of Portugal available at TripAdvisor through Web scraping. Then, a support vector machine is adopted for modeling the TripAdvisor score, which is considered a proxy of customer satisfaction. Finally, knowledge extraction from the model is achieved using sensitivity analysis to unveil the influence of features on the score. Findings The model of the TripAdvisor score achieved a mean absolute percentage error of around 5 per cent, proving the value of modeling the extracted data. The number of rooms of the unit and the minimum price are the two most relevant features, showing that customers appreciate smaller and more expensive units, whereas the location of the hotel does not hold significant relevance. Originality/value National tourist offices can use the proposed approach to understand what drives tourists’ satisfaction, helping to shape a country’s strategy. For example, licensing new hotels may take into account the unit size and other characteristics that make it more attractive to tourists. Furthermore, the procedure can be replicated at any time and in any country, making it a valuable tool for data-driven decision support on a national scale.
id RCAP_1fe316a554a307e7d2cd636a080a73bf
oai_identifier_str oai:repositorio.iscte-iul.pt:10071/16702
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Leveraging national tourist offices through data analyticsData miningData analyticsSensitivity analysisOnline reviewsNational tourist officesWeb scrapingPurpose This study aims to propose a data-driven approach, based on open-source tools, that makes it possible to understand customer satisfaction of the accommodation offer of a whole country. Design/methodology/approach The method starts by extracting information from all hotels of Portugal available at TripAdvisor through Web scraping. Then, a support vector machine is adopted for modeling the TripAdvisor score, which is considered a proxy of customer satisfaction. Finally, knowledge extraction from the model is achieved using sensitivity analysis to unveil the influence of features on the score. Findings The model of the TripAdvisor score achieved a mean absolute percentage error of around 5 per cent, proving the value of modeling the extracted data. The number of rooms of the unit and the minimum price are the two most relevant features, showing that customers appreciate smaller and more expensive units, whereas the location of the hotel does not hold significant relevance. Originality/value National tourist offices can use the proposed approach to understand what drives tourists’ satisfaction, helping to shape a country’s strategy. For example, licensing new hotels may take into account the unit size and other characteristics that make it more attractive to tourists. Furthermore, the procedure can be replicated at any time and in any country, making it a valuable tool for data-driven decision support on a national scale.Emerald2018-10-19T16:30:27Z2018-01-01T00:00:00Z20182018-11-28T09:58:07Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10071/16702eng1750-618210.1108/IJCTHR-04-2018-0051Moro, S.Rita, P.Oliveira, C.Batista, F.Ribeiro, R.info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-09T17:54:43Zoai:repositorio.iscte-iul.pt:10071/16702Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T22:27:42.812964Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Leveraging national tourist offices through data analytics
title Leveraging national tourist offices through data analytics
spellingShingle Leveraging national tourist offices through data analytics
Moro, S.
Data mining
Data analytics
Sensitivity analysis
Online reviews
National tourist offices
Web scraping
title_short Leveraging national tourist offices through data analytics
title_full Leveraging national tourist offices through data analytics
title_fullStr Leveraging national tourist offices through data analytics
title_full_unstemmed Leveraging national tourist offices through data analytics
title_sort Leveraging national tourist offices through data analytics
author Moro, S.
author_facet Moro, S.
Rita, P.
Oliveira, C.
Batista, F.
Ribeiro, R.
author_role author
author2 Rita, P.
Oliveira, C.
Batista, F.
Ribeiro, R.
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Moro, S.
Rita, P.
Oliveira, C.
Batista, F.
Ribeiro, R.
dc.subject.por.fl_str_mv Data mining
Data analytics
Sensitivity analysis
Online reviews
National tourist offices
Web scraping
topic Data mining
Data analytics
Sensitivity analysis
Online reviews
National tourist offices
Web scraping
description Purpose This study aims to propose a data-driven approach, based on open-source tools, that makes it possible to understand customer satisfaction of the accommodation offer of a whole country. Design/methodology/approach The method starts by extracting information from all hotels of Portugal available at TripAdvisor through Web scraping. Then, a support vector machine is adopted for modeling the TripAdvisor score, which is considered a proxy of customer satisfaction. Finally, knowledge extraction from the model is achieved using sensitivity analysis to unveil the influence of features on the score. Findings The model of the TripAdvisor score achieved a mean absolute percentage error of around 5 per cent, proving the value of modeling the extracted data. The number of rooms of the unit and the minimum price are the two most relevant features, showing that customers appreciate smaller and more expensive units, whereas the location of the hotel does not hold significant relevance. Originality/value National tourist offices can use the proposed approach to understand what drives tourists’ satisfaction, helping to shape a country’s strategy. For example, licensing new hotels may take into account the unit size and other characteristics that make it more attractive to tourists. Furthermore, the procedure can be replicated at any time and in any country, making it a valuable tool for data-driven decision support on a national scale.
publishDate 2018
dc.date.none.fl_str_mv 2018-10-19T16:30:27Z
2018-01-01T00:00:00Z
2018
2018-11-28T09:58:07Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10071/16702
url http://hdl.handle.net/10071/16702
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 1750-6182
10.1108/IJCTHR-04-2018-0051
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Emerald
publisher.none.fl_str_mv Emerald
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799134839555751936