Predicting and explaining Airbnb prices in Lisbon : machine learning approach

Detalhes bibliográficos
Autor(a) principal: Nunes, Madalena Ribeiro dos Santos Pais
Data de Publicação: 2023
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10400.14/41431
Resumo: Airbnb is an online platform that provides listing and arrangement for short-term local home renting services. Since its establishment in 2008, it has offered 7 million homes and rooms in more than 81,000 cities throughout 191 countries. Airbnb price prediction is a valuable and important task both for guests and hosts. Overall, for practical applications, these models can give a host an optimal price they should charge for their new listing. On the consumer side, this will help travellers determine whether the listing price they see is fair. Much research has been done in this field; however, the longitude and latitude of Airbnb listings are often disregarded. This project focuses on Airbnb price prediction using the most recent (Sep 2021) Airbnb data in Lisbon. Using Google Maps API, the original dataset was enriched with information on the number of ATMs, metro stations, bars and discos within a maximum radius of 1 km. Also, using the geodesic distance, the distance to the airport and the nearest attraction were computed for each listing. A Linear Regression and a Gradient Boosting algorithm were compared based on the original Airbnb dataset and the extended dataset to examine the impact of new features that have been identified. According to the results, all models perform better when the new features are included. The best results are achieved with the Gradient Boosting with the extended data, with an MAE of 0. 3102 and an adjusted R-squared of 0.4633.
id RCAP_4f7429b3ea7f3265c3fede4676dba5bc
oai_identifier_str oai:repositorio.ucp.pt:10400.14/41431
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Predicting and explaining Airbnb prices in Lisbon : machine learning approachAirbnbMachine learningPrice predictionxAIRegressionGradient boostingPrevisão de preçosRegressãoDomínio/Área Científica::Ciências Sociais::Economia e GestãoAirbnb is an online platform that provides listing and arrangement for short-term local home renting services. Since its establishment in 2008, it has offered 7 million homes and rooms in more than 81,000 cities throughout 191 countries. Airbnb price prediction is a valuable and important task both for guests and hosts. Overall, for practical applications, these models can give a host an optimal price they should charge for their new listing. On the consumer side, this will help travellers determine whether the listing price they see is fair. Much research has been done in this field; however, the longitude and latitude of Airbnb listings are often disregarded. This project focuses on Airbnb price prediction using the most recent (Sep 2021) Airbnb data in Lisbon. Using Google Maps API, the original dataset was enriched with information on the number of ATMs, metro stations, bars and discos within a maximum radius of 1 km. Also, using the geodesic distance, the distance to the airport and the nearest attraction were computed for each listing. A Linear Regression and a Gradient Boosting algorithm were compared based on the original Airbnb dataset and the extended dataset to examine the impact of new features that have been identified. According to the results, all models perform better when the new features are included. The best results are achieved with the Gradient Boosting with the extended data, with an MAE of 0. 3102 and an adjusted R-squared of 0.4633.O Airbnb é uma plataforma online que fornece alojamento de curto prazo. Desde a sua criação em 2008, já ofereceu 7 milhões de residências e quartos em mais de 81.000 cidades, em 191 países. A previsão de preços do Aibnb é uma tarefa valiosa tanto para hóspedes como para anfitriões. No geral, estes modelos de previsão podem oferecer ao anfitrião o preço ideal que deve ser cobrado pelo alojamento. Do lado do consumidor, ajudará os hóspedes a determinar se o preço do anúncio é justo. Muitos estudos já abordaram este tema, no entanto, a longitude e a latitude são frequentemente desconsideradas. Este projeto foca-se na previsão de preços do Airbnb em Lisboa usando os dados mais recentes (setembro de 2021). Usando a API do Google Maps, o conjunto de dados original foi ampliado adicionando colunas com o número de ATMs, estações de metro, bares e discotecas num raio máximo de 1 km. Além disso, usando a distância geodésica, a distância até o aeroporto e até à atração mais próxima foram calculadas. Os resultados de uma regressão linear e de um Gradient Boosting, com base no conjunto de dados original do Airbnb e no conjunto de dados alargado são comparados para examinar o impacto das novas variáveis. De acordo com os resultados, todos os modelos apresentam melhor desempenho quando as novas variáveis são incluídas. Os melhores resultados são obtidos com o Gradient Boosting com os dados alargados, com um MAE 0,3102 e um adjusted R-squared de 0,4633.Guedes, AnaVeritati - Repositório Institucional da Universidade Católica PortuguesaNunes, Madalena Ribeiro dos Santos Pais2023-06-26T09:01:28Z2023-01-272023-012023-01-27T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10400.14/41431TID:203253175enginfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-07-12T17:47:00Zoai:repositorio.ucp.pt:10400.14/41431Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T18:34:07.458785Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Predicting and explaining Airbnb prices in Lisbon : machine learning approach
title Predicting and explaining Airbnb prices in Lisbon : machine learning approach
spellingShingle Predicting and explaining Airbnb prices in Lisbon : machine learning approach
Nunes, Madalena Ribeiro dos Santos Pais
Airbnb
Machine learning
Price prediction
xAI
Regression
Gradient boosting
Previsão de preços
Regressão
Domínio/Área Científica::Ciências Sociais::Economia e Gestão
title_short Predicting and explaining Airbnb prices in Lisbon : machine learning approach
title_full Predicting and explaining Airbnb prices in Lisbon : machine learning approach
title_fullStr Predicting and explaining Airbnb prices in Lisbon : machine learning approach
title_full_unstemmed Predicting and explaining Airbnb prices in Lisbon : machine learning approach
title_sort Predicting and explaining Airbnb prices in Lisbon : machine learning approach
author Nunes, Madalena Ribeiro dos Santos Pais
author_facet Nunes, Madalena Ribeiro dos Santos Pais
author_role author
dc.contributor.none.fl_str_mv Guedes, Ana
Veritati - Repositório Institucional da Universidade Católica Portuguesa
dc.contributor.author.fl_str_mv Nunes, Madalena Ribeiro dos Santos Pais
dc.subject.por.fl_str_mv Airbnb
Machine learning
Price prediction
xAI
Regression
Gradient boosting
Previsão de preços
Regressão
Domínio/Área Científica::Ciências Sociais::Economia e Gestão
topic Airbnb
Machine learning
Price prediction
xAI
Regression
Gradient boosting
Previsão de preços
Regressão
Domínio/Área Científica::Ciências Sociais::Economia e Gestão
description Airbnb is an online platform that provides listing and arrangement for short-term local home renting services. Since its establishment in 2008, it has offered 7 million homes and rooms in more than 81,000 cities throughout 191 countries. Airbnb price prediction is a valuable and important task both for guests and hosts. Overall, for practical applications, these models can give a host an optimal price they should charge for their new listing. On the consumer side, this will help travellers determine whether the listing price they see is fair. Much research has been done in this field; however, the longitude and latitude of Airbnb listings are often disregarded. This project focuses on Airbnb price prediction using the most recent (Sep 2021) Airbnb data in Lisbon. Using Google Maps API, the original dataset was enriched with information on the number of ATMs, metro stations, bars and discos within a maximum radius of 1 km. Also, using the geodesic distance, the distance to the airport and the nearest attraction were computed for each listing. A Linear Regression and a Gradient Boosting algorithm were compared based on the original Airbnb dataset and the extended dataset to examine the impact of new features that have been identified. According to the results, all models perform better when the new features are included. The best results are achieved with the Gradient Boosting with the extended data, with an MAE of 0. 3102 and an adjusted R-squared of 0.4633.
publishDate 2023
dc.date.none.fl_str_mv 2023-06-26T09:01:28Z
2023-01-27
2023-01
2023-01-27T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.14/41431
TID:203253175
url http://hdl.handle.net/10400.14/41431
identifier_str_mv TID:203253175
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799132067608395776