Building a predictive lead scoring model for contact prioritization : the case of HUUB

Pereira, Rita Mafalda Magalhães

Building a predictive lead scoring model for contact prioritization : the case of HUUB

Detalhes bibliográficos
Autor(a) principal:	Pereira, Rita Mafalda Magalhães
Data de Publicação:	2021
Tipo de documento:	Dissertação
Idioma:	eng
Título da fonte:	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo:	http://hdl.handle.net/10400.14/34877
Resumo:	In the last decades, machine learning has become quite popular for solving business problems, as it often delivers high-quality and efficient solutions. Moreover, the amount of data collected by companies has grown substantially, which has contributed to this trend. Companies do not have enough resources to contact every lead, so contact prioritization is essential. Lead scoring supports this task, by assigning a value to each lead based on his actions or characteristics. Even though it is expected that lead scoring contributes to higher conversion rates, there is still very few literature on how to use machine learning to automate this process. This dissertation shows how to combine historical data from Customer Relationship Management platforms and supervised learning to develop a lead scoring model for companies. The approach followed is based on the CRISP-DM method, where several tools were used, such as HubSpot, Microsoft Power BI and RStudio. The classification model proposed is a decision tree that predicts the leads’ conversion outcome (Won or Postpone), developed using the CART algorithm and data from a logistics company – HUUB. The main findings of this project conclude that machine learning can be used to develop a lead scoring model to perform contact prioritization. However, there are several factors, especially data-related, that should be taken into consideration, since they may impact the model’s performance. Lastly, a suggestion for future research is to develop an experiment to compare the results of manual and automated lead scoring, to assess if machine learning actually provides a superior alternative to the manual approach.

Metadados do item

id	RCAP_75ef32708cd68ab26fe934e927941a75
oai_identifier_str	oai:repositorio.ucp.pt:10400.14/34877
network_acronym_str	RCAP
network_name_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str	7160
spelling	Building a predictive lead scoring model for contact prioritization : the case of HUUBMarketing automationLead scoringMachine learningContact prioritizationAutomatização do marketingPriorização de contactosDomínio/Área Científica::Ciências Sociais::Economia e GestãoIn the last decades, machine learning has become quite popular for solving business problems, as it often delivers high-quality and efficient solutions. Moreover, the amount of data collected by companies has grown substantially, which has contributed to this trend. Companies do not have enough resources to contact every lead, so contact prioritization is essential. Lead scoring supports this task, by assigning a value to each lead based on his actions or characteristics. Even though it is expected that lead scoring contributes to higher conversion rates, there is still very few literature on how to use machine learning to automate this process. This dissertation shows how to combine historical data from Customer Relationship Management platforms and supervised learning to develop a lead scoring model for companies. The approach followed is based on the CRISP-DM method, where several tools were used, such as HubSpot, Microsoft Power BI and RStudio. The classification model proposed is a decision tree that predicts the leads’ conversion outcome (Won or Postpone), developed using the CART algorithm and data from a logistics company – HUUB. The main findings of this project conclude that machine learning can be used to develop a lead scoring model to perform contact prioritization. However, there are several factors, especially data-related, that should be taken into consideration, since they may impact the model’s performance. Lastly, a suggestion for future research is to develop an experiment to compare the results of manual and automated lead scoring, to assess if machine learning actually provides a superior alternative to the manual approach.Nas últimas décadas, o machine learning tornou-se bastante popular para resolver problemas organizacionais, já que tende a produzir soluções eficientes e de alta qualidade. Adicionalmente, a quantidade de dados colecionados pelas empresas cresceu substancialmente, o que contribuiu para esta tendência. As empresas não têm recursos suficientes para contactar todos os leads, pelo que é essencial priorizá-los. O lead scoring apoia esta tarefa, ao atribuir um valor para cada lead baseado nas suas ações ou características. Embora seja expectável que o lead scoring contribua para melhores taxas de conversão, ainda é escassa a literatura acerca da automatização deste processo através do machine learning. Esta dissertação expõe como combinar supervised learning e dados históricos de sistemas de Customer Relationship Management para desenvolver um modelo de lead scoring para empresas. A abordagem baseia-se no método CRISP-DM, onde diversas ferramentas foram usadas, nomeadamente o HubSpot, o Microsoft Power BI e o RStudio. O modelo de classificação proposto é uma árvore de decisão que prevê o desfecho de conversão dos leads, desenvolvido com o algoritmo CART e dados de uma empresa de logística – a HUUB. As principais descobertas deste projeto concluem que é viável utilizar o machine learning para desenvolver um modelo de lead scoring para priorizar os contactos. Contudo, há fatores que devem ser tidos em conta, especialmente relacionados com os dados, já que podem impactar o desempenho do modelo. Por fim, sugere-se para pesquisa futura o desenvolvimento de um estudo experimental que compare os resultados do lead scoring automatizado e manual, de forma a avaliar se o machine learning é de facto a melhor alternativa.Andrade, António Manuel Valente deVeritati - Repositório Institucional da Universidade Católica PortuguesaPereira, Rita Mafalda Magalhães2021-09-15T11:27:21Z2021-07-132021-052021-07-13T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10400.14/34877TID:202750051enginfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-07-12T17:40:24Zoai:repositorio.ucp.pt:10400.14/34877Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T18:28:16.934234Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv	Building a predictive lead scoring model for contact prioritization : the case of HUUB
title	Building a predictive lead scoring model for contact prioritization : the case of HUUB
spellingShingle	Building a predictive lead scoring model for contact prioritization : the case of HUUB Pereira, Rita Mafalda Magalhães Marketing automation Lead scoring Machine learning Contact prioritization Automatização do marketing Priorização de contactos Domínio/Área Científica::Ciências Sociais::Economia e Gestão
title_short	Building a predictive lead scoring model for contact prioritization : the case of HUUB
title_full	Building a predictive lead scoring model for contact prioritization : the case of HUUB
title_fullStr	Building a predictive lead scoring model for contact prioritization : the case of HUUB
title_full_unstemmed	Building a predictive lead scoring model for contact prioritization : the case of HUUB
title_sort	Building a predictive lead scoring model for contact prioritization : the case of HUUB
author	Pereira, Rita Mafalda Magalhães
author_facet	Pereira, Rita Mafalda Magalhães
author_role	author
dc.contributor.none.fl_str_mv	Andrade, António Manuel Valente de Veritati - Repositório Institucional da Universidade Católica Portuguesa
dc.contributor.author.fl_str_mv	Pereira, Rita Mafalda Magalhães
dc.subject.por.fl_str_mv	Marketing automation Lead scoring Machine learning Contact prioritization Automatização do marketing Priorização de contactos Domínio/Área Científica::Ciências Sociais::Economia e Gestão
topic	Marketing automation Lead scoring Machine learning Contact prioritization Automatização do marketing Priorização de contactos Domínio/Área Científica::Ciências Sociais::Economia e Gestão
description	In the last decades, machine learning has become quite popular for solving business problems, as it often delivers high-quality and efficient solutions. Moreover, the amount of data collected by companies has grown substantially, which has contributed to this trend. Companies do not have enough resources to contact every lead, so contact prioritization is essential. Lead scoring supports this task, by assigning a value to each lead based on his actions or characteristics. Even though it is expected that lead scoring contributes to higher conversion rates, there is still very few literature on how to use machine learning to automate this process. This dissertation shows how to combine historical data from Customer Relationship Management platforms and supervised learning to develop a lead scoring model for companies. The approach followed is based on the CRISP-DM method, where several tools were used, such as HubSpot, Microsoft Power BI and RStudio. The classification model proposed is a decision tree that predicts the leads’ conversion outcome (Won or Postpone), developed using the CART algorithm and data from a logistics company – HUUB. The main findings of this project conclude that machine learning can be used to develop a lead scoring model to perform contact prioritization. However, there are several factors, especially data-related, that should be taken into consideration, since they may impact the model’s performance. Lastly, a suggestion for future research is to develop an experiment to compare the results of manual and automated lead scoring, to assess if machine learning actually provides a superior alternative to the manual approach.
publishDate	2021
dc.date.none.fl_str_mv	2021-09-15T11:27:21Z 2021-07-13 2021-05 2021-07-13T00:00:00Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://hdl.handle.net/10400.14/34877 TID:202750051
url	http://hdl.handle.net/10400.14/34877
identifier_str_mv	TID:202750051
dc.language.iso.fl_str_mv	eng
language	eng
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.source.none.fl_str_mv	reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP
instname_str	Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_	1799132001447444480

Building a predictive lead scoring model for contact prioritization : the case of HUUB

Registros relacionados