Predictive churn models in vehicle insurance

Bibliographic details
Main author: Bellani, Carolina
Publication date: 2019
Document type: Dissertation
Language: eng
Source title: Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos)
Full text: http://hdl.handle.net/10362/90767
Summary: Internship Report presented as a partial requirement for obtaining a Master's degree in Data Science and Advanced Analytics
Keywords: Supervised learning; Predictive churn models; Machine Learning; Ensemble
Abstract: The goal of this project is to develop a predictive model to reduce customer churn at a company. To that end, the model identifies customers who may be thinking of ending their patronage. It also seeks to identify the reasons behind customers' decisions to leave, so that the company can take appropriate countermeasures. The company in question is Tranquilidade, an insurance company in Portugal, and the project focuses on its vehicle insurance products. Customer churn is calculated for two insurance policies: the compulsory motor (third party liability) policy and the optional Kasko (first party liability) policy. The model uses information the company holds internally on its customers, together with commercial, vehicle and policy details and external information (from the census).
The first step of the analysis was data pre-processing: data cleaning, transformation and reduction (especially of redundancy); in particular, concept hierarchy generation was performed for nominal data. Because the share of churned policies is not comparable with that of active policies, the dataset is unbalanced. To resolve this, an under-sampling technique was used: to force the models to learn to identify churn cases, the majority class was split into samples sized to balance the minority class. To avoid losing information, every majority-class sample was studied together with the minority class. The predictive models used are generalized linear models, random forests and artificial neural networks; parameter tuning was also conducted. A further validation was performed on a recent new sample, without any data leakage.
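The under-sampling scheme described above can be illustrated with a minimal sketch. The abstract does not say how the majority-class samples were drawn, so this assumes a random partition of the majority class into chunks the size of the minority class, each paired with the full minority class. The function name `balanced_subsets`, the label encoding (1 = churn) and the toy data are hypothetical.

```python
import random


def balanced_subsets(labels, minority_label=1, seed=0):
    """Return index lists, each holding all minority samples plus one
    disjoint chunk of the majority class (assumed scheme, see lead-in)."""
    minority = [i for i, y in enumerate(labels) if y == minority_label]
    majority = [i for i, y in enumerate(labels) if y != minority_label]
    if not minority:
        raise ValueError("no minority-class samples found")

    # Shuffle so each chunk is a random sample of the majority class.
    rng = random.Random(seed)
    rng.shuffle(majority)

    size = len(minority)
    subsets = []
    # Step through the majority class in minority-sized chunks; the last
    # chunk may be smaller when the sizes do not divide evenly.
    for start in range(0, len(majority), size):
        chunk = majority[start:start + size]
        subsets.append(sorted(minority + chunk))
    return subsets


# Toy data (hypothetical): 3 churned policies among 12.
labels = [1, 0, 0, 0, 1, 0, 0, 0, 0, 1, 0, 0]
subsets = balanced_subsets(labels)
for k, idx in enumerate(subsets):
    churned = sum(labels[i] for i in idx)
    print(f"subset {k}: {len(idx)} samples, {churned} churned")
```

Each subset is balanced, and across all subsets every majority-class sample is used exactly once, so no majority-class information is discarded. One model could then be trained per subset and the results compared or combined.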
For compulsory motor insurance, the recommended model is an artificial neural network with a first layer of 15 neurons and a second layer of 4 neurons; it achieves an AUC of 68.72%, a sensitivity of 33.14% and a precision of 27%. For Kasko insurance, the suggested model is a random forest of 325 decision trees, with an AUC of 72.58%, a sensitivity of 36.85% and a precision of 31.70%. These AUCs are in line with other predictive churn model results; precision and sensitivity are worse than in telecommunication churn models but comparable with other insurance churn predictions. Beyond classifying churn, the models give some insight into the phenomenon, providing information the company can analyze in order to reduce its customer churn rate. However, some hidden factors could not be accounted for with the information available, such as the competitors' market and client interaction; if these could be integrated, a better prediction could be achieved.
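For readers unfamiliar with the three measures reported above, the following sketch shows how sensitivity, precision and AUC are commonly computed for a binary churn classifier. It uses the standard definitions (sensitivity = TP / (TP + FN), precision = TP / (TP + FP), AUC as the probability that a churned case is scored above a non-churned one); the abstract does not describe the evaluation code, so the function names, the 0.5 threshold and the toy scores are assumptions and do not reproduce the reported figures.

```python
def sensitivity_precision(y_true, y_pred):
    """Sensitivity (recall) and precision for the positive class (1 = churn)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    return sensitivity, precision


def auc(y_true, scores):
    """AUC as the fraction of (churned, non-churned) pairs in which the
    churned case has the higher score; ties count as half."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy example (hypothetical scores, not data from the report).
y_true = [1, 0, 1, 0, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.6, 0.4, 0.3, 0.2, 0.1, 0.7]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

sens, prec = sensitivity_precision(y_true, y_pred)
area = auc(y_true, scores)
print(f"sensitivity={sens:.3f} precision={prec:.3f} AUC={area:.3f}")
```

Sensitivity and precision depend on the chosen threshold, while AUC summarizes ranking quality across all thresholds, which is one reason a model can show a moderate AUC alongside low sensitivity and precision.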
Contributors: Vanneschi, Leonardo; Mendes, Jorge Morais; RUN
Dates: 2019-11-07; 2020-01-06T19:32:25Z
Type: masterThesis (publishedVersion)
Format: application/pdf
Identifier: TID:202358011
Rights: openAccess
Institution: Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação (RCAAP)