A Machine Learning Approach to Predict Health Insurance Claims
Autor(a) principal: | |
---|---|
Data de Publicação: | 2023 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10362/148525 |
Resumo: | Internship Report presented as the partial requirement for obtaining a Master's degree in Data Science and Advanced Analytics, specialization in Business Analytics |
id |
RCAP_73c0b81a2f0e17ebdb5e40f86a8f6f65 |
---|---|
oai_identifier_str |
oai:run.unl.pt:10362/148525 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
A Machine Learning Approach to Predict Health Insurance ClaimsClaim ForecastingEnsembleHealth InsuranceMachine LearningTailor Made PoliciesXGBoostDomínio/Área Científica::Ciências Sociais::Economia e GestãoDomínio/Área Científica::Ciências Naturais::Ciências da Computação e da InformaçãoInternship Report presented as the partial requirement for obtaining a Master's degree in Data Science and Advanced Analytics, specialization in Business AnalyticsRenewing health insurance contracts is, usually, an annual process, in the Tailor Made policies branch. At Multicare, this process starts three months before the end of the clients’ annuity, by estimating the costs of the last quarter using information from the first three. This estimation process is critical to the renewal of insurance contracts, since, if the estimation is too high, the client will overpay for their insurance and might seek more competitive alternatives. In contrast, if the predictions are too low, it will result in losses for the company. This part of the renewal process is currently performed by a time series algorithm, specifically an ARIMA model. This project aims to build a machine learning-based model that will provide more accurate estimations of the claims’ cost and frequency, in the Inpatient coverage, to Multicare. Several algorithms were tested: Linear and Logistic Regressions, Decision Trees, Random Forests, Gradient Boosting and XGBoost; and their results were then compared to the ones of the current ARIMA model. This study showed that a machine learning technique, the XGBoost, is more powerful than the ARIMA, as it projects 9% above the real costs, against the ARIMA’s global error of -25%. These conclusions can lead to changes in Multicare’s approach to predicting claim costs and, consequentially, its way of doing business.António, Nuno Miguel da ConceiçãoRUNCordeiro, Miguel Filipe Martins2023-02-02T08:56:50Z2023-01-232023-01-23T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10362/148525TID:203210417enginfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-03-11T05:30:07Zoai:run.unl.pt:10362/148525Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T03:53:23.801274Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
A Machine Learning Approach to Predict Health Insurance Claims |
title |
A Machine Learning Approach to Predict Health Insurance Claims |
spellingShingle |
A Machine Learning Approach to Predict Health Insurance Claims Cordeiro, Miguel Filipe Martins Claim Forecasting Ensemble Health Insurance Machine Learning Tailor Made Policies XGBoost Domínio/Área Científica::Ciências Sociais::Economia e Gestão Domínio/Área Científica::Ciências Naturais::Ciências da Computação e da Informação |
title_short |
A Machine Learning Approach to Predict Health Insurance Claims |
title_full |
A Machine Learning Approach to Predict Health Insurance Claims |
title_fullStr |
A Machine Learning Approach to Predict Health Insurance Claims |
title_full_unstemmed |
A Machine Learning Approach to Predict Health Insurance Claims |
title_sort |
A Machine Learning Approach to Predict Health Insurance Claims |
author |
Cordeiro, Miguel Filipe Martins |
author_facet |
Cordeiro, Miguel Filipe Martins |
author_role |
author |
dc.contributor.none.fl_str_mv |
António, Nuno Miguel da Conceição RUN |
dc.contributor.author.fl_str_mv |
Cordeiro, Miguel Filipe Martins |
dc.subject.por.fl_str_mv |
Claim Forecasting Ensemble Health Insurance Machine Learning Tailor Made Policies XGBoost Domínio/Área Científica::Ciências Sociais::Economia e Gestão Domínio/Área Científica::Ciências Naturais::Ciências da Computação e da Informação |
topic |
Claim Forecasting Ensemble Health Insurance Machine Learning Tailor Made Policies XGBoost Domínio/Área Científica::Ciências Sociais::Economia e Gestão Domínio/Área Científica::Ciências Naturais::Ciências da Computação e da Informação |
description |
Internship Report presented as the partial requirement for obtaining a Master's degree in Data Science and Advanced Analytics, specialization in Business Analytics |
publishDate |
2023 |
dc.date.none.fl_str_mv |
2023-02-02T08:56:50Z 2023-01-23 2023-01-23T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10362/148525 TID:203210417 |
url |
http://hdl.handle.net/10362/148525 |
identifier_str_mv |
TID:203210417 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799138124536741888 |