The use of genetic programming for detecting the incorrect predictions of classification models
Autor(a) principal: | |
---|---|
Data de Publicação: | 2020 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10362/94537 |
Resumo: | Dissertation presented as the partial requirement for obtaining a Master's degree in Data Science and Advanced Analytics |
id |
RCAP_00a5ccfc5fbc1a608ffdac57b9f6a73a |
---|---|
oai_identifier_str |
oai:run.unl.pt:10362/94537 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
The use of genetic programming for detecting the incorrect predictions of classification modelsMachine LearningExplainable AIPost-processingClassificationGenetic ProgrammingErrors PredictionDissertation presented as the partial requirement for obtaining a Master's degree in Data Science and Advanced AnalyticsCompanies around the world use Advanced Analytics to support their decision making process. Traditionally they used Statistics and Business Intelligence for that, but as the technology is advancing, the more complex models are gaining popularity. The main reason for an increasing interest in Machine Learning and Deep Learning models is the fact that they reach a high prediction accuracy. On the second hand with good performance, comes an increasing complexity of the programs. Therefore the new area of Predictors was introduced, it is called Explainable AI. The idea is to create models that can be understood by business users or models to explain other predictions. Therefore we propose the study in which we create a separate model, that will serve as a very er for the machine learning models predictions. This work falls into area of Post-processing of models outputs. For this purpose we select Genetic Programming, that was proven to be successful in various applications. In the scope of this research we investigate if GP can evaluate the prediction of other models. This area of applications was not explored yet, therefore in the study we explore the possibility of evolving an individual for another model validation. We focus on classi cation problems and select 4 machine learning models: logistic regression, decision tree, random forest, perceptron and 3 di erent datasets. This set up is used for assuring that during the research we conclude that the presented idea is universal for di erent problems. The performance of 12 Genetic Programming experiments indicates that in some cases it is possible to create a successful model for errors prediction. During the study we discovered that the performance of GP programs is mostly connected to the dataset on the experiment is conducted. The type of predictive models does not in uence the performance of GP. Although we managed to create good classi ers of errors, during the evolution process we faced the problem of over tting. That is common in problems with imbalanced datasets. The results of the study con rms that GP can be used for the new type of problems and successfully predict errors of Machine Learning Models.Vanneschi, LeonardoRUNNapiórkowska, Adrianna Maria2020-03-19T09:37:42Z2020-02-212020-02-21T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10362/94537TID:202461807enginfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-03-11T04:42:30Zoai:run.unl.pt:10362/94537Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T03:37:59.431888Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
The use of genetic programming for detecting the incorrect predictions of classification models |
title |
The use of genetic programming for detecting the incorrect predictions of classification models |
spellingShingle |
The use of genetic programming for detecting the incorrect predictions of classification models Napiórkowska, Adrianna Maria Machine Learning Explainable AI Post-processing Classification Genetic Programming Errors Prediction |
title_short |
The use of genetic programming for detecting the incorrect predictions of classification models |
title_full |
The use of genetic programming for detecting the incorrect predictions of classification models |
title_fullStr |
The use of genetic programming for detecting the incorrect predictions of classification models |
title_full_unstemmed |
The use of genetic programming for detecting the incorrect predictions of classification models |
title_sort |
The use of genetic programming for detecting the incorrect predictions of classification models |
author |
Napiórkowska, Adrianna Maria |
author_facet |
Napiórkowska, Adrianna Maria |
author_role |
author |
dc.contributor.none.fl_str_mv |
Vanneschi, Leonardo RUN |
dc.contributor.author.fl_str_mv |
Napiórkowska, Adrianna Maria |
dc.subject.por.fl_str_mv |
Machine Learning Explainable AI Post-processing Classification Genetic Programming Errors Prediction |
topic |
Machine Learning Explainable AI Post-processing Classification Genetic Programming Errors Prediction |
description |
Dissertation presented as the partial requirement for obtaining a Master's degree in Data Science and Advanced Analytics |
publishDate |
2020 |
dc.date.none.fl_str_mv |
2020-03-19T09:37:42Z 2020-02-21 2020-02-21T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10362/94537 TID:202461807 |
url |
http://hdl.handle.net/10362/94537 |
identifier_str_mv |
TID:202461807 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799137996527632384 |