Analysis of the use of repulsors to improve generalization ability in genetic programming : an application to symbolic regression problems
Autor(a) principal: | |
---|---|
Data de Publicação: | 2018 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10362/28931 |
Resumo: | Dissertation presented as the partial requirement for obtaining a Master's degree in Information Management, specialization in Information Systems and Technologies Management |
id |
RCAP_024eb05e4f697ad72c7af586b0c7c67d |
---|---|
oai_identifier_str |
oai:run.unl.pt:10362/28931 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Analysis of the use of repulsors to improve generalization ability in genetic programming : an application to symbolic regression problemsGenetic algorithmGenetic programmingMachine learningRepulsorOverfittingSymbolic regressionDissertation presented as the partial requirement for obtaining a Master's degree in Information Management, specialization in Information Systems and Technologies ManagementGenetic Algorithms are bio-inspired metaheuristics that solve optimization problems; they are evolutionary algorithms that mimic the biological processes of evolution and natural selection to evolve solutions to a given problem. Genetic programming consists of the creation of programs employing GAs to evolve them. In both GA and GP, the algorithm starts with a random solution to a problem that is improved generation after generation building it on the positive traits of the previous generation and discarding traits that do not improve the solution. Repulsors consist of giving the learning algorithm some prior knowledge on the outcome of previous generations on a test set, to try to replace solutions that performed poorly on the data set with better ones. This thesis aims to test and document if the use of repulsors can change the behavior of GP, improve its learning rate and reduce overfitting thus also improving the generalization abilities? Overfitting is a problem in many machine learning algorithms, genetic programming (GP) is also affected by it, one of the objectives of this dissertation is to assess if overfitting can be reduced by using knowledge on the prior behavior of programs generated by GP on a validation data set, and, applying this knowledge to change the selection phase penalizing solutions similar to those that generalized poorly before. These poorly performing solutions will be called repulsors and are the main topic of this dissertation. We developed a program that implemented standard and repulsor based genetic programming. The program was then executed several times over some datasets and collect the results. Finally, the results were compared, and conclusions were taken. The results indicate that the use of repulsors produces better results on the training set and in the test set, this leads us to conclude that the use of repulsors has a positive effect on the performance of GP. The results indicate that the use of repulsors does indeed produce better results. On the training phase, seven out of the nine datasets showed improved algorithm performance when learning. In the test sets, the algorithm presented better generalization ability on five out of nine datasets. Studies could be extended to the use of multi-objective optimization when selecting individuals, and the extension of the repulsor list to other (independent) runs with the same parameters and dataset.Vanneschi, LeonardoRUNCanelhas, Jorge Miguel Silvestre2018-01-24T15:25:37Z2018-01-092018-01-09T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10362/28931TID:201826801enginfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-03-11T04:15:41Zoai:run.unl.pt:10362/28931Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T03:29:06.840964Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Analysis of the use of repulsors to improve generalization ability in genetic programming : an application to symbolic regression problems |
title |
Analysis of the use of repulsors to improve generalization ability in genetic programming : an application to symbolic regression problems |
spellingShingle |
Analysis of the use of repulsors to improve generalization ability in genetic programming : an application to symbolic regression problems Canelhas, Jorge Miguel Silvestre Genetic algorithm Genetic programming Machine learning Repulsor Overfitting Symbolic regression |
title_short |
Analysis of the use of repulsors to improve generalization ability in genetic programming : an application to symbolic regression problems |
title_full |
Analysis of the use of repulsors to improve generalization ability in genetic programming : an application to symbolic regression problems |
title_fullStr |
Analysis of the use of repulsors to improve generalization ability in genetic programming : an application to symbolic regression problems |
title_full_unstemmed |
Analysis of the use of repulsors to improve generalization ability in genetic programming : an application to symbolic regression problems |
title_sort |
Analysis of the use of repulsors to improve generalization ability in genetic programming : an application to symbolic regression problems |
author |
Canelhas, Jorge Miguel Silvestre |
author_facet |
Canelhas, Jorge Miguel Silvestre |
author_role |
author |
dc.contributor.none.fl_str_mv |
Vanneschi, Leonardo RUN |
dc.contributor.author.fl_str_mv |
Canelhas, Jorge Miguel Silvestre |
dc.subject.por.fl_str_mv |
Genetic algorithm Genetic programming Machine learning Repulsor Overfitting Symbolic regression |
topic |
Genetic algorithm Genetic programming Machine learning Repulsor Overfitting Symbolic regression |
description |
Dissertation presented as the partial requirement for obtaining a Master's degree in Information Management, specialization in Information Systems and Technologies Management |
publishDate |
2018 |
dc.date.none.fl_str_mv |
2018-01-24T15:25:37Z 2018-01-09 2018-01-09T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10362/28931 TID:201826801 |
url |
http://hdl.handle.net/10362/28931 |
identifier_str_mv |
TID:201826801 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799137916351414272 |