Improving the quality of predictive models in small data GSDOT
Autor(a) principal: | |
---|---|
Data de Publicação: | 2022 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10362/136541 |
Resumo: | Douzas, G., Lechleitner, M., & Bacao, F. (2022). Improving the quality of predictive models in small data GSDOT: A new algorithm for generating synthetic data. PLoS ONE, 17(4), 1-15. [e0265626]. https://doi.org/10.1371/journal.pone.0265626 |
id |
RCAP_fd01fdac5a9de8843dcc2f36a97530b5 |
---|---|
oai_identifier_str |
oai:run.unl.pt:10362/136541 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Improving the quality of predictive models in small data GSDOTA new algorithm for generating synthetic dataGeneralDouzas, G., Lechleitner, M., & Bacao, F. (2022). Improving the quality of predictive models in small data GSDOT: A new algorithm for generating synthetic data. PLoS ONE, 17(4), 1-15. [e0265626]. https://doi.org/10.1371/journal.pone.0265626In the age of the data deluge there are still many domains and applications restricted to the use of small datasets. The ability to harness these small datasets to solve problems through the use of supervised learning methods can have a significant impact in many important areas. The insufficient size of training data usually results in unsatisfactory performance of machine learning algorithms. The current research work aims to contribute to mitigate the small data problem through the creation of artificial instances, which are added to the training process. The proposed algorithm, Geometric Small Data Oversampling Technique, uses geometric regions around existing samples to generate new high quality instances. Experimental results show a significant improvement in accuracy when compared with the use of the initial small dataset as well as other popular artificial data generation techniques.Information Management Research Center (MagIC) - NOVA Information Management SchoolNOVA Information Management School (NOVA IMS)RUNDouzas, GeorgiosLechleitner, MariaBacao, Fernando2022-04-16T22:38:24Z2022-04-072022-04-07T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/article15application/pdfhttp://hdl.handle.net/10362/136541eng1932-6203PURE: 43275911https://doi.org/10.1371/journal.pone.0265626info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-03-11T05:14:36Zoai:run.unl.pt:10362/136541Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T03:48:43.875261Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Improving the quality of predictive models in small data GSDOT A new algorithm for generating synthetic data |
title |
Improving the quality of predictive models in small data GSDOT |
spellingShingle |
Improving the quality of predictive models in small data GSDOT Douzas, Georgios General |
title_short |
Improving the quality of predictive models in small data GSDOT |
title_full |
Improving the quality of predictive models in small data GSDOT |
title_fullStr |
Improving the quality of predictive models in small data GSDOT |
title_full_unstemmed |
Improving the quality of predictive models in small data GSDOT |
title_sort |
Improving the quality of predictive models in small data GSDOT |
author |
Douzas, Georgios |
author_facet |
Douzas, Georgios Lechleitner, Maria Bacao, Fernando |
author_role |
author |
author2 |
Lechleitner, Maria Bacao, Fernando |
author2_role |
author author |
dc.contributor.none.fl_str_mv |
Information Management Research Center (MagIC) - NOVA Information Management School NOVA Information Management School (NOVA IMS) RUN |
dc.contributor.author.fl_str_mv |
Douzas, Georgios Lechleitner, Maria Bacao, Fernando |
dc.subject.por.fl_str_mv |
General |
topic |
General |
description |
Douzas, G., Lechleitner, M., & Bacao, F. (2022). Improving the quality of predictive models in small data GSDOT: A new algorithm for generating synthetic data. PLoS ONE, 17(4), 1-15. [e0265626]. https://doi.org/10.1371/journal.pone.0265626 |
publishDate |
2022 |
dc.date.none.fl_str_mv |
2022-04-16T22:38:24Z 2022-04-07 2022-04-07T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10362/136541 |
url |
http://hdl.handle.net/10362/136541 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
1932-6203 PURE: 43275911 https://doi.org/10.1371/journal.pone.0265626 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
15 application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799138087670906880 |