Feature Selection using LAID and its Implementation on High-Performance Computing Systems

Detalhes bibliográficos
Autor(a) principal: Morgado, Paulo
Data de Publicação: 2021
Outros Autores: Cavique, Luís
Tipo de documento: Artigo
Idioma: por
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: https://doi.org/10.34627/rcc.v16i0.266
Resumo: From the set of dimensionality reduction techniques, we focus on the feature selection, a possible approach to carry it out is the use of Logical Analysis of Inconsistent Data (LAID). Recently, studies have demonstrated its potential in solving this problem and highlighted its advantages as a robust methodology, easy to interpret, and additionally capable of dealing with inconsistent data. The same studies revealed processing times higher than desired for full use and recommended the execution of algorithms through parallel processing using high-performance computing (HPC). This work represents yet another contribution to this effort by addressing ways of storing data, solutions for parallelizing algorithms, configuring the HPC environment, and finally testing the National Infrastructure for Distributed Computing (INCD) that allowed us to draw the conclusions presented.
id RCAP_b137f4bb33ad888361269cdab8a24463
oai_identifier_str oai:ojs2.journals.uab.pt:article/266
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Feature Selection using LAID and its Implementation on High-Performance Computing SystemsSeleção de atributos usando LAID e sua implementação em sistemas de computação de alto desempenhoFrom the set of dimensionality reduction techniques, we focus on the feature selection, a possible approach to carry it out is the use of Logical Analysis of Inconsistent Data (LAID). Recently, studies have demonstrated its potential in solving this problem and highlighted its advantages as a robust methodology, easy to interpret, and additionally capable of dealing with inconsistent data. The same studies revealed processing times higher than desired for full use and recommended the execution of algorithms through parallel processing using high-performance computing (HPC). This work represents yet another contribution to this effort by addressing ways of storing data, solutions for parallelizing algorithms, configuring the HPC environment, and finally testing the National Infrastructure for Distributed Computing (INCD) that allowed us to draw the conclusions presented.Do conjunto de técnicas de redução de dimensionalidade focamo-nos na seleção de atributos uma possível abordagem para a realizar é a utilização da Análise Lógica de Dados Inconsistentes (LAID). Recentemente vários estudos demostraram as suas potencialidades na resolução deste problema e evidenciaram as suas vantagens como uma metodologia robusta, de fácil interpretação e adicionalmente capaz de lidar com dados inconsistentes. Os mesmos estudos revelaram tempos de processamento acima do desejado para uma utilização plena e preconizaram a execução dos algoritmos através de processamento paralelo com recurso a computação de alto desempenho (HPC). Este trabalho representa mais um contributo nesse esforço ao abordar formas de armazenamento dos dados, soluções de paralelização dos algoritmos, configuração do ambiente HPC e finalmente os testes na Infraestrutura Nacional de Computação Distribuída (INCD) que permitiram extrair as conclusões apresentadas.Universidade Aberta2021-12-08info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttps://doi.org/10.34627/rcc.v16i0.266https://doi.org/10.34627/rcc.v16i0.266Revista de Ciências da Computação; v. 16 (2021); 93-1122182-18011646-633010.34627/rcc.v16i0reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAPporhttps://journals.uab.pt/index.php/rcc/article/view/266https://journals.uab.pt/index.php/rcc/article/view/266/221Direitos de Autor (c) 2021 Universidade Abertahttp://creativecommons.org/licenses/by/4.0info:eu-repo/semantics/openAccessMorgado, PauloCavique, Luís2022-12-23T06:30:14Zoai:ojs2.journals.uab.pt:article/266Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T16:14:02.490536Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Feature Selection using LAID and its Implementation on High-Performance Computing Systems
Seleção de atributos usando LAID e sua implementação em sistemas de computação de alto desempenho
title Feature Selection using LAID and its Implementation on High-Performance Computing Systems
spellingShingle Feature Selection using LAID and its Implementation on High-Performance Computing Systems
Morgado, Paulo
title_short Feature Selection using LAID and its Implementation on High-Performance Computing Systems
title_full Feature Selection using LAID and its Implementation on High-Performance Computing Systems
title_fullStr Feature Selection using LAID and its Implementation on High-Performance Computing Systems
title_full_unstemmed Feature Selection using LAID and its Implementation on High-Performance Computing Systems
title_sort Feature Selection using LAID and its Implementation on High-Performance Computing Systems
author Morgado, Paulo
author_facet Morgado, Paulo
Cavique, Luís
author_role author
author2 Cavique, Luís
author2_role author
dc.contributor.author.fl_str_mv Morgado, Paulo
Cavique, Luís
description From the set of dimensionality reduction techniques, we focus on the feature selection, a possible approach to carry it out is the use of Logical Analysis of Inconsistent Data (LAID). Recently, studies have demonstrated its potential in solving this problem and highlighted its advantages as a robust methodology, easy to interpret, and additionally capable of dealing with inconsistent data. The same studies revealed processing times higher than desired for full use and recommended the execution of algorithms through parallel processing using high-performance computing (HPC). This work represents yet another contribution to this effort by addressing ways of storing data, solutions for parallelizing algorithms, configuring the HPC environment, and finally testing the National Infrastructure for Distributed Computing (INCD) that allowed us to draw the conclusions presented.
publishDate 2021
dc.date.none.fl_str_mv 2021-12-08
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://doi.org/10.34627/rcc.v16i0.266
https://doi.org/10.34627/rcc.v16i0.266
url https://doi.org/10.34627/rcc.v16i0.266
dc.language.iso.fl_str_mv por
language por
dc.relation.none.fl_str_mv https://journals.uab.pt/index.php/rcc/article/view/266
https://journals.uab.pt/index.php/rcc/article/view/266/221
dc.rights.driver.fl_str_mv Direitos de Autor (c) 2021 Universidade Aberta
http://creativecommons.org/licenses/by/4.0
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Direitos de Autor (c) 2021 Universidade Aberta
http://creativecommons.org/licenses/by/4.0
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Aberta
publisher.none.fl_str_mv Universidade Aberta
dc.source.none.fl_str_mv Revista de Ciências da Computação; v. 16 (2021); 93-112
2182-1801
1646-6330
10.34627/rcc.v16i0
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799130593811759104