Improving lazy attribute selection.
Autor(a) principal: | |
---|---|
Data de Publicação: | 2011 |
Outros Autores: | , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Institucional da UFOP |
Texto Completo: | http://www.repositorio.ufop.br/handle/123456789/4385 |
Resumo: | Attribute selection is a data preprocessing step which aims at identifying relevant attributes for a target data mining task – specifically in this article, the classification task. Previously, we have proposed a new attribute selection strategy – based on a lazy learning approach – which postpones the identification of relevant attributes until an instance is submitted for classification. Experimental results showed the effectiveness of the technique, as in most cases it improved the accuracy of classification, when compared with the analogous eager attribute selection approach performed as a data preprocessing step. However, in the previously proposed approach, the performance of the classifier depends on the number of attributes selected, which is a user-defined parameter. In practice, it may be difficult to select a proper value for this parameter, that is, the value that produces the best performance for the classification task. In this article, aiming to overcome this drawback, we propose two approaches to be used coupled with lazy attribute selection technique: one that tries to identify, in a wrapper-based manner, the appropriate number of attributes to be selected and another that combines, in a voting approach, different numbers of attributes. Experimental results show the effectiveness of the proposed techniques. The assessment of these approaches confirms that the lazy learning paradigm can be compatible with traditional methods and appropriate for a large number of applications. |
id |
UFOP_aae5d2d75b3b2caedcf37133e50966b5 |
---|---|
oai_identifier_str |
oai:repositorio.ufop.br:123456789/4385 |
network_acronym_str |
UFOP |
network_name_str |
Repositório Institucional da UFOP |
repository_id_str |
3233 |
spelling |
Improving lazy attribute selection.Attribute selectionClassificationLazy learningAttribute selection is a data preprocessing step which aims at identifying relevant attributes for a target data mining task – specifically in this article, the classification task. Previously, we have proposed a new attribute selection strategy – based on a lazy learning approach – which postpones the identification of relevant attributes until an instance is submitted for classification. Experimental results showed the effectiveness of the technique, as in most cases it improved the accuracy of classification, when compared with the analogous eager attribute selection approach performed as a data preprocessing step. However, in the previously proposed approach, the performance of the classifier depends on the number of attributes selected, which is a user-defined parameter. In practice, it may be difficult to select a proper value for this parameter, that is, the value that produces the best performance for the classification task. In this article, aiming to overcome this drawback, we propose two approaches to be used coupled with lazy attribute selection technique: one that tries to identify, in a wrapper-based manner, the appropriate number of attributes to be selected and another that combines, in a voting approach, different numbers of attributes. Experimental results show the effectiveness of the proposed techniques. The assessment of these approaches confirms that the lazy learning paradigm can be compatible with traditional methods and appropriate for a large number of applications.2015-01-26T11:34:31Z2015-01-26T11:34:31Z2011info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfPEREIRA, R. B. et al. Improving lazy attribute selection. Journal of Information and Data Management - JIDM, v. 2, n. 3, p. 447-462, out. 2011. Disponível em: <https://seer.lcc.ufmg.br/index.php/jidm/article/view/156/95>. Acesso em: 23 jan. 2015.2178-7107http://www.repositorio.ufop.br/handle/123456789/4385Permission to copy without fee all or part of the material printed in JIDM is granted provided that the copies are not made or distributed for commercial advantage, and that notice is given that copying is by permission of the Sociedade Brasileira de Computação. Fonte: Informação contida no artigo.info:eu-repo/semantics/openAccessPereira, Rafael BarrosPlastino, AlexandreZadrozny, BiancaMerschmann, Luiz Henrique de CamposFreitas, Alex Alvesengreponame:Repositório Institucional da UFOPinstname:Universidade Federal de Ouro Preto (UFOP)instacron:UFOP2019-06-12T17:39:56Zoai:repositorio.ufop.br:123456789/4385Repositório InstitucionalPUBhttp://www.repositorio.ufop.br/oai/requestrepositorio@ufop.edu.bropendoar:32332019-06-12T17:39:56Repositório Institucional da UFOP - Universidade Federal de Ouro Preto (UFOP)false |
dc.title.none.fl_str_mv |
Improving lazy attribute selection. |
title |
Improving lazy attribute selection. |
spellingShingle |
Improving lazy attribute selection. Pereira, Rafael Barros Attribute selection Classification Lazy learning |
title_short |
Improving lazy attribute selection. |
title_full |
Improving lazy attribute selection. |
title_fullStr |
Improving lazy attribute selection. |
title_full_unstemmed |
Improving lazy attribute selection. |
title_sort |
Improving lazy attribute selection. |
author |
Pereira, Rafael Barros |
author_facet |
Pereira, Rafael Barros Plastino, Alexandre Zadrozny, Bianca Merschmann, Luiz Henrique de Campos Freitas, Alex Alves |
author_role |
author |
author2 |
Plastino, Alexandre Zadrozny, Bianca Merschmann, Luiz Henrique de Campos Freitas, Alex Alves |
author2_role |
author author author author |
dc.contributor.author.fl_str_mv |
Pereira, Rafael Barros Plastino, Alexandre Zadrozny, Bianca Merschmann, Luiz Henrique de Campos Freitas, Alex Alves |
dc.subject.por.fl_str_mv |
Attribute selection Classification Lazy learning |
topic |
Attribute selection Classification Lazy learning |
description |
Attribute selection is a data preprocessing step which aims at identifying relevant attributes for a target data mining task – specifically in this article, the classification task. Previously, we have proposed a new attribute selection strategy – based on a lazy learning approach – which postpones the identification of relevant attributes until an instance is submitted for classification. Experimental results showed the effectiveness of the technique, as in most cases it improved the accuracy of classification, when compared with the analogous eager attribute selection approach performed as a data preprocessing step. However, in the previously proposed approach, the performance of the classifier depends on the number of attributes selected, which is a user-defined parameter. In practice, it may be difficult to select a proper value for this parameter, that is, the value that produces the best performance for the classification task. In this article, aiming to overcome this drawback, we propose two approaches to be used coupled with lazy attribute selection technique: one that tries to identify, in a wrapper-based manner, the appropriate number of attributes to be selected and another that combines, in a voting approach, different numbers of attributes. Experimental results show the effectiveness of the proposed techniques. The assessment of these approaches confirms that the lazy learning paradigm can be compatible with traditional methods and appropriate for a large number of applications. |
publishDate |
2011 |
dc.date.none.fl_str_mv |
2011 2015-01-26T11:34:31Z 2015-01-26T11:34:31Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
PEREIRA, R. B. et al. Improving lazy attribute selection. Journal of Information and Data Management - JIDM, v. 2, n. 3, p. 447-462, out. 2011. Disponível em: <https://seer.lcc.ufmg.br/index.php/jidm/article/view/156/95>. Acesso em: 23 jan. 2015. 2178-7107 http://www.repositorio.ufop.br/handle/123456789/4385 |
identifier_str_mv |
PEREIRA, R. B. et al. Improving lazy attribute selection. Journal of Information and Data Management - JIDM, v. 2, n. 3, p. 447-462, out. 2011. Disponível em: <https://seer.lcc.ufmg.br/index.php/jidm/article/view/156/95>. Acesso em: 23 jan. 2015. 2178-7107 |
url |
http://www.repositorio.ufop.br/handle/123456789/4385 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UFOP instname:Universidade Federal de Ouro Preto (UFOP) instacron:UFOP |
instname_str |
Universidade Federal de Ouro Preto (UFOP) |
instacron_str |
UFOP |
institution |
UFOP |
reponame_str |
Repositório Institucional da UFOP |
collection |
Repositório Institucional da UFOP |
repository.name.fl_str_mv |
Repositório Institucional da UFOP - Universidade Federal de Ouro Preto (UFOP) |
repository.mail.fl_str_mv |
repositorio@ufop.edu.br |
_version_ |
1813002840699830272 |