Multi-label semi-supervised classification through optimum-path forest
Autor(a) principal: | |
---|---|
Data de Publicação: | 2018 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Institucional da UNESP |
Texto Completo: | http://dx.doi.org/10.1016/j.ins.2018.06.067 http://hdl.handle.net/11449/164684 |
Resumo: | Multi-label classification consists of assigning one or multiple classes to each sample in a given dataset. However, the project of a multi-label classifier is usually limited to a small number of supervised samples as compared to the number of all possible label combinations. This scenario favors semi-supervised learning methods, which can cope with the absence of supervised samples by adding unsupervised ones to the training set. Recently, we proposed a semi-supervised learning method based on optimum connectivity for single-label classification. In this work, we extend it for multi-label classification with considerable effectiveness gain. After a single-label data transformation, the method propagates labels from supervised to unsupervised samples, as in the original approach, by assuming that samples from the same class are more closely connected through sequences of nearby samples than samples from distinct classes. Given that the procedure is more reliable in high-density regions of the feature space, an additional step repropagates labels from the maxima of a probability density function to correct possible labeling errors from the previous step. Finally, the data transformation is reversed to obtain multiple labels per sample. The new approach is experimentally validated on several datasets in comparison with state-of-the-art methods. (C) 2018 Elsevier Inc. All rights reserved. |
id |
UNSP_4ac9c1c7d8e175c6c8714636dca96bb9 |
---|---|
oai_identifier_str |
oai:repositorio.unesp.br:11449/164684 |
network_acronym_str |
UNSP |
network_name_str |
Repositório Institucional da UNESP |
repository_id_str |
2946 |
spelling |
Multi-label semi-supervised classification through optimum-path forestSemi-supervised learningMulti-label assignmentOptimum-path forest classifiersMulti-label classification consists of assigning one or multiple classes to each sample in a given dataset. However, the project of a multi-label classifier is usually limited to a small number of supervised samples as compared to the number of all possible label combinations. This scenario favors semi-supervised learning methods, which can cope with the absence of supervised samples by adding unsupervised ones to the training set. Recently, we proposed a semi-supervised learning method based on optimum connectivity for single-label classification. In this work, we extend it for multi-label classification with considerable effectiveness gain. After a single-label data transformation, the method propagates labels from supervised to unsupervised samples, as in the original approach, by assuming that samples from the same class are more closely connected through sequences of nearby samples than samples from distinct classes. Given that the procedure is more reliable in high-density regions of the feature space, an additional step repropagates labels from the maxima of a probability density function to correct possible labeling errors from the previous step. Finally, the data transformation is reversed to obtain multiple labels per sample. The new approach is experimentally validated on several datasets in comparison with state-of-the-art methods. (C) 2018 Elsevier Inc. All rights reserved.Fundect-MSConselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)Univ Fed Mato Grosso do Sul, Inst Comp, Campo Grande, MS, BrazilUniv Estadual Campinas, Inst Comp, Campinas, SP, BrazilSao Paulo State Univ, Dept Comp, Bauru, SP, BrazilSao Paulo State Univ, Dept Comp, Bauru, SP, BrazilCNPq: 303673/2010-9CNPq: 479070/2013-0CNPq: 302970/2014-2CNPq: 303182/2011-3CNPq: 470571/2013-6CNPq: 306166/2014-3FAPESP: 2013/20387-7FAPESP: 2014/12236-1FAPESP: 2014/16250-9Elsevier B.V.Universidade Federal de Mato Grosso do Sul (UFMS)Universidade Estadual de Campinas (UNICAMP)Universidade Estadual Paulista (Unesp)Amorim, Willian P.Falcao, Alexandre X.Papa, Joao P. [UNESP]2018-11-26T17:55:36Z2018-11-26T17:55:36Z2018-10-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/article86-104application/pdfhttp://dx.doi.org/10.1016/j.ins.2018.06.067Information Sciences. New York: Elsevier Science Inc, v. 465, p. 86-104, 2018.0020-0255http://hdl.handle.net/11449/16468410.1016/j.ins.2018.06.067WOS:000445713900006WOS000445713900006.pdfWeb of Sciencereponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengInformation Sciencesinfo:eu-repo/semantics/openAccess2024-04-23T16:11:00Zoai:repositorio.unesp.br:11449/164684Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T19:58:07.197739Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false |
dc.title.none.fl_str_mv |
Multi-label semi-supervised classification through optimum-path forest |
title |
Multi-label semi-supervised classification through optimum-path forest |
spellingShingle |
Multi-label semi-supervised classification through optimum-path forest Amorim, Willian P. Semi-supervised learning Multi-label assignment Optimum-path forest classifiers |
title_short |
Multi-label semi-supervised classification through optimum-path forest |
title_full |
Multi-label semi-supervised classification through optimum-path forest |
title_fullStr |
Multi-label semi-supervised classification through optimum-path forest |
title_full_unstemmed |
Multi-label semi-supervised classification through optimum-path forest |
title_sort |
Multi-label semi-supervised classification through optimum-path forest |
author |
Amorim, Willian P. |
author_facet |
Amorim, Willian P. Falcao, Alexandre X. Papa, Joao P. [UNESP] |
author_role |
author |
author2 |
Falcao, Alexandre X. Papa, Joao P. [UNESP] |
author2_role |
author author |
dc.contributor.none.fl_str_mv |
Universidade Federal de Mato Grosso do Sul (UFMS) Universidade Estadual de Campinas (UNICAMP) Universidade Estadual Paulista (Unesp) |
dc.contributor.author.fl_str_mv |
Amorim, Willian P. Falcao, Alexandre X. Papa, Joao P. [UNESP] |
dc.subject.por.fl_str_mv |
Semi-supervised learning Multi-label assignment Optimum-path forest classifiers |
topic |
Semi-supervised learning Multi-label assignment Optimum-path forest classifiers |
description |
Multi-label classification consists of assigning one or multiple classes to each sample in a given dataset. However, the project of a multi-label classifier is usually limited to a small number of supervised samples as compared to the number of all possible label combinations. This scenario favors semi-supervised learning methods, which can cope with the absence of supervised samples by adding unsupervised ones to the training set. Recently, we proposed a semi-supervised learning method based on optimum connectivity for single-label classification. In this work, we extend it for multi-label classification with considerable effectiveness gain. After a single-label data transformation, the method propagates labels from supervised to unsupervised samples, as in the original approach, by assuming that samples from the same class are more closely connected through sequences of nearby samples than samples from distinct classes. Given that the procedure is more reliable in high-density regions of the feature space, an additional step repropagates labels from the maxima of a probability density function to correct possible labeling errors from the previous step. Finally, the data transformation is reversed to obtain multiple labels per sample. The new approach is experimentally validated on several datasets in comparison with state-of-the-art methods. (C) 2018 Elsevier Inc. All rights reserved. |
publishDate |
2018 |
dc.date.none.fl_str_mv |
2018-11-26T17:55:36Z 2018-11-26T17:55:36Z 2018-10-01 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://dx.doi.org/10.1016/j.ins.2018.06.067 Information Sciences. New York: Elsevier Science Inc, v. 465, p. 86-104, 2018. 0020-0255 http://hdl.handle.net/11449/164684 10.1016/j.ins.2018.06.067 WOS:000445713900006 WOS000445713900006.pdf |
url |
http://dx.doi.org/10.1016/j.ins.2018.06.067 http://hdl.handle.net/11449/164684 |
identifier_str_mv |
Information Sciences. New York: Elsevier Science Inc, v. 465, p. 86-104, 2018. 0020-0255 10.1016/j.ins.2018.06.067 WOS:000445713900006 WOS000445713900006.pdf |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Information Sciences |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
86-104 application/pdf |
dc.publisher.none.fl_str_mv |
Elsevier B.V. |
publisher.none.fl_str_mv |
Elsevier B.V. |
dc.source.none.fl_str_mv |
Web of Science reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP |
instname_str |
Universidade Estadual Paulista (UNESP) |
instacron_str |
UNESP |
institution |
UNESP |
reponame_str |
Repositório Institucional da UNESP |
collection |
Repositório Institucional da UNESP |
repository.name.fl_str_mv |
Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP) |
repository.mail.fl_str_mv |
|
_version_ |
1808129144636047360 |