Multi-label semi-supervised classification through optimum-path forest

Detalhes bibliográficos
Autor(a) principal: Amorim, Willian P.
Data de Publicação: 2018
Outros Autores: Falcao, Alexandre X., Papa, Joao P. [UNESP]
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da UNESP
Texto Completo: http://dx.doi.org/10.1016/j.ins.2018.06.067
http://hdl.handle.net/11449/164684
Resumo: Multi-label classification consists of assigning one or multiple classes to each sample in a given dataset. However, the project of a multi-label classifier is usually limited to a small number of supervised samples as compared to the number of all possible label combinations. This scenario favors semi-supervised learning methods, which can cope with the absence of supervised samples by adding unsupervised ones to the training set. Recently, we proposed a semi-supervised learning method based on optimum connectivity for single-label classification. In this work, we extend it for multi-label classification with considerable effectiveness gain. After a single-label data transformation, the method propagates labels from supervised to unsupervised samples, as in the original approach, by assuming that samples from the same class are more closely connected through sequences of nearby samples than samples from distinct classes. Given that the procedure is more reliable in high-density regions of the feature space, an additional step repropagates labels from the maxima of a probability density function to correct possible labeling errors from the previous step. Finally, the data transformation is reversed to obtain multiple labels per sample. The new approach is experimentally validated on several datasets in comparison with state-of-the-art methods. (C) 2018 Elsevier Inc. All rights reserved.
id UNSP_4ac9c1c7d8e175c6c8714636dca96bb9
oai_identifier_str oai:repositorio.unesp.br:11449/164684
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling Multi-label semi-supervised classification through optimum-path forestSemi-supervised learningMulti-label assignmentOptimum-path forest classifiersMulti-label classification consists of assigning one or multiple classes to each sample in a given dataset. However, the project of a multi-label classifier is usually limited to a small number of supervised samples as compared to the number of all possible label combinations. This scenario favors semi-supervised learning methods, which can cope with the absence of supervised samples by adding unsupervised ones to the training set. Recently, we proposed a semi-supervised learning method based on optimum connectivity for single-label classification. In this work, we extend it for multi-label classification with considerable effectiveness gain. After a single-label data transformation, the method propagates labels from supervised to unsupervised samples, as in the original approach, by assuming that samples from the same class are more closely connected through sequences of nearby samples than samples from distinct classes. Given that the procedure is more reliable in high-density regions of the feature space, an additional step repropagates labels from the maxima of a probability density function to correct possible labeling errors from the previous step. Finally, the data transformation is reversed to obtain multiple labels per sample. The new approach is experimentally validated on several datasets in comparison with state-of-the-art methods. (C) 2018 Elsevier Inc. All rights reserved.Fundect-MSConselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)Univ Fed Mato Grosso do Sul, Inst Comp, Campo Grande, MS, BrazilUniv Estadual Campinas, Inst Comp, Campinas, SP, BrazilSao Paulo State Univ, Dept Comp, Bauru, SP, BrazilSao Paulo State Univ, Dept Comp, Bauru, SP, BrazilCNPq: 303673/2010-9CNPq: 479070/2013-0CNPq: 302970/2014-2CNPq: 303182/2011-3CNPq: 470571/2013-6CNPq: 306166/2014-3FAPESP: 2013/20387-7FAPESP: 2014/12236-1FAPESP: 2014/16250-9Elsevier B.V.Universidade Federal de Mato Grosso do Sul (UFMS)Universidade Estadual de Campinas (UNICAMP)Universidade Estadual Paulista (Unesp)Amorim, Willian P.Falcao, Alexandre X.Papa, Joao P. [UNESP]2018-11-26T17:55:36Z2018-11-26T17:55:36Z2018-10-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/article86-104application/pdfhttp://dx.doi.org/10.1016/j.ins.2018.06.067Information Sciences. New York: Elsevier Science Inc, v. 465, p. 86-104, 2018.0020-0255http://hdl.handle.net/11449/16468410.1016/j.ins.2018.06.067WOS:000445713900006WOS000445713900006.pdfWeb of Sciencereponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengInformation Sciencesinfo:eu-repo/semantics/openAccess2024-04-23T16:11:00Zoai:repositorio.unesp.br:11449/164684Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T19:58:07.197739Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv Multi-label semi-supervised classification through optimum-path forest
title Multi-label semi-supervised classification through optimum-path forest
spellingShingle Multi-label semi-supervised classification through optimum-path forest
Amorim, Willian P.
Semi-supervised learning
Multi-label assignment
Optimum-path forest classifiers
title_short Multi-label semi-supervised classification through optimum-path forest
title_full Multi-label semi-supervised classification through optimum-path forest
title_fullStr Multi-label semi-supervised classification through optimum-path forest
title_full_unstemmed Multi-label semi-supervised classification through optimum-path forest
title_sort Multi-label semi-supervised classification through optimum-path forest
author Amorim, Willian P.
author_facet Amorim, Willian P.
Falcao, Alexandre X.
Papa, Joao P. [UNESP]
author_role author
author2 Falcao, Alexandre X.
Papa, Joao P. [UNESP]
author2_role author
author
dc.contributor.none.fl_str_mv Universidade Federal de Mato Grosso do Sul (UFMS)
Universidade Estadual de Campinas (UNICAMP)
Universidade Estadual Paulista (Unesp)
dc.contributor.author.fl_str_mv Amorim, Willian P.
Falcao, Alexandre X.
Papa, Joao P. [UNESP]
dc.subject.por.fl_str_mv Semi-supervised learning
Multi-label assignment
Optimum-path forest classifiers
topic Semi-supervised learning
Multi-label assignment
Optimum-path forest classifiers
description Multi-label classification consists of assigning one or multiple classes to each sample in a given dataset. However, the project of a multi-label classifier is usually limited to a small number of supervised samples as compared to the number of all possible label combinations. This scenario favors semi-supervised learning methods, which can cope with the absence of supervised samples by adding unsupervised ones to the training set. Recently, we proposed a semi-supervised learning method based on optimum connectivity for single-label classification. In this work, we extend it for multi-label classification with considerable effectiveness gain. After a single-label data transformation, the method propagates labels from supervised to unsupervised samples, as in the original approach, by assuming that samples from the same class are more closely connected through sequences of nearby samples than samples from distinct classes. Given that the procedure is more reliable in high-density regions of the feature space, an additional step repropagates labels from the maxima of a probability density function to correct possible labeling errors from the previous step. Finally, the data transformation is reversed to obtain multiple labels per sample. The new approach is experimentally validated on several datasets in comparison with state-of-the-art methods. (C) 2018 Elsevier Inc. All rights reserved.
publishDate 2018
dc.date.none.fl_str_mv 2018-11-26T17:55:36Z
2018-11-26T17:55:36Z
2018-10-01
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1016/j.ins.2018.06.067
Information Sciences. New York: Elsevier Science Inc, v. 465, p. 86-104, 2018.
0020-0255
http://hdl.handle.net/11449/164684
10.1016/j.ins.2018.06.067
WOS:000445713900006
WOS000445713900006.pdf
url http://dx.doi.org/10.1016/j.ins.2018.06.067
http://hdl.handle.net/11449/164684
identifier_str_mv Information Sciences. New York: Elsevier Science Inc, v. 465, p. 86-104, 2018.
0020-0255
10.1016/j.ins.2018.06.067
WOS:000445713900006
WOS000445713900006.pdf
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Information Sciences
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv 86-104
application/pdf
dc.publisher.none.fl_str_mv Elsevier B.V.
publisher.none.fl_str_mv Elsevier B.V.
dc.source.none.fl_str_mv Web of Science
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_ 1808129144636047360