Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning
Autor(a) principal: | |
---|---|
Data de Publicação: | 2012 |
Outros Autores: | |
Tipo de documento: | Artigo de conferência |
Idioma: | eng |
Título da fonte: | Repositório Institucional da UNESP |
Texto Completo: | http://dx.doi.org/10.1109/SBRN.2012.16 http://hdl.handle.net/11449/73831 |
Resumo: | Semi-supervised learning is applied to classification problems where only a small portion of the data items is labeled. In these cases, the reliability of the labels is a crucial factor, because mislabeled items may propagate wrong labels to a large portion or even the entire data set. This paper aims to address this problem by presenting a graph-based (network-based) semi-supervised learning method, specifically designed to handle data sets with mislabeled samples. The method uses teams of walking particles, with competitive and cooperative behavior, for label propagation in the network constructed from the input data set. The proposed model is nature-inspired and it incorporates some features to make it robust to a considerable amount of mislabeled data items. Computer simulations show the performance of the method in the presence of different percentage of mislabeled data, in networks of different sizes and average node degree. Importantly, these simulations reveals the existence of the critical points of the mislabeled subset size, below which the network is free of wrong label contamination, but above which the mislabeled samples start to propagate their labels to the rest of the network. Moreover, numerical comparisons have been made among the proposed method and other representative graph-based semi-supervised learning methods using both artificial and real-world data sets. Interestingly, the proposed method has increasing better performance than the others as the percentage of mislabeled samples is getting larger. © 2012 IEEE. |
id |
UNSP_d0de921d82a8385e922b741c3acc9ab2 |
---|---|
oai_identifier_str |
oai:repositorio.unesp.br:11449/73831 |
network_acronym_str |
UNSP |
network_name_str |
Repositório Institucional da UNESP |
repository_id_str |
2946 |
spelling |
Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learningComputational intelligenceMachine learningCo-operative behaviorsCompetition and cooperationCritical pointsData itemsData setsDifferent sizesGraph-basedInput datasLabel propagationMislabeled dataNetwork-basedNode degreeNumerical comparisonPrevent error propagationReal world dataSemi-supervised learningSemi-supervised learning methodsArtificial intelligenceBehavioral researchGraphic methodsLearning systemsNeural networksNumerical methodsVirtual realitySupervised learningSemi-supervised learning is applied to classification problems where only a small portion of the data items is labeled. In these cases, the reliability of the labels is a crucial factor, because mislabeled items may propagate wrong labels to a large portion or even the entire data set. This paper aims to address this problem by presenting a graph-based (network-based) semi-supervised learning method, specifically designed to handle data sets with mislabeled samples. The method uses teams of walking particles, with competitive and cooperative behavior, for label propagation in the network constructed from the input data set. The proposed model is nature-inspired and it incorporates some features to make it robust to a considerable amount of mislabeled data items. Computer simulations show the performance of the method in the presence of different percentage of mislabeled data, in networks of different sizes and average node degree. Importantly, these simulations reveals the existence of the critical points of the mislabeled subset size, below which the network is free of wrong label contamination, but above which the mislabeled samples start to propagate their labels to the rest of the network. Moreover, numerical comparisons have been made among the proposed method and other representative graph-based semi-supervised learning methods using both artificial and real-world data sets. Interestingly, the proposed method has increasing better performance than the others as the percentage of mislabeled samples is getting larger. © 2012 IEEE.Institute of Geosciences and Exact Sciences (IGCE) Sao Paulo State University (UNESP), Rio ClaroInstitute of Mathematics and Computer Science (ICMC) University of Sao Paulo (USP), Sao CarlosInstitute of Geosciences and Exact Sciences (IGCE) Sao Paulo State University (UNESP), Rio ClaroUniversidade Estadual Paulista (Unesp)Universidade de São Paulo (USP)Breve, Fabricio [UNESP]Zhao, Liang2014-05-27T11:27:18Z2014-05-27T11:27:18Z2012-12-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObject79-84http://dx.doi.org/10.1109/SBRN.2012.16Proceedings - Brazilian Symposium on Neural Networks, SBRN, p. 79-84.1522-4899http://hdl.handle.net/11449/7383110.1109/SBRN.2012.162-s2.0-8487312123456938600255383270000-0002-1123-9784Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengProceedings - Brazilian Symposium on Neural Networks, SBRNinfo:eu-repo/semantics/openAccess2021-10-23T21:37:44Zoai:repositorio.unesp.br:11449/73831Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T17:05:17.807111Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false |
dc.title.none.fl_str_mv |
Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning |
title |
Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning |
spellingShingle |
Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning Breve, Fabricio [UNESP] Computational intelligence Machine learning Co-operative behaviors Competition and cooperation Critical points Data items Data sets Different sizes Graph-based Input datas Label propagation Mislabeled data Network-based Node degree Numerical comparison Prevent error propagation Real world data Semi-supervised learning Semi-supervised learning methods Artificial intelligence Behavioral research Graphic methods Learning systems Neural networks Numerical methods Virtual reality Supervised learning |
title_short |
Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning |
title_full |
Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning |
title_fullStr |
Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning |
title_full_unstemmed |
Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning |
title_sort |
Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning |
author |
Breve, Fabricio [UNESP] |
author_facet |
Breve, Fabricio [UNESP] Zhao, Liang |
author_role |
author |
author2 |
Zhao, Liang |
author2_role |
author |
dc.contributor.none.fl_str_mv |
Universidade Estadual Paulista (Unesp) Universidade de São Paulo (USP) |
dc.contributor.author.fl_str_mv |
Breve, Fabricio [UNESP] Zhao, Liang |
dc.subject.por.fl_str_mv |
Computational intelligence Machine learning Co-operative behaviors Competition and cooperation Critical points Data items Data sets Different sizes Graph-based Input datas Label propagation Mislabeled data Network-based Node degree Numerical comparison Prevent error propagation Real world data Semi-supervised learning Semi-supervised learning methods Artificial intelligence Behavioral research Graphic methods Learning systems Neural networks Numerical methods Virtual reality Supervised learning |
topic |
Computational intelligence Machine learning Co-operative behaviors Competition and cooperation Critical points Data items Data sets Different sizes Graph-based Input datas Label propagation Mislabeled data Network-based Node degree Numerical comparison Prevent error propagation Real world data Semi-supervised learning Semi-supervised learning methods Artificial intelligence Behavioral research Graphic methods Learning systems Neural networks Numerical methods Virtual reality Supervised learning |
description |
Semi-supervised learning is applied to classification problems where only a small portion of the data items is labeled. In these cases, the reliability of the labels is a crucial factor, because mislabeled items may propagate wrong labels to a large portion or even the entire data set. This paper aims to address this problem by presenting a graph-based (network-based) semi-supervised learning method, specifically designed to handle data sets with mislabeled samples. The method uses teams of walking particles, with competitive and cooperative behavior, for label propagation in the network constructed from the input data set. The proposed model is nature-inspired and it incorporates some features to make it robust to a considerable amount of mislabeled data items. Computer simulations show the performance of the method in the presence of different percentage of mislabeled data, in networks of different sizes and average node degree. Importantly, these simulations reveals the existence of the critical points of the mislabeled subset size, below which the network is free of wrong label contamination, but above which the mislabeled samples start to propagate their labels to the rest of the network. Moreover, numerical comparisons have been made among the proposed method and other representative graph-based semi-supervised learning methods using both artificial and real-world data sets. Interestingly, the proposed method has increasing better performance than the others as the percentage of mislabeled samples is getting larger. © 2012 IEEE. |
publishDate |
2012 |
dc.date.none.fl_str_mv |
2012-12-01 2014-05-27T11:27:18Z 2014-05-27T11:27:18Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/conferenceObject |
format |
conferenceObject |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://dx.doi.org/10.1109/SBRN.2012.16 Proceedings - Brazilian Symposium on Neural Networks, SBRN, p. 79-84. 1522-4899 http://hdl.handle.net/11449/73831 10.1109/SBRN.2012.16 2-s2.0-84873121234 5693860025538327 0000-0002-1123-9784 |
url |
http://dx.doi.org/10.1109/SBRN.2012.16 http://hdl.handle.net/11449/73831 |
identifier_str_mv |
Proceedings - Brazilian Symposium on Neural Networks, SBRN, p. 79-84. 1522-4899 10.1109/SBRN.2012.16 2-s2.0-84873121234 5693860025538327 0000-0002-1123-9784 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Proceedings - Brazilian Symposium on Neural Networks, SBRN |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
79-84 |
dc.source.none.fl_str_mv |
Scopus reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP |
instname_str |
Universidade Estadual Paulista (UNESP) |
instacron_str |
UNESP |
institution |
UNESP |
reponame_str |
Repositório Institucional da UNESP |
collection |
Repositório Institucional da UNESP |
repository.name.fl_str_mv |
Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP) |
repository.mail.fl_str_mv |
|
_version_ |
1808128753777246208 |