Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning

Detalhes bibliográficos
Autor(a) principal: Breve, Fabricio [UNESP]
Data de Publicação: 2012
Outros Autores: Zhao, Liang
Tipo de documento: Artigo de conferência
Idioma: eng
Título da fonte: Repositório Institucional da UNESP
Texto Completo: http://dx.doi.org/10.1109/SBRN.2012.16
http://hdl.handle.net/11449/73831
Resumo: Semi-supervised learning is applied to classification problems where only a small portion of the data items is labeled. In these cases, the reliability of the labels is a crucial factor, because mislabeled items may propagate wrong labels to a large portion or even the entire data set. This paper aims to address this problem by presenting a graph-based (network-based) semi-supervised learning method, specifically designed to handle data sets with mislabeled samples. The method uses teams of walking particles, with competitive and cooperative behavior, for label propagation in the network constructed from the input data set. The proposed model is nature-inspired and it incorporates some features to make it robust to a considerable amount of mislabeled data items. Computer simulations show the performance of the method in the presence of different percentage of mislabeled data, in networks of different sizes and average node degree. Importantly, these simulations reveals the existence of the critical points of the mislabeled subset size, below which the network is free of wrong label contamination, but above which the mislabeled samples start to propagate their labels to the rest of the network. Moreover, numerical comparisons have been made among the proposed method and other representative graph-based semi-supervised learning methods using both artificial and real-world data sets. Interestingly, the proposed method has increasing better performance than the others as the percentage of mislabeled samples is getting larger. © 2012 IEEE.
id UNSP_d0de921d82a8385e922b741c3acc9ab2
oai_identifier_str oai:repositorio.unesp.br:11449/73831
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learningComputational intelligenceMachine learningCo-operative behaviorsCompetition and cooperationCritical pointsData itemsData setsDifferent sizesGraph-basedInput datasLabel propagationMislabeled dataNetwork-basedNode degreeNumerical comparisonPrevent error propagationReal world dataSemi-supervised learningSemi-supervised learning methodsArtificial intelligenceBehavioral researchGraphic methodsLearning systemsNeural networksNumerical methodsVirtual realitySupervised learningSemi-supervised learning is applied to classification problems where only a small portion of the data items is labeled. In these cases, the reliability of the labels is a crucial factor, because mislabeled items may propagate wrong labels to a large portion or even the entire data set. This paper aims to address this problem by presenting a graph-based (network-based) semi-supervised learning method, specifically designed to handle data sets with mislabeled samples. The method uses teams of walking particles, with competitive and cooperative behavior, for label propagation in the network constructed from the input data set. The proposed model is nature-inspired and it incorporates some features to make it robust to a considerable amount of mislabeled data items. Computer simulations show the performance of the method in the presence of different percentage of mislabeled data, in networks of different sizes and average node degree. Importantly, these simulations reveals the existence of the critical points of the mislabeled subset size, below which the network is free of wrong label contamination, but above which the mislabeled samples start to propagate their labels to the rest of the network. Moreover, numerical comparisons have been made among the proposed method and other representative graph-based semi-supervised learning methods using both artificial and real-world data sets. Interestingly, the proposed method has increasing better performance than the others as the percentage of mislabeled samples is getting larger. © 2012 IEEE.Institute of Geosciences and Exact Sciences (IGCE) Sao Paulo State University (UNESP), Rio ClaroInstitute of Mathematics and Computer Science (ICMC) University of Sao Paulo (USP), Sao CarlosInstitute of Geosciences and Exact Sciences (IGCE) Sao Paulo State University (UNESP), Rio ClaroUniversidade Estadual Paulista (Unesp)Universidade de São Paulo (USP)Breve, Fabricio [UNESP]Zhao, Liang2014-05-27T11:27:18Z2014-05-27T11:27:18Z2012-12-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObject79-84http://dx.doi.org/10.1109/SBRN.2012.16Proceedings - Brazilian Symposium on Neural Networks, SBRN, p. 79-84.1522-4899http://hdl.handle.net/11449/7383110.1109/SBRN.2012.162-s2.0-8487312123456938600255383270000-0002-1123-9784Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengProceedings - Brazilian Symposium on Neural Networks, SBRNinfo:eu-repo/semantics/openAccess2021-10-23T21:37:44Zoai:repositorio.unesp.br:11449/73831Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T17:05:17.807111Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning
title Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning
spellingShingle Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning
Breve, Fabricio [UNESP]
Computational intelligence
Machine learning
Co-operative behaviors
Competition and cooperation
Critical points
Data items
Data sets
Different sizes
Graph-based
Input datas
Label propagation
Mislabeled data
Network-based
Node degree
Numerical comparison
Prevent error propagation
Real world data
Semi-supervised learning
Semi-supervised learning methods
Artificial intelligence
Behavioral research
Graphic methods
Learning systems
Neural networks
Numerical methods
Virtual reality
Supervised learning
title_short Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning
title_full Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning
title_fullStr Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning
title_full_unstemmed Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning
title_sort Particle competition and cooperation to prevent error propagation from mislabeled data in semi-supervised learning
author Breve, Fabricio [UNESP]
author_facet Breve, Fabricio [UNESP]
Zhao, Liang
author_role author
author2 Zhao, Liang
author2_role author
dc.contributor.none.fl_str_mv Universidade Estadual Paulista (Unesp)
Universidade de São Paulo (USP)
dc.contributor.author.fl_str_mv Breve, Fabricio [UNESP]
Zhao, Liang
dc.subject.por.fl_str_mv Computational intelligence
Machine learning
Co-operative behaviors
Competition and cooperation
Critical points
Data items
Data sets
Different sizes
Graph-based
Input datas
Label propagation
Mislabeled data
Network-based
Node degree
Numerical comparison
Prevent error propagation
Real world data
Semi-supervised learning
Semi-supervised learning methods
Artificial intelligence
Behavioral research
Graphic methods
Learning systems
Neural networks
Numerical methods
Virtual reality
Supervised learning
topic Computational intelligence
Machine learning
Co-operative behaviors
Competition and cooperation
Critical points
Data items
Data sets
Different sizes
Graph-based
Input datas
Label propagation
Mislabeled data
Network-based
Node degree
Numerical comparison
Prevent error propagation
Real world data
Semi-supervised learning
Semi-supervised learning methods
Artificial intelligence
Behavioral research
Graphic methods
Learning systems
Neural networks
Numerical methods
Virtual reality
Supervised learning
description Semi-supervised learning is applied to classification problems where only a small portion of the data items is labeled. In these cases, the reliability of the labels is a crucial factor, because mislabeled items may propagate wrong labels to a large portion or even the entire data set. This paper aims to address this problem by presenting a graph-based (network-based) semi-supervised learning method, specifically designed to handle data sets with mislabeled samples. The method uses teams of walking particles, with competitive and cooperative behavior, for label propagation in the network constructed from the input data set. The proposed model is nature-inspired and it incorporates some features to make it robust to a considerable amount of mislabeled data items. Computer simulations show the performance of the method in the presence of different percentage of mislabeled data, in networks of different sizes and average node degree. Importantly, these simulations reveals the existence of the critical points of the mislabeled subset size, below which the network is free of wrong label contamination, but above which the mislabeled samples start to propagate their labels to the rest of the network. Moreover, numerical comparisons have been made among the proposed method and other representative graph-based semi-supervised learning methods using both artificial and real-world data sets. Interestingly, the proposed method has increasing better performance than the others as the percentage of mislabeled samples is getting larger. © 2012 IEEE.
publishDate 2012
dc.date.none.fl_str_mv 2012-12-01
2014-05-27T11:27:18Z
2014-05-27T11:27:18Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/conferenceObject
format conferenceObject
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1109/SBRN.2012.16
Proceedings - Brazilian Symposium on Neural Networks, SBRN, p. 79-84.
1522-4899
http://hdl.handle.net/11449/73831
10.1109/SBRN.2012.16
2-s2.0-84873121234
5693860025538327
0000-0002-1123-9784
url http://dx.doi.org/10.1109/SBRN.2012.16
http://hdl.handle.net/11449/73831
identifier_str_mv Proceedings - Brazilian Symposium on Neural Networks, SBRN, p. 79-84.
1522-4899
10.1109/SBRN.2012.16
2-s2.0-84873121234
5693860025538327
0000-0002-1123-9784
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Proceedings - Brazilian Symposium on Neural Networks, SBRN
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv 79-84
dc.source.none.fl_str_mv Scopus
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_ 1808128753777246208