Adaptive Learning Process for the Evolution of Ontology-Described Classification Model in Big Data Context
Autor(a) principal: | |
---|---|
Data de Publicação: | 2016 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10400.22/10074 |
Resumo: | One of the biggest challenges in Big Data is to exploit value from large volumes of variable and changing data. For this, one must focus on analyzing the data in these Big Data sources and classify the data items according to a domain model (e.g. an ontology). To automatically classify unstructured text documents according to an ontology, a hierarchical multi-label classification process called Semantic HMC was proposed. This process uses ontologies to describe the classification model. To prevent cold start and user overload, the classification process automatically learns the ontology-described classification model from a very large set of unstructured text documents. However, data is always being generated and its statistical properties can change over time. In order to learn in such environment, the classification processes must handle streams of non-stationary data to adapt the classification model. This paper proposes a new adaptive learning process to consistently adapt the ontologydescribed classification model according to a non-stationary stream of unstructured text data in Big Data context. The adaptive process is then instantiated for the specific case of of the previously proposed Semantic HMC. |
id |
RCAP_1f57a98dfe69249b27f311bba6b0f657 |
---|---|
oai_identifier_str |
oai:recipp.ipp.pt:10400.22/10074 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Adaptive Learning Process for the Evolution of Ontology-Described Classification Model in Big Data ContextMaintenanceMulti-label classificationAdaptive learningOntologyMachine learningOne of the biggest challenges in Big Data is to exploit value from large volumes of variable and changing data. For this, one must focus on analyzing the data in these Big Data sources and classify the data items according to a domain model (e.g. an ontology). To automatically classify unstructured text documents according to an ontology, a hierarchical multi-label classification process called Semantic HMC was proposed. This process uses ontologies to describe the classification model. To prevent cold start and user overload, the classification process automatically learns the ontology-described classification model from a very large set of unstructured text documents. However, data is always being generated and its statistical properties can change over time. In order to learn in such environment, the classification processes must handle streams of non-stationary data to adapt the classification model. This paper proposes a new adaptive learning process to consistently adapt the ontologydescribed classification model according to a non-stationary stream of unstructured text data in Big Data context. The adaptive process is then instantiated for the specific case of of the previously proposed Semantic HMC.Institute of Electrical and Electronics EngineersRepositório Científico do Instituto Politécnico do PortoPeixoto, RafaelCruz, ChristopheSilva, Nuno20162117-01-01T00:00:00Z2016-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.22/10074eng10.1109/SAI.2016.7556031metadata only accessinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-03-13T12:51:38Zoai:recipp.ipp.pt:10400.22/10074Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T17:30:35.445781Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Adaptive Learning Process for the Evolution of Ontology-Described Classification Model in Big Data Context |
title |
Adaptive Learning Process for the Evolution of Ontology-Described Classification Model in Big Data Context |
spellingShingle |
Adaptive Learning Process for the Evolution of Ontology-Described Classification Model in Big Data Context Peixoto, Rafael Maintenance Multi-label classification Adaptive learning Ontology Machine learning |
title_short |
Adaptive Learning Process for the Evolution of Ontology-Described Classification Model in Big Data Context |
title_full |
Adaptive Learning Process for the Evolution of Ontology-Described Classification Model in Big Data Context |
title_fullStr |
Adaptive Learning Process for the Evolution of Ontology-Described Classification Model in Big Data Context |
title_full_unstemmed |
Adaptive Learning Process for the Evolution of Ontology-Described Classification Model in Big Data Context |
title_sort |
Adaptive Learning Process for the Evolution of Ontology-Described Classification Model in Big Data Context |
author |
Peixoto, Rafael |
author_facet |
Peixoto, Rafael Cruz, Christophe Silva, Nuno |
author_role |
author |
author2 |
Cruz, Christophe Silva, Nuno |
author2_role |
author author |
dc.contributor.none.fl_str_mv |
Repositório Científico do Instituto Politécnico do Porto |
dc.contributor.author.fl_str_mv |
Peixoto, Rafael Cruz, Christophe Silva, Nuno |
dc.subject.por.fl_str_mv |
Maintenance Multi-label classification Adaptive learning Ontology Machine learning |
topic |
Maintenance Multi-label classification Adaptive learning Ontology Machine learning |
description |
One of the biggest challenges in Big Data is to exploit value from large volumes of variable and changing data. For this, one must focus on analyzing the data in these Big Data sources and classify the data items according to a domain model (e.g. an ontology). To automatically classify unstructured text documents according to an ontology, a hierarchical multi-label classification process called Semantic HMC was proposed. This process uses ontologies to describe the classification model. To prevent cold start and user overload, the classification process automatically learns the ontology-described classification model from a very large set of unstructured text documents. However, data is always being generated and its statistical properties can change over time. In order to learn in such environment, the classification processes must handle streams of non-stationary data to adapt the classification model. This paper proposes a new adaptive learning process to consistently adapt the ontologydescribed classification model according to a non-stationary stream of unstructured text data in Big Data context. The adaptive process is then instantiated for the specific case of of the previously proposed Semantic HMC. |
publishDate |
2016 |
dc.date.none.fl_str_mv |
2016 2016-01-01T00:00:00Z 2117-01-01T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.22/10074 |
url |
http://hdl.handle.net/10400.22/10074 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
10.1109/SAI.2016.7556031 |
dc.rights.driver.fl_str_mv |
metadata only access info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
metadata only access |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Institute of Electrical and Electronics Engineers |
publisher.none.fl_str_mv |
Institute of Electrical and Electronics Engineers |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1817553780069105664 |