Labeling methods for association rule clustering
Autor(a) principal: | |
---|---|
Data de Publicação: | 2012 |
Outros Autores: | , , |
Tipo de documento: | Artigo de conferência |
Idioma: | eng |
Título da fonte: | Repositório Institucional da UNESP |
Texto Completo: | http://dx.doi.org/10.5220/0003970001050111 http://hdl.handle.net/11449/73568 |
Resumo: | Although association mining has been highlighted in the last years, the huge number of rules that are generated hamper its use. To overcome this problem, many post-processing approaches were suggested, such as clustering, which organizes the rules in groups that contain, somehow, similar knowledge. Nevertheless, clustering can aid the user only if good descriptors be associated with each group. This is a relevant issue, since the labels will provide to the user a view of the topics to be explored, helping to guide its search. This is interesting, for example, when the user doesn't have, a priori, an idea where to start. Thus, the analysis of different labeling methods for association rule clustering is important. Considering the exposed arguments, this paper analyzes some labeling methods through two measures that are proposed. One of them, Precision, measures how much the methods can find labels that represent as accurately as possible the rules contained in its group and Repetition Frequency determines how the labels are distributed along the clusters. As a result, it was possible to identify the methods and the domain organizations with the best performances that can be applied in clusters of association rules. |
id |
UNSP_814f78b4622940aaa3b5582a054aac1f |
---|---|
oai_identifier_str |
oai:repositorio.unesp.br:11449/73568 |
network_acronym_str |
UNSP |
network_name_str |
Repositório Institucional da UNESP |
repository_id_str |
2946 |
spelling |
Labeling methods for association rule clusteringAssociation rulesClusteringLabeling methodsPost-processingAssociation miningDescriptorsPost processingRepetition frequencyInformation systemsAlthough association mining has been highlighted in the last years, the huge number of rules that are generated hamper its use. To overcome this problem, many post-processing approaches were suggested, such as clustering, which organizes the rules in groups that contain, somehow, similar knowledge. Nevertheless, clustering can aid the user only if good descriptors be associated with each group. This is a relevant issue, since the labels will provide to the user a view of the topics to be explored, helping to guide its search. This is interesting, for example, when the user doesn't have, a priori, an idea where to start. Thus, the analysis of different labeling methods for association rule clustering is important. Considering the exposed arguments, this paper analyzes some labeling methods through two measures that are proposed. One of them, Precision, measures how much the methods can find labels that represent as accurately as possible the rules contained in its group and Repetition Frequency determines how the labels are distributed along the clusters. As a result, it was possible to identify the methods and the domain organizations with the best performances that can be applied in clusters of association rules.Instituto de Geociências e Ciências Exatas Universidade Estadual Paulista (UNESP), São PauloInstituto de Ciências Matemáticas e de Computaçã o Universidade de São Paulo, São PauloInstituto de Geociências e Ciências Exatas Universidade Estadual Paulista (UNESP), São PauloUniversidade Estadual Paulista (Unesp)Universidade de São Paulo (USP)De Carvalho, Veronica Oliveira [UNESP]Biondi, Daniel Savoia [UNESP]Dos Santos, Fabiano FernandesRezende, Solange Oliveira2014-05-27T11:26:59Z2014-05-27T11:26:59Z2012-09-10info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObject105-111http://dx.doi.org/10.5220/0003970001050111ICEIS 2012 - Proceedings of the 14th International Conference on Enterprise Information Systems, v. 1 DISI, n. AIDSS/-, p. 105-111, 2012.http://hdl.handle.net/11449/7356810.5220/00039700010501112-s2.0-84865763484Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengICEIS 2012 - Proceedings of the 14th International Conference on Enterprise Information Systemsinfo:eu-repo/semantics/openAccess2021-10-23T21:37:44Zoai:repositorio.unesp.br:11449/73568Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T16:26:06.371867Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false |
dc.title.none.fl_str_mv |
Labeling methods for association rule clustering |
title |
Labeling methods for association rule clustering |
spellingShingle |
Labeling methods for association rule clustering De Carvalho, Veronica Oliveira [UNESP] Association rules Clustering Labeling methods Post-processing Association mining Descriptors Post processing Repetition frequency Information systems |
title_short |
Labeling methods for association rule clustering |
title_full |
Labeling methods for association rule clustering |
title_fullStr |
Labeling methods for association rule clustering |
title_full_unstemmed |
Labeling methods for association rule clustering |
title_sort |
Labeling methods for association rule clustering |
author |
De Carvalho, Veronica Oliveira [UNESP] |
author_facet |
De Carvalho, Veronica Oliveira [UNESP] Biondi, Daniel Savoia [UNESP] Dos Santos, Fabiano Fernandes Rezende, Solange Oliveira |
author_role |
author |
author2 |
Biondi, Daniel Savoia [UNESP] Dos Santos, Fabiano Fernandes Rezende, Solange Oliveira |
author2_role |
author author author |
dc.contributor.none.fl_str_mv |
Universidade Estadual Paulista (Unesp) Universidade de São Paulo (USP) |
dc.contributor.author.fl_str_mv |
De Carvalho, Veronica Oliveira [UNESP] Biondi, Daniel Savoia [UNESP] Dos Santos, Fabiano Fernandes Rezende, Solange Oliveira |
dc.subject.por.fl_str_mv |
Association rules Clustering Labeling methods Post-processing Association mining Descriptors Post processing Repetition frequency Information systems |
topic |
Association rules Clustering Labeling methods Post-processing Association mining Descriptors Post processing Repetition frequency Information systems |
description |
Although association mining has been highlighted in the last years, the huge number of rules that are generated hamper its use. To overcome this problem, many post-processing approaches were suggested, such as clustering, which organizes the rules in groups that contain, somehow, similar knowledge. Nevertheless, clustering can aid the user only if good descriptors be associated with each group. This is a relevant issue, since the labels will provide to the user a view of the topics to be explored, helping to guide its search. This is interesting, for example, when the user doesn't have, a priori, an idea where to start. Thus, the analysis of different labeling methods for association rule clustering is important. Considering the exposed arguments, this paper analyzes some labeling methods through two measures that are proposed. One of them, Precision, measures how much the methods can find labels that represent as accurately as possible the rules contained in its group and Repetition Frequency determines how the labels are distributed along the clusters. As a result, it was possible to identify the methods and the domain organizations with the best performances that can be applied in clusters of association rules. |
publishDate |
2012 |
dc.date.none.fl_str_mv |
2012-09-10 2014-05-27T11:26:59Z 2014-05-27T11:26:59Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/conferenceObject |
format |
conferenceObject |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://dx.doi.org/10.5220/0003970001050111 ICEIS 2012 - Proceedings of the 14th International Conference on Enterprise Information Systems, v. 1 DISI, n. AIDSS/-, p. 105-111, 2012. http://hdl.handle.net/11449/73568 10.5220/0003970001050111 2-s2.0-84865763484 |
url |
http://dx.doi.org/10.5220/0003970001050111 http://hdl.handle.net/11449/73568 |
identifier_str_mv |
ICEIS 2012 - Proceedings of the 14th International Conference on Enterprise Information Systems, v. 1 DISI, n. AIDSS/-, p. 105-111, 2012. 10.5220/0003970001050111 2-s2.0-84865763484 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
ICEIS 2012 - Proceedings of the 14th International Conference on Enterprise Information Systems |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
105-111 |
dc.source.none.fl_str_mv |
Scopus reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP |
instname_str |
Universidade Estadual Paulista (UNESP) |
instacron_str |
UNESP |
institution |
UNESP |
reponame_str |
Repositório Institucional da UNESP |
collection |
Repositório Institucional da UNESP |
repository.name.fl_str_mv |
Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP) |
repository.mail.fl_str_mv |
|
_version_ |
1808128650867900416 |