Labeling methods for association rule clustering

Detalhes bibliográficos
Autor(a) principal: De Carvalho, Veronica Oliveira [UNESP]
Data de Publicação: 2012
Outros Autores: Biondi, Daniel Savoia [UNESP], Dos Santos, Fabiano Fernandes, Rezende, Solange Oliveira
Tipo de documento: Artigo de conferência
Idioma: eng
Título da fonte: Repositório Institucional da UNESP
Texto Completo: http://dx.doi.org/10.5220/0003970001050111
http://hdl.handle.net/11449/73568
Resumo: Although association mining has been highlighted in the last years, the huge number of rules that are generated hamper its use. To overcome this problem, many post-processing approaches were suggested, such as clustering, which organizes the rules in groups that contain, somehow, similar knowledge. Nevertheless, clustering can aid the user only if good descriptors be associated with each group. This is a relevant issue, since the labels will provide to the user a view of the topics to be explored, helping to guide its search. This is interesting, for example, when the user doesn't have, a priori, an idea where to start. Thus, the analysis of different labeling methods for association rule clustering is important. Considering the exposed arguments, this paper analyzes some labeling methods through two measures that are proposed. One of them, Precision, measures how much the methods can find labels that represent as accurately as possible the rules contained in its group and Repetition Frequency determines how the labels are distributed along the clusters. As a result, it was possible to identify the methods and the domain organizations with the best performances that can be applied in clusters of association rules.
id UNSP_814f78b4622940aaa3b5582a054aac1f
oai_identifier_str oai:repositorio.unesp.br:11449/73568
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling Labeling methods for association rule clusteringAssociation rulesClusteringLabeling methodsPost-processingAssociation miningDescriptorsPost processingRepetition frequencyInformation systemsAlthough association mining has been highlighted in the last years, the huge number of rules that are generated hamper its use. To overcome this problem, many post-processing approaches were suggested, such as clustering, which organizes the rules in groups that contain, somehow, similar knowledge. Nevertheless, clustering can aid the user only if good descriptors be associated with each group. This is a relevant issue, since the labels will provide to the user a view of the topics to be explored, helping to guide its search. This is interesting, for example, when the user doesn't have, a priori, an idea where to start. Thus, the analysis of different labeling methods for association rule clustering is important. Considering the exposed arguments, this paper analyzes some labeling methods through two measures that are proposed. One of them, Precision, measures how much the methods can find labels that represent as accurately as possible the rules contained in its group and Repetition Frequency determines how the labels are distributed along the clusters. As a result, it was possible to identify the methods and the domain organizations with the best performances that can be applied in clusters of association rules.Instituto de Geociências e Ciências Exatas Universidade Estadual Paulista (UNESP), São PauloInstituto de Ciências Matemáticas e de Computaçã o Universidade de São Paulo, São PauloInstituto de Geociências e Ciências Exatas Universidade Estadual Paulista (UNESP), São PauloUniversidade Estadual Paulista (Unesp)Universidade de São Paulo (USP)De Carvalho, Veronica Oliveira [UNESP]Biondi, Daniel Savoia [UNESP]Dos Santos, Fabiano FernandesRezende, Solange Oliveira2014-05-27T11:26:59Z2014-05-27T11:26:59Z2012-09-10info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObject105-111http://dx.doi.org/10.5220/0003970001050111ICEIS 2012 - Proceedings of the 14th International Conference on Enterprise Information Systems, v. 1 DISI, n. AIDSS/-, p. 105-111, 2012.http://hdl.handle.net/11449/7356810.5220/00039700010501112-s2.0-84865763484Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengICEIS 2012 - Proceedings of the 14th International Conference on Enterprise Information Systemsinfo:eu-repo/semantics/openAccess2021-10-23T21:37:44Zoai:repositorio.unesp.br:11449/73568Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T16:26:06.371867Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv Labeling methods for association rule clustering
title Labeling methods for association rule clustering
spellingShingle Labeling methods for association rule clustering
De Carvalho, Veronica Oliveira [UNESP]
Association rules
Clustering
Labeling methods
Post-processing
Association mining
Descriptors
Post processing
Repetition frequency
Information systems
title_short Labeling methods for association rule clustering
title_full Labeling methods for association rule clustering
title_fullStr Labeling methods for association rule clustering
title_full_unstemmed Labeling methods for association rule clustering
title_sort Labeling methods for association rule clustering
author De Carvalho, Veronica Oliveira [UNESP]
author_facet De Carvalho, Veronica Oliveira [UNESP]
Biondi, Daniel Savoia [UNESP]
Dos Santos, Fabiano Fernandes
Rezende, Solange Oliveira
author_role author
author2 Biondi, Daniel Savoia [UNESP]
Dos Santos, Fabiano Fernandes
Rezende, Solange Oliveira
author2_role author
author
author
dc.contributor.none.fl_str_mv Universidade Estadual Paulista (Unesp)
Universidade de São Paulo (USP)
dc.contributor.author.fl_str_mv De Carvalho, Veronica Oliveira [UNESP]
Biondi, Daniel Savoia [UNESP]
Dos Santos, Fabiano Fernandes
Rezende, Solange Oliveira
dc.subject.por.fl_str_mv Association rules
Clustering
Labeling methods
Post-processing
Association mining
Descriptors
Post processing
Repetition frequency
Information systems
topic Association rules
Clustering
Labeling methods
Post-processing
Association mining
Descriptors
Post processing
Repetition frequency
Information systems
description Although association mining has been highlighted in the last years, the huge number of rules that are generated hamper its use. To overcome this problem, many post-processing approaches were suggested, such as clustering, which organizes the rules in groups that contain, somehow, similar knowledge. Nevertheless, clustering can aid the user only if good descriptors be associated with each group. This is a relevant issue, since the labels will provide to the user a view of the topics to be explored, helping to guide its search. This is interesting, for example, when the user doesn't have, a priori, an idea where to start. Thus, the analysis of different labeling methods for association rule clustering is important. Considering the exposed arguments, this paper analyzes some labeling methods through two measures that are proposed. One of them, Precision, measures how much the methods can find labels that represent as accurately as possible the rules contained in its group and Repetition Frequency determines how the labels are distributed along the clusters. As a result, it was possible to identify the methods and the domain organizations with the best performances that can be applied in clusters of association rules.
publishDate 2012
dc.date.none.fl_str_mv 2012-09-10
2014-05-27T11:26:59Z
2014-05-27T11:26:59Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/conferenceObject
format conferenceObject
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.5220/0003970001050111
ICEIS 2012 - Proceedings of the 14th International Conference on Enterprise Information Systems, v. 1 DISI, n. AIDSS/-, p. 105-111, 2012.
http://hdl.handle.net/11449/73568
10.5220/0003970001050111
2-s2.0-84865763484
url http://dx.doi.org/10.5220/0003970001050111
http://hdl.handle.net/11449/73568
identifier_str_mv ICEIS 2012 - Proceedings of the 14th International Conference on Enterprise Information Systems, v. 1 DISI, n. AIDSS/-, p. 105-111, 2012.
10.5220/0003970001050111
2-s2.0-84865763484
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv ICEIS 2012 - Proceedings of the 14th International Conference on Enterprise Information Systems
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv 105-111
dc.source.none.fl_str_mv Scopus
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_ 1808128650867900416