Novas formulações e técnicas de pré-processamento para o problema de particionamento de grafo em cliques

Lorena, Luiz Henrique Nogueira [UNIFESP]

Novas formulações e técnicas de pré-processamento para o problema de particionamento de grafo em cliques

Detalhes bibliográficos
Autor(a) principal:	Lorena, Luiz Henrique Nogueira [UNIFESP]
Data de Publicação:	2019
Tipo de documento:	Tese
Idioma:	por
Título da fonte:	Repositório Institucional da UNIFESP
Texto Completo:	https://sucupira.capes.gov.br/sucupira/public/consultas/coleta/trabalhoConclusao/viewTrabalhoConclusao.jsf?popup=true&id_trabalho=7716857 https://repositorio.unifesp.br/handle/11600/59482
Resumo:	Graph clustering is a fundamental technique in data analysis, used to understand the relation between the entities of a dataset. In this wor k, we have studied a graph clustering problem called Clique Partitioning Problem. In this problem, the objective is to partition the vertices of a weighted complete graph, such that the value of the sum of the edge’s weight within the obtained groups is maximum. The authors who suggested this problem proposed a linear integer programming formulation to solve it. This formulation, however, creates mathematical models with a high number of variables and constraints, which require a high computational effort, in terms of time and memory, to be solved. The authors, however, have empirically demonstrated that a large number of constraints inserted in mathematical models are redundant and can be disregarded. Despite this, the authors did not identified the reason for such behavior. Recent studies tr y to identify the redundancy of this formulation and prevent it to be inserted into the mathematical model. Such models require less computational effort to be solved. Another approach used in the literature, to attenuate the issue of the original formulation, is to reduce the size of the graph instance that represents the problem through a preprocessing technique. A smaller graph gives rise to a mathematical model with fewer variables and constraints. This thesis followed the concepts presented in these works to propose new formulations and preprocessing techniques. The computational experiments show that the techniques proposed in this work create mathematical models that are more compact, and which require less computational effort to be solved than the models created from the formulations proposed in the literature.

Metadados do item

id	UFSP_f46d318c72a3981ba7ab7fa5fe9a1c6e
oai_identifier_str	oai:repositorio.unifesp.br/:11600/59482
network_acronym_str	UFSP
network_name_str	Repositório Institucional da UNIFESP
repository_id_str	3465
spelling	Novas formulações e técnicas de pré-processamento para o problema de particionamento de grafo em cliquesGraph ClusteringInteger Linear ProgrammingPreprocessing TechniquesAgrupamento De Dados Em GrafoProgramação Linear InteiraTécnicas De Pré-Processamento.Graph clustering is a fundamental technique in data analysis, used to understand the relation between the entities of a dataset. In this wor k, we have studied a graph clustering problem called Clique Partitioning Problem. In this problem, the objective is to partition the vertices of a weighted complete graph, such that the value of the sum of the edge’s weight within the obtained groups is maximum. The authors who suggested this problem proposed a linear integer programming formulation to solve it. This formulation, however, creates mathematical models with a high number of variables and constraints, which require a high computational effort, in terms of time and memory, to be solved. The authors, however, have empirically demonstrated that a large number of constraints inserted in mathematical models are redundant and can be disregarded. Despite this, the authors did not identified the reason for such behavior. Recent studies tr y to identify the redundancy of this formulation and prevent it to be inserted into the mathematical model. Such models require less computational effort to be solved. Another approach used in the literature, to attenuate the issue of the original formulation, is to reduce the size of the graph instance that represents the problem through a preprocessing technique. A smaller graph gives rise to a mathematical model with fewer variables and constraints. This thesis followed the concepts presented in these works to propose new formulations and preprocessing techniques. The computational experiments show that the techniques proposed in this work create mathematical models that are more compact, and which require less computational effort to be solved than the models created from the formulations proposed in the literature.O agrupamento de dados em grafo é uma técnica fundamental em análise de dados, utilizada para compreender a relação entre as entidades de um conjunto de dados. Neste trabalho, estudou-se um problema de agr upamento de dados em grafo em par ticular denominado Par ticionamento de Grafo em Cliques. Neste problema, o objetivo é par ticionar os vér tices de um grafo completo ponderado, de for ma que o valor da soma do peso das arestas dentro dos gr upos obtidos seja máximo. Os autores que suger iram esse problema criaram uma for mulação de programação linear inteira para solucioná-lo. Essa for mulação, entretanto, cr ia modelos matemáticos com um elevado número de var iáveis e restr ições, e que demandam um elevado esforço computacional, em ter mos de tempo e memór ia, para serem solucionados. Os autores, no entanto, demonstraram empir icamente que uma grande número de restrições inseridas nos modelos matemáticos são redundantes e podem ser desconsideradas. Apesar disso, não realizaram um estudo para tentar identificar o porquê desse compor tamento. Apenas recentemente, foram realizados estudos que tentam identificar a redundância dessa for mulação. Nesses estudos, os autores conseguiram identificaram par te da redundância e propuseram formulações que evitam que ela seja inser ida no modelo matemático, criando um modelo que demandam menos esforço computacional para ser solucionado. Uma outra abordagem utilizada na literatura para tentar mitigar os problemas da for mulação or iginal é diminuir o tamanho do grafo do problema, através de uma técnica de pré-processamento. Um grafo menor dá origem a um modelo matemático com menos var iáveis e restrições. Esta tese seguiu os princípios desses trabalhos para propor novas formulações e técnicas de pré-processamento. Os experimentos computacionais mostram que as técnicas propostas neste trabalho dão origem à modelos matemáticos mais compactos, e que demandam menos esforço computacional para serem solucionados do que os modelos criados a partir das formulações propostas na literatura.Dados abertos - Sucupira - Teses e dissertações (2019)Universidade Federal de São Paulo (UNIFESP)Quiles, Marcos Goncalves [UNIFESP]Universidade Federal de São Paulo (UNIFESP)Lorena, Luiz Henrique Nogueira [UNIFESP]2021-01-19T16:32:25Z2021-01-19T16:32:25Z2019-06-07info:eu-repo/semantics/doctoralThesisinfo:eu-repo/semantics/publishedVersionapplication/pdfhttps://sucupira.capes.gov.br/sucupira/public/consultas/coleta/trabalhoConclusao/viewTrabalhoConclusao.jsf?popup=true&id_trabalho=7716857LUIZ HENRIQUE NOGUEIRA LORENA.pdfhttps://repositorio.unifesp.br/handle/11600/59482porinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UNIFESPinstname:Universidade Federal de São Paulo (UNIFESP)instacron:UNIFESP2024-08-03T01:57:14Zoai:repositorio.unifesp.br/:11600/59482Repositório InstitucionalPUBhttp://www.repositorio.unifesp.br/oai/requestbiblioteca.csp@unifesp.bropendoar:34652024-08-03T01:57:14Repositório Institucional da UNIFESP - Universidade Federal de São Paulo (UNIFESP)false
dc.title.none.fl_str_mv	Novas formulações e técnicas de pré-processamento para o problema de particionamento de grafo em cliques
title	Novas formulações e técnicas de pré-processamento para o problema de particionamento de grafo em cliques
spellingShingle	Novas formulações e técnicas de pré-processamento para o problema de particionamento de grafo em cliques Lorena, Luiz Henrique Nogueira [UNIFESP] Graph Clustering Integer Linear Programming Preprocessing Techniques Agrupamento De Dados Em Grafo Programação Linear Inteira Técnicas De Pré-Processamento.
title_short	Novas formulações e técnicas de pré-processamento para o problema de particionamento de grafo em cliques
title_full	Novas formulações e técnicas de pré-processamento para o problema de particionamento de grafo em cliques
title_fullStr	Novas formulações e técnicas de pré-processamento para o problema de particionamento de grafo em cliques
title_full_unstemmed	Novas formulações e técnicas de pré-processamento para o problema de particionamento de grafo em cliques
title_sort	Novas formulações e técnicas de pré-processamento para o problema de particionamento de grafo em cliques
author	Lorena, Luiz Henrique Nogueira [UNIFESP]
author_facet	Lorena, Luiz Henrique Nogueira [UNIFESP]
author_role	author
dc.contributor.none.fl_str_mv	Quiles, Marcos Goncalves [UNIFESP] Universidade Federal de São Paulo (UNIFESP)
dc.contributor.author.fl_str_mv	Lorena, Luiz Henrique Nogueira [UNIFESP]
dc.subject.por.fl_str_mv	Graph Clustering Integer Linear Programming Preprocessing Techniques Agrupamento De Dados Em Grafo Programação Linear Inteira Técnicas De Pré-Processamento.
topic	Graph Clustering Integer Linear Programming Preprocessing Techniques Agrupamento De Dados Em Grafo Programação Linear Inteira Técnicas De Pré-Processamento.
description	Graph clustering is a fundamental technique in data analysis, used to understand the relation between the entities of a dataset. In this wor k, we have studied a graph clustering problem called Clique Partitioning Problem. In this problem, the objective is to partition the vertices of a weighted complete graph, such that the value of the sum of the edge’s weight within the obtained groups is maximum. The authors who suggested this problem proposed a linear integer programming formulation to solve it. This formulation, however, creates mathematical models with a high number of variables and constraints, which require a high computational effort, in terms of time and memory, to be solved. The authors, however, have empirically demonstrated that a large number of constraints inserted in mathematical models are redundant and can be disregarded. Despite this, the authors did not identified the reason for such behavior. Recent studies tr y to identify the redundancy of this formulation and prevent it to be inserted into the mathematical model. Such models require less computational effort to be solved. Another approach used in the literature, to attenuate the issue of the original formulation, is to reduce the size of the graph instance that represents the problem through a preprocessing technique. A smaller graph gives rise to a mathematical model with fewer variables and constraints. This thesis followed the concepts presented in these works to propose new formulations and preprocessing techniques. The computational experiments show that the techniques proposed in this work create mathematical models that are more compact, and which require less computational effort to be solved than the models created from the formulations proposed in the literature.
publishDate	2019
dc.date.none.fl_str_mv	2019-06-07 2021-01-19T16:32:25Z 2021-01-19T16:32:25Z
dc.type.driver.fl_str_mv	info:eu-repo/semantics/doctoralThesis
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
format	doctoralThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	https://sucupira.capes.gov.br/sucupira/public/consultas/coleta/trabalhoConclusao/viewTrabalhoConclusao.jsf?popup=true&id_trabalho=7716857 LUIZ HENRIQUE NOGUEIRA LORENA.pdf https://repositorio.unifesp.br/handle/11600/59482
url	https://sucupira.capes.gov.br/sucupira/public/consultas/coleta/trabalhoConclusao/viewTrabalhoConclusao.jsf?popup=true&id_trabalho=7716857 https://repositorio.unifesp.br/handle/11600/59482
identifier_str_mv	LUIZ HENRIQUE NOGUEIRA LORENA.pdf
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidade Federal de São Paulo (UNIFESP)
publisher.none.fl_str_mv	Universidade Federal de São Paulo (UNIFESP)
dc.source.none.fl_str_mv	reponame:Repositório Institucional da UNIFESP instname:Universidade Federal de São Paulo (UNIFESP) instacron:UNIFESP
instname_str	Universidade Federal de São Paulo (UNIFESP)
instacron_str	UNIFESP
institution	UNIFESP
reponame_str	Repositório Institucional da UNIFESP
collection	Repositório Institucional da UNIFESP
repository.name.fl_str_mv	Repositório Institucional da UNIFESP - Universidade Federal de São Paulo (UNIFESP)
repository.mail.fl_str_mv	biblioteca.csp@unifesp.br
_version_	1814268339577421824

Novas formulações e técnicas de pré-processamento para o problema de particionamento de grafo em cliques

Registros relacionados