Node concordance: a local homophily prediction task in graphs

Martinelli, Caio Lorenzetti

Node concordance: a local homophily prediction task in graphs

Detalhes bibliográficos
Autor(a) principal:	Martinelli, Caio Lorenzetti
Data de Publicação:	2023
Tipo de documento:	Dissertação
Idioma:	eng
Título da fonte:	Biblioteca Digital de Teses e Dissertações da USP
Texto Completo:	https://www.teses.usp.br/teses/disponiveis/45/45134/tde-25092023-200028/
Resumo:	Homophily is a characteristic present in many real-world graphs, this work proposes a task to predict the local manifestation of it, the node concordance. The task is explored in benchmark datasets for node classification, using node labels to create the concordance label, and with two frameworks, one positional and one structured, for a semi-supervised version of the task are pro- posed. In those datasets, the task can be viewed as a subtask of the node classification, we want to predict if two nodes are same-class nodes, not taking into account which classes the nodes belong to. It is shown here that there is a performance advantage in tackling node concordance directly in this case. The frameworks consist of utilizing Graph Neural Networks (GNNs) and Node2Vec to generate node embeddings that are informative of the node concordance. The positional framework is trained in an unsupervised manner, actually targeting link prediction, using the graph topology as its only feature, and is shown to hold predictive power for node concordance although the relation between the link prediction and node concordance predictive powers is not direct, as is shown in this work . The structural embeddings are trained directly for node concordance, using node features and GNNs convolutional mechanisms, and generally perform better than the posi- tional framework, but are more sensitive to the number of labeled edges. It is also shown that the two frameworks can be used in combination, in an ensemble, since they contain complementary information to each other. This task can be an end in itself if one desires exactly to assess the node concordance of the nodes, or can serve as a preprocessing step, to attribute edge weights or rewire and make projections of the graph. The code of this work is made publicly available on https://github.com/caiolmart/node-concordance.

Metadados do item

id	USP_ec0f71d31dfcc29694357219c14a93af
oai_identifier_str	oai:teses.usp.br:tde-25092023-200028
network_acronym_str	USP
network_name_str	Biblioteca Digital de Teses e Dissertações da USP
repository_id_str	2721
spelling	Node concordance: a local homophily prediction task in graphsConcordância entre nós: uma tarefa de predição de homofilia localClassificação de nósGraph Neural NetworksGraph Neural NetworksGraph representation learningGraph representation learningGraph topologyHomofiliaHomophilyLink predictionNode classificationPredição de arestasTarefas em grafosTasks on graphsTopologia de grafosHomophily is a characteristic present in many real-world graphs, this work proposes a task to predict the local manifestation of it, the node concordance. The task is explored in benchmark datasets for node classification, using node labels to create the concordance label, and with two frameworks, one positional and one structured, for a semi-supervised version of the task are pro- posed. In those datasets, the task can be viewed as a subtask of the node classification, we want to predict if two nodes are same-class nodes, not taking into account which classes the nodes belong to. It is shown here that there is a performance advantage in tackling node concordance directly in this case. The frameworks consist of utilizing Graph Neural Networks (GNNs) and Node2Vec to generate node embeddings that are informative of the node concordance. The positional framework is trained in an unsupervised manner, actually targeting link prediction, using the graph topology as its only feature, and is shown to hold predictive power for node concordance although the relation between the link prediction and node concordance predictive powers is not direct, as is shown in this work . The structural embeddings are trained directly for node concordance, using node features and GNNs convolutional mechanisms, and generally perform better than the posi- tional framework, but are more sensitive to the number of labeled edges. It is also shown that the two frameworks can be used in combination, in an ensemble, since they contain complementary information to each other. This task can be an end in itself if one desires exactly to assess the node concordance of the nodes, or can serve as a preprocessing step, to attribute edge weights or rewire and make projections of the graph. The code of this work is made publicly available on https://github.com/caiolmart/node-concordance.Homofilia é uma característica presente em muitos grafos do mundo real, esse trabalho propõe a tarefa de prever o manifestação local dela, a concordância entre nós. A tarefa é explorada em con- juntos de dados referência de predição de nós, usando os rótulos dos nós para criar o rótulo de con- cordância, e dois frameworks, um posicional e um estrutural, para uma versão semi-supervisionada da tarefa são propostos. Nesses conjuntos de dados, a tarefa pode ser vista como uma subtarefa da classificação de nós, nós queremos prever se dois nós são de mesma classe, sem levar em conta a quais classes eles pertencem. É mostrado que existe um vantangem de performance em atacar o problema de concordância de nós diretamente nesse caso. Os frameworks consistem em utilizar Graph Neural Networks (GNNs) e Node2Vec para gerar embeddings de nós que são informativos da concordância entre nós. O framework posicional é treinado de maneira não-supervisionada, tendo como objetivo, na verdade, a predição de arestas, usando apenas a topologia do grafo como recurso, e apresenta poder preditivo para concordância entre nós apesar de que a relação entre os poderes preditivos para concordância entre nós e predição de arestas não é direta, como é mostrado nesse trabalho . Os embeddings estruturais são treinados diretamente para concordância entre nós, usando as var- iáveis explicativas dos nós e mecanismos convolucionais das GNNs, e geralmente performam melhor do que o framework posicional, mas são mais sensíveis ao número de arestas rotuladas. Também é mostrado que os dois frameworks podem ser usados em combinação, uma vez que eles contém informações complementares um ao outro. Essa tarefa pode ter como fim ela própria se alguém quiser apurar exatamente a concordância entre nós de um grafo, ou pode servir como um passo de pré-processamento, para atribuir pesos às arestas ou alterar e fazer projeções de um grafo. O código desse trabalho é disponibilizado publicamente em https://github.com/caiolmart/node-concordance.Biblioteca Digitais de Teses e Dissertações da USPMauá, Denis DerataniMartinelli, Caio Lorenzetti2023-07-31info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://www.teses.usp.br/teses/disponiveis/45/45134/tde-25092023-200028/reponame:Biblioteca Digital de Teses e Dissertações da USPinstname:Universidade de São Paulo (USP)instacron:USPLiberar o conteúdo para acesso público.info:eu-repo/semantics/openAccesseng2023-10-11T11:47:32Zoai:teses.usp.br:tde-25092023-200028Biblioteca Digital de Teses e Dissertaçõeshttp://www.teses.usp.br/PUBhttp://www.teses.usp.br/cgi-bin/mtd2br.plvirginia@if.usp.br\|\| atendimento@aguia.usp.br\|\|virginia@if.usp.bropendoar:27212023-10-11T11:47:32Biblioteca Digital de Teses e Dissertações da USP - Universidade de São Paulo (USP)false
dc.title.none.fl_str_mv	Node concordance: a local homophily prediction task in graphs Concordância entre nós: uma tarefa de predição de homofilia local
title	Node concordance: a local homophily prediction task in graphs
spellingShingle	Node concordance: a local homophily prediction task in graphs Martinelli, Caio Lorenzetti Classificação de nós Graph Neural Networks Graph Neural Networks Graph representation learning Graph representation learning Graph topology Homofilia Homophily Link prediction Node classification Predição de arestas Tarefas em grafos Tasks on graphs Topologia de grafos
title_short	Node concordance: a local homophily prediction task in graphs
title_full	Node concordance: a local homophily prediction task in graphs
title_fullStr	Node concordance: a local homophily prediction task in graphs
title_full_unstemmed	Node concordance: a local homophily prediction task in graphs
title_sort	Node concordance: a local homophily prediction task in graphs
author	Martinelli, Caio Lorenzetti
author_facet	Martinelli, Caio Lorenzetti
author_role	author
dc.contributor.none.fl_str_mv	Mauá, Denis Deratani
dc.contributor.author.fl_str_mv	Martinelli, Caio Lorenzetti
dc.subject.por.fl_str_mv	Classificação de nós Graph Neural Networks Graph Neural Networks Graph representation learning Graph representation learning Graph topology Homofilia Homophily Link prediction Node classification Predição de arestas Tarefas em grafos Tasks on graphs Topologia de grafos
topic	Classificação de nós Graph Neural Networks Graph Neural Networks Graph representation learning Graph representation learning Graph topology Homofilia Homophily Link prediction Node classification Predição de arestas Tarefas em grafos Tasks on graphs Topologia de grafos
description	Homophily is a characteristic present in many real-world graphs, this work proposes a task to predict the local manifestation of it, the node concordance. The task is explored in benchmark datasets for node classification, using node labels to create the concordance label, and with two frameworks, one positional and one structured, for a semi-supervised version of the task are pro- posed. In those datasets, the task can be viewed as a subtask of the node classification, we want to predict if two nodes are same-class nodes, not taking into account which classes the nodes belong to. It is shown here that there is a performance advantage in tackling node concordance directly in this case. The frameworks consist of utilizing Graph Neural Networks (GNNs) and Node2Vec to generate node embeddings that are informative of the node concordance. The positional framework is trained in an unsupervised manner, actually targeting link prediction, using the graph topology as its only feature, and is shown to hold predictive power for node concordance although the relation between the link prediction and node concordance predictive powers is not direct, as is shown in this work . The structural embeddings are trained directly for node concordance, using node features and GNNs convolutional mechanisms, and generally perform better than the posi- tional framework, but are more sensitive to the number of labeled edges. It is also shown that the two frameworks can be used in combination, in an ensemble, since they contain complementary information to each other. This task can be an end in itself if one desires exactly to assess the node concordance of the nodes, or can serve as a preprocessing step, to attribute edge weights or rewire and make projections of the graph. The code of this work is made publicly available on https://github.com/caiolmart/node-concordance.
publishDate	2023
dc.date.none.fl_str_mv	2023-07-31
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	https://www.teses.usp.br/teses/disponiveis/45/45134/tde-25092023-200028/
url	https://www.teses.usp.br/teses/disponiveis/45/45134/tde-25092023-200028/
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv
dc.rights.driver.fl_str_mv	Liberar o conteúdo para acesso público. info:eu-repo/semantics/openAccess
rights_invalid_str_mv	Liberar o conteúdo para acesso público.
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.coverage.none.fl_str_mv
dc.publisher.none.fl_str_mv	Biblioteca Digitais de Teses e Dissertações da USP
publisher.none.fl_str_mv	Biblioteca Digitais de Teses e Dissertações da USP
dc.source.none.fl_str_mv	reponame:Biblioteca Digital de Teses e Dissertações da USP instname:Universidade de São Paulo (USP) instacron:USP
instname_str	Universidade de São Paulo (USP)
instacron_str	USP
institution	USP
reponame_str	Biblioteca Digital de Teses e Dissertações da USP
collection	Biblioteca Digital de Teses e Dissertações da USP
repository.name.fl_str_mv	Biblioteca Digital de Teses e Dissertações da USP - Universidade de São Paulo (USP)
repository.mail.fl_str_mv	virginia@if.usp.br\|\| atendimento@aguia.usp.br\|\|virginia@if.usp.br
_version_	1815257342342594560

Node concordance: a local homophily prediction task in graphs

Registros relacionados