Identifying and ranking super spreaders in real world complex networks without influence overlap

Maji, Giridhar; Dutta, Animesh; Curado Malta, Mariana; Sen, Soumya

Identifying and ranking super spreaders in real world complex networks without influence overlap

Detalhes bibliográficos
Autor(a) principal:	Maji, Giridhar
Data de Publicação:	2021
Outros Autores:	Dutta, Animesh, Curado Malta, Mariana, Sen, Soumya
Tipo de documento:	Artigo
Idioma:	eng
Título da fonte:	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo:	http://hdl.handle.net/10400.22/19804
Resumo:	In the present-days complex networks modeled on real-world data contain millions of nodes and billions of links. Identifying super spreaders in such an extensive network is a challenging task. Super spreaders are the most important or influential nodes in the network that play the central role during an infection spreading or infor mation diffusion process. Depending on the application, either the most influential node needs to be identified, or a set of initial seed nodes are identified that can maximize the collective influence or the total spread in the network. Many centrality measures have been proposed to rank nodes in a complex network such as ‘degree’, ‘closeness’, ‘betweenness’, ‘coreness’ or ‘k-shell’ centrality, among others. All have some kind of inherent limi tations. Mixed degree decomposition or m-shell is an improvement over k-shell that yields better ranking. Many researchers have employed single node identification heuristics to select multiple seed nodes by considering top-k nodes from the ranked list. This approach does not results in the optimal seed nodeset due to the considerable overlap in total spreading influence. Influence overlap occurs when multiple nodes from the seed nodeset in fluence a specific node, and it is counted multiple times during total collective influence computation. In this paper, we exploit the ‘node degree’, ‘closeness’ and ‘coreness’ among the nodes and propose novel heuristic template to rank the super spreaders in a network. We employ k-shell and m-shell as a coreness measure in two variants for a comparative evaluation. We use a geodesic-based constraint (enforcing a minimum distance between seed nodes) to select an initial seed nodeset from that ranked nodes for influence maximization instead of selecting the top-k nodes naively. All models and metrics are updated to avoid overlapping influence during total spread computation. Experimental simulation with the SIR (Susceptible-Infectious-Recovered) spreading model and an evaluation with performance metrics like spreadability, monotonicity of ranking, Kendall’s rank correlation on some benchmark real-world networks establish the superiority of the proposed methods and the improved seed node selection technique.

Metadados do item

id	RCAP_00dd054bad665a058e5e0628f9778f60
oai_identifier_str	oai:recipp.ipp.pt:10400.22/19804
network_acronym_str	RCAP
network_name_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str	7160
spelling	Identifying and ranking super spreaders in real world complex networks without influence overlapInfluential spreader identificationSpreading overlapSeed selection with minimum geodesicSIR simulationMonotonicityKendall’s rank correlationIn the present-days complex networks modeled on real-world data contain millions of nodes and billions of links. Identifying super spreaders in such an extensive network is a challenging task. Super spreaders are the most important or influential nodes in the network that play the central role during an infection spreading or infor mation diffusion process. Depending on the application, either the most influential node needs to be identified, or a set of initial seed nodes are identified that can maximize the collective influence or the total spread in the network. Many centrality measures have been proposed to rank nodes in a complex network such as ‘degree’, ‘closeness’, ‘betweenness’, ‘coreness’ or ‘k-shell’ centrality, among others. All have some kind of inherent limi tations. Mixed degree decomposition or m-shell is an improvement over k-shell that yields better ranking. Many researchers have employed single node identification heuristics to select multiple seed nodes by considering top-k nodes from the ranked list. This approach does not results in the optimal seed nodeset due to the considerable overlap in total spreading influence. Influence overlap occurs when multiple nodes from the seed nodeset in fluence a specific node, and it is counted multiple times during total collective influence computation. In this paper, we exploit the ‘node degree’, ‘closeness’ and ‘coreness’ among the nodes and propose novel heuristic template to rank the super spreaders in a network. We employ k-shell and m-shell as a coreness measure in two variants for a comparative evaluation. We use a geodesic-based constraint (enforcing a minimum distance between seed nodes) to select an initial seed nodeset from that ranked nodes for influence maximization instead of selecting the top-k nodes naively. All models and metrics are updated to avoid overlapping influence during total spread computation. Experimental simulation with the SIR (Susceptible-Infectious-Recovered) spreading model and an evaluation with performance metrics like spreadability, monotonicity of ranking, Kendall’s rank correlation on some benchmark real-world networks establish the superiority of the proposed methods and the improved seed node selection technique.ELSEVIERRepositório Científico do Instituto Politécnico do PortoMaji, GiridharDutta, AnimeshCurado Malta, MarianaSen, Soumya2022-02-07T11:13:56Z20212021-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.22/19804eng10.1016/j.eswa.2021.115061metadata only accessinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-08-16T01:49:39Zoai:recipp.ipp.pt:10400.22/19804Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T17:39:52.333725Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv	Identifying and ranking super spreaders in real world complex networks without influence overlap
title	Identifying and ranking super spreaders in real world complex networks without influence overlap
spellingShingle	Identifying and ranking super spreaders in real world complex networks without influence overlap Maji, Giridhar Influential spreader identification Spreading overlap Seed selection with minimum geodesic SIR simulation Monotonicity Kendall’s rank correlation
title_short	Identifying and ranking super spreaders in real world complex networks without influence overlap
title_full	Identifying and ranking super spreaders in real world complex networks without influence overlap
title_fullStr	Identifying and ranking super spreaders in real world complex networks without influence overlap
title_full_unstemmed	Identifying and ranking super spreaders in real world complex networks without influence overlap
title_sort	Identifying and ranking super spreaders in real world complex networks without influence overlap
author	Maji, Giridhar
author_facet	Maji, Giridhar Dutta, Animesh Curado Malta, Mariana Sen, Soumya
author_role	author
author2	Dutta, Animesh Curado Malta, Mariana Sen, Soumya
author2_role	author author author
dc.contributor.none.fl_str_mv	Repositório Científico do Instituto Politécnico do Porto
dc.contributor.author.fl_str_mv	Maji, Giridhar Dutta, Animesh Curado Malta, Mariana Sen, Soumya
dc.subject.por.fl_str_mv	Influential spreader identification Spreading overlap Seed selection with minimum geodesic SIR simulation Monotonicity Kendall’s rank correlation
topic	Influential spreader identification Spreading overlap Seed selection with minimum geodesic SIR simulation Monotonicity Kendall’s rank correlation
description	In the present-days complex networks modeled on real-world data contain millions of nodes and billions of links. Identifying super spreaders in such an extensive network is a challenging task. Super spreaders are the most important or influential nodes in the network that play the central role during an infection spreading or infor mation diffusion process. Depending on the application, either the most influential node needs to be identified, or a set of initial seed nodes are identified that can maximize the collective influence or the total spread in the network. Many centrality measures have been proposed to rank nodes in a complex network such as ‘degree’, ‘closeness’, ‘betweenness’, ‘coreness’ or ‘k-shell’ centrality, among others. All have some kind of inherent limi tations. Mixed degree decomposition or m-shell is an improvement over k-shell that yields better ranking. Many researchers have employed single node identification heuristics to select multiple seed nodes by considering top-k nodes from the ranked list. This approach does not results in the optimal seed nodeset due to the considerable overlap in total spreading influence. Influence overlap occurs when multiple nodes from the seed nodeset in fluence a specific node, and it is counted multiple times during total collective influence computation. In this paper, we exploit the ‘node degree’, ‘closeness’ and ‘coreness’ among the nodes and propose novel heuristic template to rank the super spreaders in a network. We employ k-shell and m-shell as a coreness measure in two variants for a comparative evaluation. We use a geodesic-based constraint (enforcing a minimum distance between seed nodes) to select an initial seed nodeset from that ranked nodes for influence maximization instead of selecting the top-k nodes naively. All models and metrics are updated to avoid overlapping influence during total spread computation. Experimental simulation with the SIR (Susceptible-Infectious-Recovered) spreading model and an evaluation with performance metrics like spreadability, monotonicity of ranking, Kendall’s rank correlation on some benchmark real-world networks establish the superiority of the proposed methods and the improved seed node selection technique.
publishDate	2021
dc.date.none.fl_str_mv	2021 2021-01-01T00:00:00Z 2022-02-07T11:13:56Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/article
format	article
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://hdl.handle.net/10400.22/19804
url	http://hdl.handle.net/10400.22/19804
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	10.1016/j.eswa.2021.115061
dc.rights.driver.fl_str_mv	metadata only access info:eu-repo/semantics/openAccess
rights_invalid_str_mv	metadata only access
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	ELSEVIER
publisher.none.fl_str_mv	ELSEVIER
dc.source.none.fl_str_mv	reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP
instname_str	Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_	1799131486534762496

Identifying and ranking super spreaders in real world complex networks without influence overlap

Registros relacionados