Identifying and ranking super spreaders in real world complex networks without influence overlap

Detalhes bibliográficos
Autor(a) principal: Maji, Giridhar
Data de Publicação: 2021
Outros Autores: Dutta, Animesh, Curado Malta, Mariana, Sen, Soumya
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10400.22/19804
Resumo: In the present-days complex networks modeled on real-world data contain millions of nodes and billions of links. Identifying super spreaders in such an extensive network is a challenging task. Super spreaders are the most important or influential nodes in the network that play the central role during an infection spreading or infor­ mation diffusion process. Depending on the application, either the most influential node needs to be identified, or a set of initial seed nodes are identified that can maximize the collective influence or the total spread in the network. Many centrality measures have been proposed to rank nodes in a complex network such as ‘degree’, ‘closeness’, ‘betweenness’, ‘coreness’ or ‘k-shell’ centrality, among others. All have some kind of inherent limi­ tations. Mixed degree decomposition or m-shell is an improvement over k-shell that yields better ranking. Many researchers have employed single node identification heuristics to select multiple seed nodes by considering top-k nodes from the ranked list. This approach does not results in the optimal seed nodeset due to the considerable overlap in total spreading influence. Influence overlap occurs when multiple nodes from the seed nodeset in­ fluence a specific node, and it is counted multiple times during total collective influence computation. In this paper, we exploit the ‘node degree’, ‘closeness’ and ‘coreness’ among the nodes and propose novel heuristic template to rank the super spreaders in a network. We employ k-shell and m-shell as a coreness measure in two variants for a comparative evaluation. We use a geodesic-based constraint (enforcing a minimum distance be­tween seed nodes) to select an initial seed nodeset from that ranked nodes for influence maximization instead of selecting the top-k nodes naively. All models and metrics are updated to avoid overlapping influence during total spread computation. Experimental simulation with the SIR (Susceptible-Infectious-Recovered) spreading model and an evaluation with performance metrics like spreadability, monotonicity of ranking, Kendall’s rank corre­lation on some benchmark real-world networks establish the superiority of the proposed methods and the improved seed node selection technique.
id RCAP_00dd054bad665a058e5e0628f9778f60
oai_identifier_str oai:recipp.ipp.pt:10400.22/19804
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Identifying and ranking super spreaders in real world complex networks without influence overlapInfluential spreader identificationSpreading overlapSeed selection with minimum geodesicSIR simulationMonotonicityKendall’s rank correlationIn the present-days complex networks modeled on real-world data contain millions of nodes and billions of links. Identifying super spreaders in such an extensive network is a challenging task. Super spreaders are the most important or influential nodes in the network that play the central role during an infection spreading or infor­ mation diffusion process. Depending on the application, either the most influential node needs to be identified, or a set of initial seed nodes are identified that can maximize the collective influence or the total spread in the network. Many centrality measures have been proposed to rank nodes in a complex network such as ‘degree’, ‘closeness’, ‘betweenness’, ‘coreness’ or ‘k-shell’ centrality, among others. All have some kind of inherent limi­ tations. Mixed degree decomposition or m-shell is an improvement over k-shell that yields better ranking. Many researchers have employed single node identification heuristics to select multiple seed nodes by considering top-k nodes from the ranked list. This approach does not results in the optimal seed nodeset due to the considerable overlap in total spreading influence. Influence overlap occurs when multiple nodes from the seed nodeset in­ fluence a specific node, and it is counted multiple times during total collective influence computation. In this paper, we exploit the ‘node degree’, ‘closeness’ and ‘coreness’ among the nodes and propose novel heuristic template to rank the super spreaders in a network. We employ k-shell and m-shell as a coreness measure in two variants for a comparative evaluation. We use a geodesic-based constraint (enforcing a minimum distance be­tween seed nodes) to select an initial seed nodeset from that ranked nodes for influence maximization instead of selecting the top-k nodes naively. All models and metrics are updated to avoid overlapping influence during total spread computation. Experimental simulation with the SIR (Susceptible-Infectious-Recovered) spreading model and an evaluation with performance metrics like spreadability, monotonicity of ranking, Kendall’s rank corre­lation on some benchmark real-world networks establish the superiority of the proposed methods and the improved seed node selection technique.ELSEVIERRepositório Científico do Instituto Politécnico do PortoMaji, GiridharDutta, AnimeshCurado Malta, MarianaSen, Soumya2022-02-07T11:13:56Z20212021-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.22/19804eng10.1016/j.eswa.2021.115061metadata only accessinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-08-16T01:49:39Zoai:recipp.ipp.pt:10400.22/19804Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T17:39:52.333725Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Identifying and ranking super spreaders in real world complex networks without influence overlap
title Identifying and ranking super spreaders in real world complex networks without influence overlap
spellingShingle Identifying and ranking super spreaders in real world complex networks without influence overlap
Maji, Giridhar
Influential spreader identification
Spreading overlap
Seed selection with minimum geodesic
SIR simulation
Monotonicity
Kendall’s rank correlation
title_short Identifying and ranking super spreaders in real world complex networks without influence overlap
title_full Identifying and ranking super spreaders in real world complex networks without influence overlap
title_fullStr Identifying and ranking super spreaders in real world complex networks without influence overlap
title_full_unstemmed Identifying and ranking super spreaders in real world complex networks without influence overlap
title_sort Identifying and ranking super spreaders in real world complex networks without influence overlap
author Maji, Giridhar
author_facet Maji, Giridhar
Dutta, Animesh
Curado Malta, Mariana
Sen, Soumya
author_role author
author2 Dutta, Animesh
Curado Malta, Mariana
Sen, Soumya
author2_role author
author
author
dc.contributor.none.fl_str_mv Repositório Científico do Instituto Politécnico do Porto
dc.contributor.author.fl_str_mv Maji, Giridhar
Dutta, Animesh
Curado Malta, Mariana
Sen, Soumya
dc.subject.por.fl_str_mv Influential spreader identification
Spreading overlap
Seed selection with minimum geodesic
SIR simulation
Monotonicity
Kendall’s rank correlation
topic Influential spreader identification
Spreading overlap
Seed selection with minimum geodesic
SIR simulation
Monotonicity
Kendall’s rank correlation
description In the present-days complex networks modeled on real-world data contain millions of nodes and billions of links. Identifying super spreaders in such an extensive network is a challenging task. Super spreaders are the most important or influential nodes in the network that play the central role during an infection spreading or infor­ mation diffusion process. Depending on the application, either the most influential node needs to be identified, or a set of initial seed nodes are identified that can maximize the collective influence or the total spread in the network. Many centrality measures have been proposed to rank nodes in a complex network such as ‘degree’, ‘closeness’, ‘betweenness’, ‘coreness’ or ‘k-shell’ centrality, among others. All have some kind of inherent limi­ tations. Mixed degree decomposition or m-shell is an improvement over k-shell that yields better ranking. Many researchers have employed single node identification heuristics to select multiple seed nodes by considering top-k nodes from the ranked list. This approach does not results in the optimal seed nodeset due to the considerable overlap in total spreading influence. Influence overlap occurs when multiple nodes from the seed nodeset in­ fluence a specific node, and it is counted multiple times during total collective influence computation. In this paper, we exploit the ‘node degree’, ‘closeness’ and ‘coreness’ among the nodes and propose novel heuristic template to rank the super spreaders in a network. We employ k-shell and m-shell as a coreness measure in two variants for a comparative evaluation. We use a geodesic-based constraint (enforcing a minimum distance be­tween seed nodes) to select an initial seed nodeset from that ranked nodes for influence maximization instead of selecting the top-k nodes naively. All models and metrics are updated to avoid overlapping influence during total spread computation. Experimental simulation with the SIR (Susceptible-Infectious-Recovered) spreading model and an evaluation with performance metrics like spreadability, monotonicity of ranking, Kendall’s rank corre­lation on some benchmark real-world networks establish the superiority of the proposed methods and the improved seed node selection technique.
publishDate 2021
dc.date.none.fl_str_mv 2021
2021-01-01T00:00:00Z
2022-02-07T11:13:56Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.22/19804
url http://hdl.handle.net/10400.22/19804
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 10.1016/j.eswa.2021.115061
dc.rights.driver.fl_str_mv metadata only access
info:eu-repo/semantics/openAccess
rights_invalid_str_mv metadata only access
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv ELSEVIER
publisher.none.fl_str_mv ELSEVIER
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799131486534762496