Identifying and ranking super spreaders in real world complex networks without influence overlap
Autor(a) principal: | |
---|---|
Data de Publicação: | 2021 |
Outros Autores: | , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10400.22/19804 |
Resumo: | In the present-days complex networks modeled on real-world data contain millions of nodes and billions of links. Identifying super spreaders in such an extensive network is a challenging task. Super spreaders are the most important or influential nodes in the network that play the central role during an infection spreading or infor mation diffusion process. Depending on the application, either the most influential node needs to be identified, or a set of initial seed nodes are identified that can maximize the collective influence or the total spread in the network. Many centrality measures have been proposed to rank nodes in a complex network such as ‘degree’, ‘closeness’, ‘betweenness’, ‘coreness’ or ‘k-shell’ centrality, among others. All have some kind of inherent limi tations. Mixed degree decomposition or m-shell is an improvement over k-shell that yields better ranking. Many researchers have employed single node identification heuristics to select multiple seed nodes by considering top-k nodes from the ranked list. This approach does not results in the optimal seed nodeset due to the considerable overlap in total spreading influence. Influence overlap occurs when multiple nodes from the seed nodeset in fluence a specific node, and it is counted multiple times during total collective influence computation. In this paper, we exploit the ‘node degree’, ‘closeness’ and ‘coreness’ among the nodes and propose novel heuristic template to rank the super spreaders in a network. We employ k-shell and m-shell as a coreness measure in two variants for a comparative evaluation. We use a geodesic-based constraint (enforcing a minimum distance between seed nodes) to select an initial seed nodeset from that ranked nodes for influence maximization instead of selecting the top-k nodes naively. All models and metrics are updated to avoid overlapping influence during total spread computation. Experimental simulation with the SIR (Susceptible-Infectious-Recovered) spreading model and an evaluation with performance metrics like spreadability, monotonicity of ranking, Kendall’s rank correlation on some benchmark real-world networks establish the superiority of the proposed methods and the improved seed node selection technique. |
id |
RCAP_00dd054bad665a058e5e0628f9778f60 |
---|---|
oai_identifier_str |
oai:recipp.ipp.pt:10400.22/19804 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Identifying and ranking super spreaders in real world complex networks without influence overlapInfluential spreader identificationSpreading overlapSeed selection with minimum geodesicSIR simulationMonotonicityKendall’s rank correlationIn the present-days complex networks modeled on real-world data contain millions of nodes and billions of links. Identifying super spreaders in such an extensive network is a challenging task. Super spreaders are the most important or influential nodes in the network that play the central role during an infection spreading or infor mation diffusion process. Depending on the application, either the most influential node needs to be identified, or a set of initial seed nodes are identified that can maximize the collective influence or the total spread in the network. Many centrality measures have been proposed to rank nodes in a complex network such as ‘degree’, ‘closeness’, ‘betweenness’, ‘coreness’ or ‘k-shell’ centrality, among others. All have some kind of inherent limi tations. Mixed degree decomposition or m-shell is an improvement over k-shell that yields better ranking. Many researchers have employed single node identification heuristics to select multiple seed nodes by considering top-k nodes from the ranked list. This approach does not results in the optimal seed nodeset due to the considerable overlap in total spreading influence. Influence overlap occurs when multiple nodes from the seed nodeset in fluence a specific node, and it is counted multiple times during total collective influence computation. In this paper, we exploit the ‘node degree’, ‘closeness’ and ‘coreness’ among the nodes and propose novel heuristic template to rank the super spreaders in a network. We employ k-shell and m-shell as a coreness measure in two variants for a comparative evaluation. We use a geodesic-based constraint (enforcing a minimum distance between seed nodes) to select an initial seed nodeset from that ranked nodes for influence maximization instead of selecting the top-k nodes naively. All models and metrics are updated to avoid overlapping influence during total spread computation. Experimental simulation with the SIR (Susceptible-Infectious-Recovered) spreading model and an evaluation with performance metrics like spreadability, monotonicity of ranking, Kendall’s rank correlation on some benchmark real-world networks establish the superiority of the proposed methods and the improved seed node selection technique.ELSEVIERRepositório Científico do Instituto Politécnico do PortoMaji, GiridharDutta, AnimeshCurado Malta, MarianaSen, Soumya2022-02-07T11:13:56Z20212021-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.22/19804eng10.1016/j.eswa.2021.115061metadata only accessinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-08-16T01:49:39Zoai:recipp.ipp.pt:10400.22/19804Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T17:39:52.333725Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Identifying and ranking super spreaders in real world complex networks without influence overlap |
title |
Identifying and ranking super spreaders in real world complex networks without influence overlap |
spellingShingle |
Identifying and ranking super spreaders in real world complex networks without influence overlap Maji, Giridhar Influential spreader identification Spreading overlap Seed selection with minimum geodesic SIR simulation Monotonicity Kendall’s rank correlation |
title_short |
Identifying and ranking super spreaders in real world complex networks without influence overlap |
title_full |
Identifying and ranking super spreaders in real world complex networks without influence overlap |
title_fullStr |
Identifying and ranking super spreaders in real world complex networks without influence overlap |
title_full_unstemmed |
Identifying and ranking super spreaders in real world complex networks without influence overlap |
title_sort |
Identifying and ranking super spreaders in real world complex networks without influence overlap |
author |
Maji, Giridhar |
author_facet |
Maji, Giridhar Dutta, Animesh Curado Malta, Mariana Sen, Soumya |
author_role |
author |
author2 |
Dutta, Animesh Curado Malta, Mariana Sen, Soumya |
author2_role |
author author author |
dc.contributor.none.fl_str_mv |
Repositório Científico do Instituto Politécnico do Porto |
dc.contributor.author.fl_str_mv |
Maji, Giridhar Dutta, Animesh Curado Malta, Mariana Sen, Soumya |
dc.subject.por.fl_str_mv |
Influential spreader identification Spreading overlap Seed selection with minimum geodesic SIR simulation Monotonicity Kendall’s rank correlation |
topic |
Influential spreader identification Spreading overlap Seed selection with minimum geodesic SIR simulation Monotonicity Kendall’s rank correlation |
description |
In the present-days complex networks modeled on real-world data contain millions of nodes and billions of links. Identifying super spreaders in such an extensive network is a challenging task. Super spreaders are the most important or influential nodes in the network that play the central role during an infection spreading or infor mation diffusion process. Depending on the application, either the most influential node needs to be identified, or a set of initial seed nodes are identified that can maximize the collective influence or the total spread in the network. Many centrality measures have been proposed to rank nodes in a complex network such as ‘degree’, ‘closeness’, ‘betweenness’, ‘coreness’ or ‘k-shell’ centrality, among others. All have some kind of inherent limi tations. Mixed degree decomposition or m-shell is an improvement over k-shell that yields better ranking. Many researchers have employed single node identification heuristics to select multiple seed nodes by considering top-k nodes from the ranked list. This approach does not results in the optimal seed nodeset due to the considerable overlap in total spreading influence. Influence overlap occurs when multiple nodes from the seed nodeset in fluence a specific node, and it is counted multiple times during total collective influence computation. In this paper, we exploit the ‘node degree’, ‘closeness’ and ‘coreness’ among the nodes and propose novel heuristic template to rank the super spreaders in a network. We employ k-shell and m-shell as a coreness measure in two variants for a comparative evaluation. We use a geodesic-based constraint (enforcing a minimum distance between seed nodes) to select an initial seed nodeset from that ranked nodes for influence maximization instead of selecting the top-k nodes naively. All models and metrics are updated to avoid overlapping influence during total spread computation. Experimental simulation with the SIR (Susceptible-Infectious-Recovered) spreading model and an evaluation with performance metrics like spreadability, monotonicity of ranking, Kendall’s rank correlation on some benchmark real-world networks establish the superiority of the proposed methods and the improved seed node selection technique. |
publishDate |
2021 |
dc.date.none.fl_str_mv |
2021 2021-01-01T00:00:00Z 2022-02-07T11:13:56Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.22/19804 |
url |
http://hdl.handle.net/10400.22/19804 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
10.1016/j.eswa.2021.115061 |
dc.rights.driver.fl_str_mv |
metadata only access info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
metadata only access |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
ELSEVIER |
publisher.none.fl_str_mv |
ELSEVIER |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799131486534762496 |