Vital Node Identification in Complex Networks Using a Machine Learning-Based Approach

20 Pages Posted: 8 Mar 2022

See all articles by Ahmad Asgharian Rezaei

Ahmad Asgharian Rezaei

Royal Melbourne Institute of Technolog (RMIT University)

Justin Munoz

Royal Melbourne Institute of Technolog (RMIT University)

Mahdi Jalili

Royal Melbourne Institute of Technolog (RMIT University)

Hamid Khayyam

Royal Melbourne Institute of Technolog (RMIT University)

Abstract

Vital node identification is the problem of finding nodes of highest importance in complex networks. This problem has crucial applications in various contexts such as viral marketing or controlling the propagation of virus or rumours in real-world networks. Existing approaches for vital node identification mainly focus on capturing the importance of a node through a mathematical expression which directly relates structural properties of the node to its vitality. Although these heuristic approaches have achieved good performance in practice, they have weak adaptability, and their performance is limited to specific settings and certain dynamics. Inspired by the power of machine learning models for efficiently capturing different types of patterns and relations, we propose a machine learning-based, data driven approach for vital node identification. The main idea is to train the model with a small portion of the graph, say 0.5% of the nodes, and do the prediction on the rest of the nodes. The ground-truth vitality for the train data is computed by simulating the SIR diffusion method starting from the train nodes. We use collective feature engineering where each node in the network is represented by incorporating elements of its connectivity, degree and extended coreness. Several machine learning models are trained on the node representations, but the best results are achieved by a Support Vector Regression machine with RBF kernel. The empirical results confirms that the proposed model outperforms state-of-the-art models on a selection of datasets, while it also shows more adaptability to changes in the dynamics parameters.

Keywords: Influential Node RankingComplex NetworksMachine LearningVital Node IdentificationSupport Vector MachinesInfluence MaximizationEpidemic Analysis

Suggested Citation

Asgharian Rezaei, Ahmad and Munoz, Justin and Jalili, Mahdi and Khayyam, Hamid, Vital Node Identification in Complex Networks Using a Machine Learning-Based Approach. Available at SSRN: https://ssrn.com/abstract=4052361 or http://dx.doi.org/10.2139/ssrn.4052361

Ahmad Asgharian Rezaei (Contact Author)

Royal Melbourne Institute of Technolog (RMIT University) ( email )

124 La Trobe Street
Melbourne, 3000
Australia

Justin Munoz

Royal Melbourne Institute of Technolog (RMIT University) ( email )

124 La Trobe Street
Melbourne, 3000
Australia

Mahdi Jalili

Royal Melbourne Institute of Technolog (RMIT University) ( email )

Hamid Khayyam

Royal Melbourne Institute of Technolog (RMIT University) ( email )

124 La Trobe Street
Melbourne, 3000
Australia

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
78
Abstract Views
345
Rank
673,892
PlumX Metrics