关键词: 微博网络/
用户行为/
信息扩散
English Abstract
Modeling information diffusion on microblog networks based on users' behaviors
Liu Hong-Li,Huang Ya-Li,
Luo Chun-Hai,
Hu Hai-Bo
1.Department of Management Science and Engineering, East China University of Science and Technology, Shanghai 200237, China
Fund Project:Project supported by the National Natural Science Foundation of China (Grant Nos. 61473119, 61104139), and the Fundamental Research Funds for the Central Universities, China (Grant No. WN1524301).Received Date:13 March 2016
Accepted Date:03 May 2016
Published Online:05 August 2016
Abstract:Online social networks, such as Facebook, Twitter and YouTube, play a vital role in information sharing and diffusion, and recently many dynamics models on social networks have been proposed to model information diffusion. However most models are theoretical, their parameters do not come from realistic data and their validity and reliability have not been evaluated empirically. In the paper we first analyze the users' behaviors of reading and reposting microblog in Sina Weibo, a Twitter-like website in China, and find that users' number of fans, the average reposted number of users' microblog, the intensity of users' interaction and the similarity between microblog topics and users' topic interests can significantly influence reposting behavior. Then we propose an information diffusion model Susceptible-Infected-Recovered based on Users' Behaviors (SIRUB) on microblog networks, compute the users' probability of reading microblog in the model according to the probability of their logging on microblog in a day, and obtain the reposting probability utilizing the logistic regression which considers 16 possible factors influencing users' reposting behavior. The 16 factors can be divided into three categories: the characteristics of microblog publishers, microblog text features and social relationship characteristics. We utilize the beginning 2/3 microblog data to obtain model parameters and logistic regression coefficients, and the remaining 1/3 data to examine the validity of the model. The experiments on Sina Weibo network show that the model can predict users' reposting behavior accurately only when it considers both reading and reposting probabilities. F-score which considers precision and recall is used to assess prediction effect of the model. The highest F-score for the prediction of SIRUB model on users' reposting behavior is 0.228 which is much larger than those of classical Susceptible-Infected-Recovered (SIR, F-score=0.039) and Susceptible-Infected-Contacted-Recovered (SICR, F-score=0.037) models. The prediction on the spreading scope of microblog for SIR and SICR models is related with users' number of fans while for SIRUB model not. For SIRUB model the mean and standard deviation of the errors of prediction on spreading scope are smaller than those of SIR and SICR models. These results indicate that users' behaviors of reading and reposting microblog should be appropriately taken in account when modeling information diffusion on microblog networks, and that, in general, the prediction performance of the data-driven SIRUB model proposed in the paper is better than those of SIR and SICR models regardless of the prediction of users' reposting behavior or diffusion scope of microblog.
Keywords: microblog network/
user behavior/
information diffusion