Topic Popularity Prediction of Microblog Based on Wavelet Transformation and ARIMA
CHEN Yu-Zhong, FANG Ming-Yue, GUO Wen-Zhong, GUO Kun
1.College of Mathematics and Computer Science, Fuzhou University, Fuzhou 350108 2.Fujian Provincial Key Laboratory of Network Computing and Intelligent Information Processing,Fuzhou University, Fuzhou 350108
Abstract:The research of topic popularity prediction problem can be an important significance for maximizing the propagation effects of network advertisements as well as guiding and controlling the network consensus. Firstly, according to user relationship and topic factor, user influence is computed and topic influence is defined. Then, based on aging theory, topic energy value is calculated by considering both topic influence and the number of microblogs related to the topic, and the heat of topic is quantified. Finally, an method named topic popularity prediction of microblog based on wavelet transformation and autoregressive integrated moving average model is proposed to predict topic popularity of microblog and forecast when a topic will hit the peak. Experimental results show that the proposed method can effectively predict the topic popularity and the peak of a topic with lower residual error and omission rate.
[1] Chen L, Wang W B, Sheth A P. Are Twitter Users Equal in Predicting Elections? A Study of User Groups in Predicting 2012 U.S. Republican Presidential Primaries // Proc of the 4th International Conference on Social Informatics. Lausanne, Switzerland, 2012: 379-392 [2] Asur S, Huberman B A, Szabo G, et al. Trends in Social Media: Persistence and Decay // Proc of the 5th International AAAI Conference on Weblogs and Social Media. Barcelona, Spain, 2011: 434-437 [3] Cataldi M, di Caro L, Schifanella C. Emerging Topic Detection on Twitter Based on Temporal and Social Terms Evaluation // Proc of the 10th International Workshop on Multimedia Data Mining. Wa-shington, USA, 2010. DOI: 10.1145/1814245.1814249 [4] Lau J H, Collier N, Baldwin T. On-line Trend Analysis with Topic Models: Twitter Trends Detection Topic Model Online // Proc of the 24th International Conference on Computational Linguistics. Mumbai, India, 2012: 1519-1534 [5] Yang D Q, Xiao Y H, Wang W. Statistical Mode-Based Analysis and Prediction of Collective Attentions in Social Networks. Journal of Computer Research and Development, 2010, 47(z1): 378-384 (in Chinese) (阳德青,肖仰华,汪 卫.基于统计模型的社会网络群体关注度的分析与预测.计算机研究与发展, 2010, 47(z1): 378-384) [6] Tian Y. On Trends Analysis and Prediction Based on Micro-Blogging Platforms. Ph.D Dissertation. Wuhan, China: Wuhan University, 2012 (in Chinese) (田 野.基于微博平台的事件趋势分析及预测研究.博士学位论文.武汉:武汉大学, 2012) [7] Lu R, Zhang Y, Yang Q. Predicting News Trend in the Social Network. Journal of Chinese Information Processing, 2013, 26(6): 85-90, 97 (in Chinese) (路 荣,张 旸,杨 青.社交网络中新闻趋势的预测分析.中文信息学报, 2013, 26(6): 85-90, 97) [8] Cataldi M, di Caro L, Schifanella C. Personalized Emerging Topic Detection Based on a Term Aging Model. ACM Trans on Intelligent Systems and Technology, 2013, 5(1): 1-27 [9] Chen C C, Chen Y T, Sun Y L, et al. Life Cycle Modeling of News Events Using Aging Theory // Proc of the 14th European Conference on Machine Learning. Cavtat-Dubrovnik, Croatia, 2003: 47-59 [10] Chen C C, Chen Y T, Chen M C. An Aging Theory for Event Life-Cycle Modeling. IEEE Trans on Systems, Man and Cybernetics: Systems and Humans, 2007, 37(2): 237-248 [11] Zou B X, Liu Q. ARMA-Based Traffic Prediction and Overload Detection of Network. Journal of Computer Research and Development, 2002, 39(12): 1645-1652 (in Chinese) (邹柏贤,刘 强.基于 ARMA 模型的网络流量预测.计算机研究与发展, 2002, 39(12): 1645-1652) [12] Feng H L, Chen D, Lin Q J, et al. Multi-scale Network Traffic Prediction Using a Two-Stage Neural Network Combined Model // Proc of the International Conference on Wireless Communications, Networking and Mobile Computing. Wuhan, China, 2006. DOI: 10.1109/WiCOM.2006.377 [13] Guresen E, Kayakutlu G, Daim T U. Using Artificial Neural Network Models in Stock Market Index Prediction. Expert Systems with Applications, 2011, 38(8): 10389-10397 [14] Hong F, Wu Z M. Multiscale Network Traffic Prediction Model Based on Wavelet. Chinese Journal of Computers, 2006, 29(1): 166-170 (in Chinese) (洪 飞,吴志美.基于小波的多尺度网络流量预测模型.计算机 学报, 2006, 29(1): 166-170) [15] Chen X T, Liu J X. Network Traffic Prediction Based on Wavelet Transformation and FARIMA. Journal on Communications, 2011, 32(4): 153-157, 165 (in Chinese) (陈晓天,刘静娴.改进的基于小波变换和FARIMA模型的网络流量预测算法.通信学报, 2011, 32(4): 153-157, 165) [16] Kumar V. Algorithms for Constraint-Satisfaction Problems: A Survey. AI Magazine, 1992, 13(1): 32-44 [17] Weng J S, Lee B S. Event Detection in Twitter // Proc of the 5th International AIII Conference on Weblogs and Social Media. Barcelona, Spain, 2011: 401-408 [18] Kendall M G. A New Measure of Rank Correlation. Biometrika, 1938, 30(1/2): 81-93