DOI QR코드

DOI QR Code

Predicting the Lifespan and Retweet Times of Tweets Based on Multiple Feature Analysis

  • Bae, Yongjin (SW.Content Research Laboratory, ETRI, Department of Computer Software and Engineering, University of Science and Technology) ;
  • Ryu, Pum-Mo (SW.Content Research Laboratory, ETRI) ;
  • Kim, Hyunki (SW.Content Research Laboratory, ETRI)
  • Received : 2013.07.10
  • Accepted : 2013.12.30
  • Published : 2014.06.01

Abstract

In social network services, such as Facebook, Google+, Twitter, and certain postings attract more people than others. In this paper, we propose a novel method for predicting the lifespan and retweet times of tweets, the latter being a proxy for measuring the popularity of a tweet. We extract information from retweet graphs, such as posting times; and social, local, and content features, so as to construct prediction knowledge bases. Tweets with a similar topic, retweet pattern, and properties are sequentially extracted from the knowledge base and then used to make a prediction. To evaluate the performance of our model, we collected tweets on Twitter from June 2012 to October 2012. We compared our model with conventional models according to the prediction goal. For the lifespan prediction of a tweet, our model can reduce the time tolerance of a tweet lifespan by about four hours, compared with conventional models. In terms of prediction of the retweet times, our model achieved a significantly outstanding precision of about 50%, which is much higher than two of the conventional models showing a precision of around 30% and 20%, respectively.

Keywords

References

  1. H.J. Oh, C.K. Lee, and C.H. Lee, "Analysis of the Empirical Effects of Contextual Matching Advertising for Online News," ETRI J., vol. 34, no. 2, Apr. 2012, pp. 292-295. https://doi.org/10.4218/etrij.12.0211.0171
  2. S. Kong et al., "Predicting Lifespans of Popular Tweets in Microblog," Int. ACM SIGIR, Portland, Oregon, USA, Aug. 12-16, 2012, pp. 1129-1130.
  3. L. Hong et al., "Predicting Popular Messages in Twitter," Int. Conf. WWW, Hyderabad, India, Mar. 28 - Apr. 1, 2011, pp. 57-58.
  4. S. Unankard et al., "On the Prediction of Re-tweeting Activities in Social Networks - A Report on WISE 2012 Challenge," WISE, Paphos, Cyprus, vol. 7651, Nov. 28-30, 2012, pp. 744-754.
  5. C.K. Lee and M.G. Jang, "A Prior Model of Structural SVMs for Domain Adaptation," ETRI J., vol. 33, no. 5, Oct. 2011, pp. 712-719. https://doi.org/10.4218/etrij.11.0110.0571
  6. C.K. Lee and M.G. Jang, "A Modified Fixed-Threshold SMO for 1-Slack Structural SVMs," ETRI J., vol. 32, no. 1, Feb. 2010, pp. 120-128. https://doi.org/10.4218/etrij.10.0109.0425
  7. L. Zhang, Z. Zhang, and P. Jin, "Classification-Based Prediction on the Retweet Actions over Microblog Dataset," WISE, Paphos, Cyprus, Nov. 28-30, 2012, pp. 771-776.
  8. Y. Artzi, P. Pantel, and M. Gamon, "Predicting Responses to Microblog Posts," NAACL HLT, Montreal, Canada, June 3-8, 2012, pp. 602-606.
  9. S. Petrovic et al., "RT to Win! Predicting Message Propagation in Twitter," ICWSM, Barcelona, Catalonia, Spain, July 17-21, 2011, pp. 586-589.
  10. T.R. Zaman et al., "Predicting Information Spreading in Twitter," Workshop CSSWC NIPS, Whistler, Canada, Dec. 10, 2010.
  11. B. Suh et al., "Want to be Retweeted? Large Scale Analytics on Factors Impacting Retweet in Twitter Network," IEEE, Int. Conf. SocialCom., Minneapolis, MN, USA, Aug. 20-22, 2010, pp. 177-184.
  12. Twitter Inc. The Twitter Rules of Spam and Abuse. Accessed Oct. 20, 2012. http://support.twitter.com/articles/18311-the-twitterrules
  13. C. Castillo, M. Mendoza, and B. Poblete, "Information Credibility on Twitter," Int. Conf. WWW, Hyderabad, India, Mar. 28 - Apr. 1, 2011, pp. 675-684.
  14. W.X. Zhao et al., "Comparing Twitter and Traditional Media Using Topic Models," ECIR, Dublin, Ireland, Apr. 18-21, 2011, pp. 338-349.
  15. C. Wang and B.A. Huberman, "Long Trend Dynamics in Social Media," EPJ Data Sci., vol. 1, no. 1, May 2012.
  16. B. Krishnamurthy, P. Gill, and M. Arlitt, "A Few Chirps about Twitter," WOSN, Glasgow, Scotland, UK, Apr. 1-4, 2008, pp. 19-24.

Cited by

  1. A Multidimensional Nonnegative Matrix Factorization Model for Retweeting Behavior Prediction vol.2015, pp.None, 2014, https://doi.org/10.1155/2015/936397
  2. Classifying Biomedical Literature Providing Protein Function Evidence vol.37, pp.4, 2014, https://doi.org/10.4218/etrij.15.0114.0041
  3. A real-time method to predict social media popularity vol.28, pp.12, 2017, https://doi.org/10.1142/s0129183117501443
  4. Why cannot long-term cascade be predicted? Exploring temporal dynamics in information diffusion processes vol.8, pp.9, 2014, https://doi.org/10.1098/rsos.202245
  5. DIFCURV: A unified framework for Diffusion Curve Fitting and prediction in Online Social Networks vol.12, pp.None, 2021, https://doi.org/10.1016/j.array.2021.100100