DOI QR코드

DOI QR Code

Deep Learning Based Rumor Detection for Arabic Micro-Text

  • Alharbi, Shada (Faculty of Computer Science and Information Technology) ;
  • Alyoubi, Khaled (Faculty of Computer Science and Information Technology) ;
  • Alotaibi, Fahd (Faculty of Computer Science and Information Technology)
  • 투고 : 2021.11.05
  • 발행 : 2021.11.30

초록

Nowadays microblogs have become the most popular platforms to obtain and spread information. Twitter is one of the most used platforms to share everyday life event. However, rumors and misinformation on Arabic social media platforms has become pervasive which can create inestimable harm to society. Therefore, it is imperative to tackle and study this issue to distinguish the verified information from the unverified ones. There is an increasing interest in rumor detection on microblogs recently, however, it is mostly applied on English language while the work on Arabic language is still ongoing research topic and need more efforts. In this paper, we propose a combined Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) to detect rumors on Twitter dataset. Various experiments were conducted to choose the best hyper-parameters tuning to achieve the best results. Moreover, different neural network models are used to evaluate performance and compare results. Experiments show that the CNN-LSTM model achieved the best accuracy 0.95 and an F1-score of 0.94 which outperform the state-of-the-art methods.

키워드

참고문헌

  1. E. Shearer and A. Mitchell, "Platforms in 2020 News Use Across Social Media," Pew Research Center, 2021. .
  2. S. F. Sabbeh and S. Y. Baatwah, "Arabic news credibility on twitter: An enhanced model using hybrid features," J. Theor. Appl. Inf. Technol., vol. 96, no. 8, 2018.
  3. R. H. Knapp, "A psychology of rumor," Public Opin. Q., vol. 8, no. 1, 1944, doi: 10.1086/265665.
  4. P. Meel and D. K. Vishwakarma, "Fake news, rumor, information pollution in social media and web: A contemporary survey of state-of-the-arts, challenges and opportunities," Expert Systems with Applications, vol. 153. 2020, doi: 10.1016/j.eswa.2019.112986.
  5. E. Seo, P. Mohapatra, and T. Abdelzaher, "Identifying rumors and their sources in social networks," in Ground/Air Multisensor Interoperability, Integration, and Networking for Persistent ISR III, 2012, vol. 8389, doi: 10.1117/12.919823.
  6. "scholar (5)." .
  7. A. Y. M. Floos, "Arabic Rumours Identification By Measuring The Credibility Of Arabic Tweet Content," Int. J. Knowl. Soc. Res., vol. 7, no. 2, 2016, doi: 10.4018/ijksr.2016040105.
  8. S. M. Alzanin and A. M. Azmi, "Rumor detection in Arabic tweets using semi-supervised and unsupervised expectation-maximization," Knowledge-Based Syst., vol. 185, 2019, doi: 10.1016/j.knosys.2019.104945.
  9. H. K. Thakur, A. Gupta, A. Bhardwaj, and D. Verma, "Rumor Detection on Twitter Using a Supervised Machine Learning Framework," Int. J. Inf. Retr. Res., vol. 8, no. 3, 2018, doi: 10.4018/ijirr.2018070101.
  10. R. K. Kaliyar, A. Goswami, and P. Narang, "A hybrid model for effective fake news detection with a novel COVID-19 dataset," in ICAART 2021 - Proceedings of the 13th International Conference on Agents and Artificial Intelligence, 2021, vol. 2, doi: 10.5220/0010316010661072.
  11. X. Chen, L. Ke, Z. Lu, H. Su, and H. Wang, "A novel hybrid model for cantonese rumor detection on twitter," Appl. Sci., vol. 10, no. 20, 2020, doi: 10.3390/app10207093.
  12. L. Zhang, J. Li, B. Zhou, and Y. Jia, "Rumor Detection Based on SAGNN: Simplified Aggregation Graph Neural Networks," Mach. Learn. Knowl. Extr., vol. 3, no. 1, 2021, doi: 10.3390/make3010005.
  13. T. Bian et al., "Rumor detection on social media with bidirectional graph convolutional networks," 2020, doi: 10.1609/aaai.v34i01.5393.
  14. M. Shi, K. Wang, and C. Li, "A C-LSTM with word embedding model for news text classification," 2019, doi: 10.1109/ICIS46139.2019.8940289.
  15. M. Z. Asghar, A. Habib, A. Habib, A. Khan, R. Ali, and A. Khattak, "Exploring deep neural networks for rumor detection," J. Ambient Intell. Humaniz. Comput., vol. 12, no. 4, 2021, doi: 10.1007/s12652-019-01527-4.
  16. Q. Li, Q. Zhang, and L. Si, "Rumor detection by exploiting user credibility information, attention and multi-task learning," 2020, doi: 10.18653/v1/p19-1113.
  17. A. Alsaeedi and M. Al-Sarem, "Detecting Rumors on Social Media Based on a CNN Deep Learning Technique," Arab. J. Sci. Eng., vol. 45, no. 12, 2020, doi: 10.1007/s13369-020-04839-2.
  18. N. Rani, P. Das, and A. K. Bhardwaj, "A hybrid deep learning model based on CNN-BiLSTM for rumor detection," 2021, doi: 10.1109/icces51350.2021.9489214.
  19. A. P. Ben Veyseh, M. T. Thai, T. H. Nguyen, and D. Dou, "Rumor detection in social networks via deep contextual modeling," 2019, doi: 10.1145/3341161.3342896.
  20. M. Al-Sarem, W. Boulila, M. Al-Harby, J. Qadir, and A. Alsaeedi, "Deep learning-based rumor detection on microblogging platforms: A systematic review," IEEE Access, vol. 7. 2019, doi: 10.1109/ACCESS.2019.2947855.
  21. K. W. CHURCH, "Word2Vec," Nat. Lang. Eng., vol. 23, no. 1, 2017, doi: 10.1017/s1351324916000334.
  22. T. Mikolov, K. Chen, G. Corrado, and J. Dean, "Efficient estimation of word representations in vector space," 2013.
  23. A. B. Soliman, K. Eissa, and S. R. El-Beltagy, "AraVec: A set of Arabic Word Embedding Models for use in Arabic NLP," in Procedia Computer Science, 2017, vol. 117, doi: 10.1016/j.procs.2017.10.117.
  24. C. Manning, M. Surdeanu, J. Bauer, J. Finkel, S. Bethard, and D. McClosky, "The Stanford CoreNLP Natural Language Processing Toolkit," 2015, doi: 10.3115/v1/p14-5010.
  25. A. Aizawa, "An information-theoretic perspective of tf-idf measures," Inf. Process. Manag., vol. 39, no. 1, 2003, doi: 10.1016/S0306-4573(02)00021-3.
  26. S. Hochreiter and J. Schmidhuber, "Long Short Term Memory. Neural Computation," Neural Comput., vol. 9, no. 8, 1997.
  27. J. Xie, B. Chen, X. Gu, F. Liang, and X. Xu, "Self-Attention-Based BiLSTM Model for Short Text Fine-Grained Sentiment Classification," IEEE Access, vol. 7, 2019, doi: 10.1109/ACCESS.2019.2957510.
  28. X. Wang, W. Jiang, and Z. Luo, "Combination of convolutional and recurrent neural network for sentiment analysis of short texts," 2016.
  29. "Scholar (7)." .
  30. H. Mohaouchane, A. Mourhir, and N. S. Nikolov, "Detecting Offensive Language on Arabic Social Media Using Deep Learning," 2019, doi: 10.1109/SNAMS.2019.8931839.
  31. N. Kalchbrenner, E. Grefenstette, and P. Blunsom, "A convolutional neural network for modelling sentences," in 52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014 - Proceedings of the Conference, 2014, vol. 1, doi: 10.3115/v1/p14-1062.
  32. N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, "Dropout: A simple way to prevent neural networks from overfitting," J. Mach. Learn. Res., vol. 15, 2014.
  33. A. S. Walter et al., "Scholar (8)," Convergence in the information industries. Telecommunications, broadcasting and data processing 1981-1996, vol. 3, no. 1. pp. 125-150, 2004, [Online]. Available: http://www.sciencedirect.com/science/article/pii/S0160738315000444%250Ahttp://eprints.lancs.ac.uk/48376/%255Cnhttp://dx.doi.org/10.1002/zamm.19630430112%250Ahttp://www.sciencedirect.com/science/article/pii/S0160738315000444%250Ahttp://eprints.lancs.ac.uk/48376/%255.
  34. S. Santurkar, D. Tsipras, A. Ilyas, and A. Madry, "How does batch normalization help optimization?," in Advances in Neural Information Processing Systems, 2018, vol. 2018-December.
  35. J. Han and C. Moraga, "The influence of the sigmoid function parameters on the speed of backpropagation learning," in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 1995, vol. 930, doi: 10.1007/3-540-59497-3_175.
  36. Y. Ren, "Python Machine Learning : Machine Learning and Deep Learning With Python ," Int. J. Knowledge-Based Organ., vol. 11, no. 1, 2021.
  37. "scholar (9)." .
  38. T. Chen and C. Guestrin, "XGBoost: A scalable tree boosting system," Proc. ACM SIGKDD Int. Conf. Knowl. Discov. Data Min., vol. 13-17-Augu, 2016.
  39. V. M. Patro and M. Ranjan Patra, "Augmenting Weighted Average with Confusion Matrix to Enhance Classification Accuracy," Trans. Mach. Learn. Artif. Intell., vol. 2, no. 4, 2014, doi: 10.14738/tmlai.24.328.