Design of a Disaster Big Data Platform for Collecting and Analyzing Social Media

소셜미디어 수집과 분석을 위한 재난 빅 데이터 플랫폼의 설계

  • Nguyen, Van-Quyet (Dept. of Electronics and Computer Engineering, Chonnam National University) ;
  • Nguyen, Sinh-Ngoc (Dept. of Electronics and Computer Engineering, Chonnam National University) ;
  • Nguyen, Giang-Truong (Dept. of Electronics and Computer Engineering, Chonnam National University) ;
  • Kim, Kyungbaek (Dept. of Electronics and Computer Engineering, Chonnam National University)
  • Published : 2017.04.27

Abstract

Recently, emergencies during disasters have been handled well thanks to the early transmission of disaster-related notifications on social media networks (e.g., Twitter or Facebook). Intuitively, with their characteristics (e.g., real-time delivery, mobility) and large communities whose users can be regarded as volunteers, social networks have proved to play a crucial role in disaster response. However, the amount of data transmitted during disasters is an obstacle to filtering informative messages, because the messages are diverse, voluminous, and very noisy. This large volume of data can be seen as Social Big Data (SBD). In this paper, we propose a big data platform for collecting and analyzing disaster data from SBD. First, we design a collecting module that can rapidly extract disaster information from Twitter using big data frameworks that support streaming data on distributed systems, such as Kafka and Spark. Second, we develop an analyzing module that learns from SBD to distinguish useful information from irrelevant information. Finally, we design a real-time visualization on a web interface for displaying the results of the analysis phase. To show the viability of our platform, we conducted experiments on the collecting and analyzing phases over 10 days, for both real-time and historical tweets about disasters that happened in South Korea. The results show that our big data platform can be applied to disaster information systems by providing a large amount of relevant data, drawn from 21,000 collected tweets, which can be used for inferring affected regions and victims in disaster situations.
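As a rough illustration of the analysis step (learning to separate useful tweets from irrelevant ones), the sketch below trains a small multinomial naive Bayes filter in plain Python. The abstract does not name the learning algorithm used in the platform, so naive Bayes is an assumption made here for illustration only. The class `NaiveBayesFilter`, the helper `tokenize`, the labels, and the training sentences are all hypothetical and are not taken from the paper's data.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a tweet and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesFilter:
    """Multinomial naive Bayes with add-one smoothing (illustrative only)."""

    def __init__(self):
        self.word_counts = {}        # label -> Counter of token counts
        self.doc_counts = Counter()  # label -> number of training tweets
        self.vocab = set()           # all tokens seen during training

    def fit(self, tweets, labels):
        for text, label in zip(tweets, labels):
            tokens = tokenize(text)
            self.word_counts.setdefault(label, Counter()).update(tokens)
            self.doc_counts[label] += 1
            self.vocab.update(tokens)
        return self

    def predict(self, text):
        tokens = tokenize(text)
        total_docs = sum(self.doc_counts.values())
        best_label, best_score = None, -math.inf
        for label, counts in self.word_counts.items():
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(self.doc_counts[label] / total_docs)
            denom = sum(counts.values()) + len(self.vocab)
            for tok in tokens:
                score += math.log((counts[tok] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical, hand-written training examples (not data from the paper).
TRAIN = [
    ("Flood warning issued, roads closed near the river", "relevant"),
    ("Earthquake felt downtown, buildings evacuated", "relevant"),
    ("Typhoon damage reported, many houses flooded", "relevant"),
    ("This new song is a total earthquake of emotions lol", "irrelevant"),
    ("Great coffee and a good movie tonight", "irrelevant"),
    ("My inbox is flooded with spam again", "irrelevant"),
]

model = NaiveBayesFilter().fit(*zip(*TRAIN))
```

Calling `model.predict(...)` on a new tweet returns whichever of the two labels scores higher. In a streaming setting, a filter of this kind would presumably be applied to each incoming message after collection, although the abstract does not describe how the platform wires the two modules together.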

Keywords