Improved Tweet Bot Detection Using Geo-Location and Device Information

Lee, Al-Chan;Seo, Go-Eun;Shin, Won-Yong;Kim, Donggeon;Cho, Jaehee;

doi:10.6109/jkiice.2015.19.12.2878

Journal of the Korea Institute of Information and Communication Engineering (한국정보통신학회논문지)

Volume 19 Issue 12
/
Pages.2878-2884
/
2015
/
2234-4772(pISSN)
/
2288-4165(eISSN)

The Korea Institute of Information and Commucation Engineering (한국정보통신학회)

DOI QR Code

Improved Tweet Bot Detection Using Geo-Location and Device Information

지리적 공간과 장치 정보를 사용한 개선된 트윗 봇 검출

Lee, Al-Chan (Department of Mobile Systems Engineering, Dankook University) ;
Seo, Go-Eun (Department of Mobile Systems Engineering, Dankook University) ;
Shin, Won-Yong (Department of Computer Science and Engineering, Dankook University) ;
Kim, Donggeon (Department of Statistics and Information Science, Dongduk Women's University) ;
Cho, Jaehee (Department of Management, Kwangwoon University)

Received : 2015.08.10
Accepted : 2015.09.14
Published : 2015.12.31

https://doi.org/10.6109/jkiice.2015.19.12.2878 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

Twitter, one of online social network services, is one of the most popular micro-blogs, which generates a large number of automated programs, known as tweet bots because of the open structure of Twitter. While these tweet bots are categorized to legitimate bots and malicious bots, it is important to detect tweet bots since malicious bots spread spam and malicious contents to human users. In the conventional work, temporal information was utilized for the classficiation of human and bot. In this paper, by utilizing geo-tagged tweets that provide high-precision location information of users, we first identify both Twitter users' exact location. Then, we propose a new tweet bot detection algorithm by using both an entropy based on geographic variable of each user and device information of each user. As a main result, the proposed algorithm shows superior bot detection and false alarm probabilities over the conventional result which only uses temporal information.

온라인 소셜 네트워크 서비스 중 하나인 트위터는 가장 보편적으로 사용되는 마이크로 블로그인데, 트위터의 개방적 구조로 인해 자동화 프로그램인 트윗 봇이 많이 생성되고 있다. 이 트윗 봇은 적법한 봇과 악성 봇으로 분류되는데, 이 중 악성 봇은 일반 사용자들에게 많은 양의 스팸 정보나 유해한 컨텐츠를 배포하기 때문에 트윗 봇을 검출하는 작업은 반드시 필요하다. 기존 연구에서는 시간적 정보를 활용하여 사람과 트윗 봇을 분류하였다. 본 논문에서는 먼저 사용자들의 고 정밀 위치 정보를 알려주는 공간 태그된 트윗 정보를 활용하여 트위터 사용자들의 정확한 위치를 알아낸다. 그리고, 각 사용자의 공간 변수에 대한 엔트로피 값 및 사용자의 장치 정보를 사용하여 새로운 봇 검출 알고리즘을 제안한다. 주요 결과로써, 시간 정보만을 이용한 기존 연구결과보다 각 신뢰도별 봇 검출 확률 및 거짓 경보 확률이 모두 우수하게 나타난다.

Keywords

References

C. Wilson, B. Boe, A.Sala, K. P. N. Puttaswamy, and B. Y. Zhao, "User interaction in social networks and their implication," in Proceedings of the 4th ACM European Conference on Computer Systems (EuroSys '09), Nuremberg, Germany, pp. 205-218, Mar./Apr. 2009.
H. Kwak, C. Lee, H. Park, and S. Moon, "What is Twitter, a social network or a news media?," in Proceedings of the 19th International World Wide Web Conference (WWW2010), Raleigh, NC USA, pp. 591-600, Apr. 2010.
M. C. Gonzalez, C. A. Hidalgo, and A. L. Batabasi, "Understanding individual human mobility patterns," Nature, vol. 453, pp. 591-600, Apr. 2010.
D. Wang, D. Pedreschi, C. Song, F. Giannotti, and A.-L. Barabasi, "Human mobility, social ties, and link prediction," in Proceedings of the 17th ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining (KDD2011), San Diego, CA USA, pp.1100-1108, Aug. 2011.
B. Hawelka, I. Sitko, E. Beinat, S. Sobolevsky, P. Kazakopoulos, and C. Ratti, "Geo-located Twitter as proxy for global mobility patterns," Cartography and Geographic Information Science, vol. 41, no. 3, pp. 260-271, May 2014. https://doi.org/10.1080/15230406.2014.890072
R. Jurdak, K. Zhao, J. Liu, M. AbouJaoude, M. Cameron, and D. Newth, "Understanding human mobility from Twitter," PLOS ONE, vol. 10, no. 7, pp. 1-16, July 2015.
W.-Y. Shin, B. C. Singh, J. Cho, and A. M. Everett, "A new understanding of friendships in space: Complex networks meet Twitter," Journal of Information Science, vol. 41, no. 6, pp. 751-564, Dec. 2015. https://doi.org/10.1177/0165551515600136
S. Y. Jeon, A. C. Lee, G. E. Seo, and W. Y. Shin, "Relationship between tweet frequency and user velocity on Twitter," Journal of the Korea Institute of Information and Communication Engineering, vol. 19, no. 6, pp. 1380-1386, Jun. 2015. https://doi.org/10.6109/jkiice.2015.19.6.1380
Z. Chu, S. Gianvecchio, H. Wang, and S. Jajodia, "Detecting automation of Twitter accounts: Are you a human, bot, or cyborg?," IEEE Transactions on Dependable and Secure Computing, vol. 9, no.6, pp. 811-824, Dec. 2012. https://doi.org/10.1109/TDSC.2012.75

Cited by

그래프 속성을 이용한 온라인 소셜 네트워크 스팸 탐지 동향 분석 vol.24, pp.5, 2015, https://doi.org/10.6109/jkiice.2020.24.5.567

Journal of the Korea Institute of Information and Communication Engineering (한국정보통신학회논문지)

Improved Tweet Bot Detection Using Geo-Location and Device Information

지리적 공간과 장치 정보를 사용한 개선된 트윗 봇 검출

Abstract

Keywords

References

Cited by

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)