DOI QR코드

DOI QR Code

A Malicious Traffic Detection Method Using X-means Clustering

X-means 클러스터링을 이용한 악성 트래픽 탐지 방법

  • 한명지 (서울대학교 컴퓨터공학부) ;
  • 임지혁 (서울대학교 컴퓨터공학부) ;
  • 최준용 (서울대학교 컴퓨터공학부) ;
  • 김현준 (서울대학교 컴퓨터공학부) ;
  • 서정주 (서울대학교 컴퓨터공학부) ;
  • 유철 (서울대학교 컴퓨터공학부) ;
  • 김성렬 (건국대학교 인터넷미디어공학부) ;
  • 박근수 (서울대학교 컴퓨터공학부)
  • Received : 2014.02.28
  • Accepted : 2014.05.30
  • Published : 2014.09.15

Abstract

Malicious traffic, such as DDoS attack and botnet communications, refers to traffic that is generated for the purpose of disturbing internet networks or harming certain networks, servers, or hosts. As malicious traffic has been constantly evolving in terms of both quality and quantity, there have been many researches fighting against it. In this paper, we propose an effective malicious traffic detection method that exploits the X-means clustering algorithm. We also suggest how to analyze statistical characteristics of malicious traffic and to define metrics that are used when clustering. Finally, we verify effectiveness of our method by experiments with two released traffic data.

악성 트래픽은 디도스 공격, 봇넷 통신 등의 인터넷 망을 교란시키거나 특정 네트워크, 서버, 혹은 호스트에 피해를 끼칠 의도를 가지고 발생시키는 트래픽을 지칭한다. 이와 같은 악성 트래픽은 인터넷이 발생한 이래 꾸준히 양과 질에서 진화하고 있고 이에 대한 대응 연구도 계속되고 있다. 이 논문에서는 악성 트래픽을 기존 X-means 클러스터링 알고리즘을 적용하여 효과적으로 탐지하는 방법을 제시하였다. 특히 악성 트래픽의 통계적 특징을 분석하고 클러스터링을 위한 메트릭을 정의하는 방법을 체계적으로 제시하였다. 또한 두 개의 공개된 트래픽 데이터에 대한 실험을 통해 실효성을 검증하였다.

Keywords

Acknowledgement

Supported by : 한국연구재단

References

  1. Y. Xie, S. Z. Yu, "Monitoring the application-layer DDoS attacks for popular websites," IEEE/ACM Transactions on Networking, Vol. 17, No. 1, pp. 15-25, 2009. https://doi.org/10.1109/TNET.2008.925628
  2. G. Gu, R. Perdisci, J. Zhang, W. Lee, "BotMiner: Clustering Analysis of Network Traffic for Protocoland Structure-Independent Botnet Detection," SS'08 Proceedings of the 17th conference on Security symposium, pp. 139-154, 2008.
  3. J. Zhang, R. Perdisci, W. Lee, U. Sarfraz, X. Luo, "Detecting stealthy P2P botnets using statistical traffic fingerprints," Dependable Systems & Networks (DSN), 2011 IEEE/IFIP 41st International Conference on, pp. 121-132, 2011.
  4. H. R. Zeidanloo, M. J. Z. Shooshtari, P. V. Amoli, M. Safari, M. Zamani, "A taxonomy of Botnet detection techniques," Computer Science and Information Technology (ICCSIT), 2010 3rd IEEE International Conference on, Vol. 2, pp. 158-162, 2010.
  5. T. Thapngam, S. Yu, W. Zhou, G. Beliakov, "Discriminating DDoS attack traffic from flash crowd through packet arrival patterns," Computer Communications Workshops (INFOCOM WKSHPS), 2011 IEEE Conference on, pp. 952-957, 2011.
  6. J. Mirkovic, P. Reiher, "A taxonomy of DDoS attack and DDoS defense mechanisms," ACM SIGCOMM Computer Communication Review, Vol. 34, No. 2, pp. 39-53, 2004.
  7. K. Lee, J. Kim, K. H. Kwon, Y. Han, S. Kim, "DDoS attack detection method using cluster analysis," Expert Systems with Applications, Vol. 34, No. 3, pp. 1659-1665, 2008. https://doi.org/10.1016/j.eswa.2007.01.040
  8. S. Saad, I. Traore, A. Ghorbani, B. Sayed, D. Zhao, L. Wei, J. Felix, P. Hakimian, "Detecting P2P botnets through network behavior analysis and machine learning," Privacy, Security and Trust (PST), 2011 Ninth Annual International Conference on, pp.174-180, 2011.
  9. H. Choi, H. Lee, "Identifying botnets by capturing group activities in DNS traffic," Computer Networks, Vol. 56, No. 1, pp. 20-33, 2012. https://doi.org/10.1016/j.comnet.2011.07.018
  10. S. Stover, D. Dittrich, J. Hernandez, S. Dietrich, "Analysis of the Storm and Nugache trojans: P2P is here," ;login:, Vol. 32, No. 6, pp. 18-27, 2007.
  11. G. Sinclair, C. Nunnery, B. H. Kang, "The waledac protocol: The how and why," Malicious and Unwanted Software (MALWARE), 2009 4th International Conference on, pp. 69-77, 2009.
  12. J. A. Hartigan, M. A. Wong, "A K-Means Clustering Algorithm," Journal of the Royal Statistical Society, Series C (Applied Statistics), Vol. 28, pp. 100-108, 1979.
  13. S. C. Johnson, "Hierarchical clustering schemes," Psychometrika, Vol. 32, No. 3, pp. 241-254, 1967. https://doi.org/10.1007/BF02289588
  14. G. W. Milligan, M. C. Cooper, "An examination of procedures for determining the number of clusters in a data set," Psychometrika, Vol. 50, No. 2, pp. 159-179, 1985. https://doi.org/10.1007/BF02294245
  15. D. Pelleg, A. Moore, "X-means: Extending K-means with Efficient Estimation of the Number of Clusters," In Proceedings of the 17th International Conf. on Machine Learning, pp. 727-734, 2000.
  16. DARPA Dataset [Online]. Available: http://www.ll.mit.edu/mission/communications/cyber/CSTcorpora/ideval/data/2000data.html
  17. ISOT Dataset [Online]. Available: http://www.uvic.ca/engineering/ece/isot/datasets/index.php
  18. T. Karagiannis, A. Broido, M. Faloutsos, K. claffy, "Transport layer identification of P2P traffic," Proceedings of the 4th ACM SIGCOMM conference on Internet measurement, pp. 121-134, 2004.