DOI QR코드

DOI QR Code

Using Skip Lists for Managing Replying Comments Posted on Internet Discussion Boards

스킵리스트를 이용한 인터넷 토론 게시판 댓글 관리

  • 이윤정 (부산대학교 U-Port 정보기술 사업단) ;
  • 김은경 (부산대학교 컴퓨터공학과) ;
  • 조환규 (부산대학교 컴퓨터공학과) ;
  • 우균 (부산대학교 컴퓨터공학과)
  • Received : 2010.06.03
  • Accepted : 2010.07.20
  • Published : 2010.08.28

Abstract

In recent years, the number of users who are actively express their opinions about Internet articles is more and more growing up, as the use of cyber community such as weblog or Internet discussion board increases. In fact, it is not difficult to find an article with hundreds of comments in famous Internet discussion boards. Most of the weblogs or Internet discussion boards present comments in the form of list and do not yet support even the basic operation such as searching comments. In this paper, we analysed large sets of comments in Internet discussion board named AGORA. It was found that from the result that the distribution of comment writers follows power-law. So we suppose a new search structure of comments using skip lists. The main idea of our approach is to reflect the probabilistic distribution properties of the commenters following the power-law to the data structure. Our empirical results show that the proposed method performs more efficient in searching the nodes with fewer number of comparison operations than logN, which is the theoretical time complexity of general indexed structure such as B-trees or typical skip lists.

최근 웹 블로그나 인터넷 게시판과 같은 가상 커뮤니티가 활발히 사용됨에 따라 댓글을 통해 자신의 의견을 적극적으로 나타내고자 하는 이용자들이 점점 증가하고 있는 추세다. 실제로 댓글 활동이 활발한 인터넷 토론 게시판에서 수천 개의 댓글이 달린 게시물도 어렵지 않게 찾아볼 수 있다. 대부분의 웹 블로그나 인터넷 게시판에서는 댓글이 작성된 시간에 따라 목록 형태로만 제공되고 있을 뿐 기본적인 검색 기능조차도 지원되지 않고 있다. 본 논문에서는 인터넷 토론 게시판의 댓글 분석을 통해 댓글 작성자의 분포가 거듭제곱 법칙을 따르는 것을 밝혔다. 그리고 이러한 댓글의 통계적 특성을 반영하는 스킵리스트 기반의 댓글 검색 구조를 제안한다. 제안 방법의 주안점 댓글 작성자들의 확률적 특성을 데이터 구조에 반영하는 것이다. 실험을 통해 제안 방법이 B-트리나 일반적인 스킵리스트의 이론적인 계산 복잡도인 logN에 비해 더 빠른 검색을 수행할 수 있음을 보인다.

Keywords

References

  1. David Sifry, “The State of the Live Web,” a web article, April 2007. http://www.sifry.com/alerts/archives/000493.html
  2. M. Gumbrecht, “Blogs as protected space,” Presented at the Workshop on the Weblogging Ecosystem: Aggregation, Analysis, and Dynamics, May 2004.
  3. E. Trevino, “Blogger motivations: Power, pull, and positive feedback,” Presented at Internet Research 6.0, 2005.
  4. J. Sim, H. Cho, H. Yang, I. Ahn, and E. Na, The state of Netizen Internet Utilization on Web 2.0, National Internet Development Agency of Korea, 2006.
  5. G. Mishne and N. Glance, “Leave a reply: An analysis of weblog comments,” Presented at 3rd Annual Workshop on the Weblogging Ecosystem, 2006.
  6. Daum 아고라, http://agora.media.daum.net/
  7. S. Herring, L. Scheidt, S. Bonus, and E. Wright. “Bridging the gap: A genre analysis of weblogs,” In Proc. of the 37th Annual Hawaii International Conference, 2004. https://doi.org/10.1109/HICSS.2004.1265271
  8. J. Leskovec, M. McGlohon, C. Faloutsos, N. Glance, and M. Hurst, “Patterns of Cascading Behavior in Large Blog Graphs,” In Proc. SDM2007, pp.551-556, 2007.
  9. J. Y. Delort, “Identifying commented passages of documents using implicit hyperlinks,” In Proc. of HYPERTEXT'06, pp.89–98, 2006. https://doi.org/10.1145/1149941.1149960
  10. Meishan Hu, Aixin Sun, and E. P. Lim, “Comments-oriented blog summarization by sentence extraction,” In Proc. of CIKM'07, pp.901-904, 2007. https://doi.org/10.1145/1321440.1321571
  11. 이윤정, 지정훈, 우균, 조환규, “인터넷 게시물의 댓글 분석 및 시각화”, 한국콘텐츠학회논문지, 제 9권, 제7호, pp.45-56, 2009.
  12. M. Newman. “Power laws, Pareto distributions and Zipf's law,” Contemporary physics, vol.46, No.5, pp.323-351, 2005. https://doi.org/10.1080/00107510500052444
  13. D. Wang and J. Liu, “Peer-to-Peer asynchronous video streaming using Skip List,” In Proc. of ICME2006, pp.1397-1400, 2006. https://doi.org/10.1109/ICME.2006.262800
  14. T. Ge and S. Zdonik, “A skip-list approach for efficiently processing forecasting queries,” In Proc. VLDB Endow, Vol.1, No.1, pp.984-995, 2008. https://doi.org/10.1145/1453856.1453962
  15. W. Pugh, “Skip Lists: A Probabilistic Alternative to Balanced Trees,” Communications of the ACM, Vol.33, No.6, pp.668-676, ACM, 1990. https://doi.org/10.1145/78973.78977

Cited by

  1. Extracting and Visualizing Dispute comments and Relations on Internet Forum Site vol.12, pp.2, 2012, https://doi.org/10.5392/JKCA.2012.12.02.040