Proceedings of the IEEK Conference (대한전자공학회:학술대회논문집)
- 2002.06c
- /
- Pages.99-102
- /
- 2002
Analysis of Document Clustering Varing Cluster Centroid Decisions
클러스터 중심 결정 방법에 따른 문서 클러스터링 성능 분석
Abstract
K-means clustering algorithm is a very popular clustering technique, which is used in the field of information retrieval. In this paper, We deal with the problem of K-means Algorithm from the view of creating the centroids and suggest a method reflecting document feature and considering the context of each document to determine the new centroids during the process of forming new centroids. For experiment, We used the automatic document summarizer to summarize the Reuter21578 newslire test dataset and achieved 20% improved results to the recall metrics.
Keywords