A Study on K-Means Clustering

  • Bae, Wha-Soo (Department of Data Science, Inje University)
  • Roh, Se-Won (Department of Data Science, Inje University)
  • Published : 2005.08.01

Abstract

This paper studies K-means clustering, focusing on initialization, which affects the clustering results in K-means cluster analysis. Four initialization methods (the MA method, the KA method, the Max-Min method, and the Space Partition method) were compared. The clustering results show some differences among the methods: depending on the type of data, the MA method sometimes leads to incorrect clustering because of inappropriate initialization, whereas the Max-Min method is shown to be more effective than the other methods, especially when the data size is large.
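To make concrete why initialization matters, the following is a minimal Python sketch of a max-min style initialization followed by plain K-means iterations. The abstract does not define the Max-Min method, so this assumes the common farthest-first reading: each new center is the point farthest from its nearest already-chosen center. The function names `max_min_init` and `k_means`, the random choice of the first center, and the `seed` parameter are illustrative assumptions, not the authors' procedure.

```python
import math
import random


def max_min_init(points, k, seed=0):
    """Pick k initial centers by farthest-first (max-min) selection.

    Assumption: the first center is drawn at random; the paper may
    choose it differently.
    """
    rng = random.Random(seed)
    centers = [points[rng.randrange(len(points))]]
    # Distance from each point to its nearest chosen center so far.
    nearest = [math.dist(p, centers[0]) for p in points]
    while len(centers) < k:
        # Next center: the point whose nearest center is farthest away.
        idx = max(range(len(points)), key=nearest.__getitem__)
        centers.append(points[idx])
        nearest = [min(d, math.dist(p, points[idx]))
                   for p, d in zip(points, nearest)]
    return centers


def k_means(points, centers, max_iter=100):
    """Plain Lloyd-style K-means iterations from the given centers."""
    centers = [tuple(c) for c in centers]
    for _ in range(max_iter):
        # Assignment step: label each point with its closest center.
        labels = [min(range(len(centers)),
                      key=lambda j: math.dist(p, centers[j]))
                  for p in points]
        # Update step: move each center to the mean of its members.
        new_centers = []
        for j, old in enumerate(centers):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centers.append(
                    tuple(sum(c) / len(members) for c in zip(*members)))
            else:
                new_centers.append(old)  # keep an empty cluster's center
        if new_centers == centers:
            break
        centers = new_centers
    return centers, labels
```

Calling `k_means(points, max_min_init(points, k))` on well-separated data tends to place one initial center in each group, which is one plausible reason a spread-out initialization would avoid the poor starts the abstract attributes to other methods.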

References

  1. Anderberg, M.R. (1973). Cluster Analysis for Applications. Academic Press, New York.
  2. Forgy, E. (1965). Cluster Analysis of Multivariate Data: Efficiency vs. Interpretability of Classification. Biometrics, 21, 768.
  3. Hartigan, J.A. (1974). Clustering Algorithms. John Wiley & Sons, New York.
  4. Kaufman, L. and Rousseeuw, P.J. (1990). Finding Groups in Data: An Introduction to Cluster Analysis. John Wiley & Sons, Canada.
  5. MacQueen, J.B. (1967). Some Methods for Classification and Analysis of Multivariate Observations. Proc. Symp. Math and Probability, 5th, Berkeley, 1, 281-297, AD 669871. University of California Press, Berkeley, CA.
  6. Pena, J.M., Lozano, J.A. and Larranaga, P. (1999). An empirical comparison of four initialization methods for the K-means algorithm. Pattern Recognition Lett., 20, 1027-1040. https://doi.org/10.1016/S0167-8655(99)00069-0
  7. SAS/STAT User's Guide, Version 8 (1999). SAS Publishing, 1193-1244.

Cited by

  1. Development of the KnowledgeMatrix as an Informetric Analysis System vol.8, pp.1, 2008, https://doi.org/10.5392/JKCA.2008.8.1.068
  2. Investigation into factors influencing antioxidant capacity of vinegars vol.59, pp.4, 2016, https://doi.org/10.1007/s13765-016-0185-4