Projection Pursuit K-Means Visual Clustering

  • Published : 2002.12.01

Abstract

K-means clustering is a well-known method for partitioning multivariate observations. It has recently been implemented widely in data mining software because of its computational efficiency in handling large data sets. However, it does not yield a suitable visual display of multivariate observations, which is especially important in the exploratory stage of data analysis. The aim of this study is to develop a K-means clustering method that enables visual display of multivariate observations in a low-dimensional space, for which the projection pursuit method is adopted. We propose a computationally inexpensive and reliable algorithm and provide two numerical examples.
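
The abstract does not spell out the proposed algorithm, so the following is only a minimal sketch of the general idea: search for a two-dimensional projection in which a K-means partition is well separated, then plot the observations in that plane. The function names (`kmeans`, `between_ratio`, `pursue_projection`), the projection index (share of between-cluster variation in the projected space), and the random search over orthonormal projections are all assumptions made for illustration and are not taken from the paper.

```python
import numpy as np

def kmeans(X, k, n_iter=50, seed=0):
    """Plain Lloyd-style K-means; returns labels and cluster centers."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        # Squared distance from every observation to every center.
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        # Recompute centers; keep the old center if a cluster is empty.
        new_centers = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels, centers

def between_ratio(Z, labels, centers):
    """Hypothetical projection index: 1 - within-cluster SS / total SS."""
    total = ((Z - Z.mean(axis=0)) ** 2).sum()
    within = ((Z - centers[labels]) ** 2).sum()
    return 1.0 - within / total

def pursue_projection(X, k, n_trials=200, seed=0):
    """Random search over orthonormal p x 2 projections (illustrative only)."""
    rng = np.random.default_rng(seed)
    # Standardize columns; assumes no column is constant.
    Xs = (X - X.mean(axis=0)) / X.std(axis=0)
    best_score, best_Q, best_labels = -np.inf, None, None
    for _ in range(n_trials):
        # QR of a Gaussian matrix gives a random orthonormal basis of a plane.
        Q, _ = np.linalg.qr(rng.standard_normal((X.shape[1], 2)))
        Z = Xs @ Q
        labels, centers = kmeans(Z, k)
        score = between_ratio(Z, labels, centers)
        if score > best_score:
            best_score, best_Q, best_labels = score, Q, labels
    return best_score, best_Q, best_labels
```

Calling `pursue_projection(X, k=3)` on an n-by-p data matrix returns the index value, a p-by-2 projection matrix, and the cluster labels; the two columns of the standardized data times that matrix give the coordinates for a scatter plot colored by cluster. A random search is used here only because it is short to write down; a practical projection pursuit routine would optimize the index more systematically.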
