Proceedings of the Korean Operations and Management Science Society Conference (한국경영과학회:학술대회논문집)
- 2005.05a
- /
- Pages.621-628
- /
- 2005
An Optimization Approach to Data Clustering
- Kim, Ju-Mi (Entrue Consulting Partners, LG CNS) ;
- Olafsson, Sigurdur (Entrue Consulting Partners, LG CNS)
- Published : 2005.05.13
Abstract
Scalability of clustering algorithms is critical issues facing the data mining community. This is particularly true for computationally intense tasks such as data clustering. Random sampling of instances is one possible means of achieving scalability but a pervasive problem with this approach is how to deal with the noise that this introduces in the evaluation of the learning algorithm. This paper develops a new optimization based clustering approach using an algorithms specifically designed for noisy performance. Numerical results illustrate that with this algorithm substantial benefits can be achieved in terms of computational time without sacrificing solution quality.
Keywords