Neighborhood Selection with Intrinsic Partitions

데이터 분포에 기반한 유사 군집 선택법

  • 김계현 (포항공과대학교 컴퓨터공학과) ;
  • 최승진 (포항공과대학교 컴퓨터공학과)
  • Published : 2007.10.26

Abstract

We present a novel method for determining k nearest neighbors, which accurately recognizes the underlying clusters in a data set. To this end, we introduce the "tiling neighborhood" which is constructed by tiling a number of small local circles rather than a single circle, as existing neighborhood schemes do. Then we formulate the problem of determining the tiling neighborhood as a minimax optimization, leading to an efficient message passing algorithm. For several real data sets, our method outperformed the k-nearest neighbor method. The results suggest that our method can be an alternative to existing for general classification tasks, especially for data sets which have many missing values.

Keywords