• Title/Summary/Keyword: K-MEANS

Linear Discriminant Clustering in Pattern Recognition

  • Sun, Zhaojia;Choi, Mi-Seon;Kim, Young-Kuk
    • Proceedings of the IEEK Conference / 2008.06a / pp.717-718 / 2008
  • Fisher Linear Discriminant (FLD) is a simple and intuitive linear feature extraction method in pattern recognition. However, FLD performs poorly in some special cases, such as non-separable data or a single class whose data are dispersed across several clusters. This paper proposes a new discriminant, the K-means Fisher Linear Discriminant, which combines FLD with K-means clustering. It handles these cases efficiently, retaining both FLD's global view and K-means' local view. Simulation results demonstrate its advantage over K-means and FLD used individually.

Detection of an Invariant Direction using K-means Clustering (K-means 클러스터링을 이용한 불변 방향 검출)

  • Kim, Dal-Hyoun;Lee, Woo-Ram;Jun, Byoung-Min
    • Proceedings of the KAIS Fall Conference / 2011.05a / pp.389-392 / 2011
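  • This paper proposes an improved algorithm that uses K-means clustering to detect the invariant direction, the key element of an intrinsic image, in order to achieve color constancy. First, the RGB image is partitioned into multiple clusters by K-means clustering, with Euclidean distance as the distance measure between clusters. Next, only the cluster containing the most colors is plotted in the x-chromaticity space, and the corresponding candidate invariant direction is computed. The candidate invariant direction is the direction whose projected histogram has the fewest bins containing three or more projected data points. Candidate invariant directions are then computed for the other clusters as well, and the direction that occurs most frequently is chosen as the final invariant direction of the image. In the experiments, the dataset proposed by Ebner was used as the test images, and a color constancy measure was used as the evaluation metric. The results show that the proposed method is better suited to the fluorescent dataset, which contains fluorescent surfaces, and that its color constancy is more than 1.5 times higher than that of the entropy method.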

Revising K-Means Clustering under Semi-Supervision

  • Huh Myung-Hoe;Yi SeongKeun;Lee Yonggoo
    • Communications for Statistical Applications and Methods / v.12 no.2 / pp.531-538 / 2005
  • In k-means clustering, we standardize the variables before clustering and then iterate two steps: allocating units to the nearest centroid in the Euclidean sense, and updating the centroids. In database marketing applications, where clusters serve as customer segments with similar consumption behaviors, additional variables on the customers (the units) are frequently acquired a posteriori through marketing campaigns. The clusters originally formed therefore need to be modified after each campaign. This study proposes a method for revising k-means clusters that incorporates the added information by weighting the clustering variables. We illustrate the proposed method with an empirical case.
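The two iterated steps named in this abstract, together with per-variable weighting, can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' revision method: the function names `standardize` and `weighted_kmeans`, the way the weights enter the squared distance, and the caller-supplied starting centroids are all hypothetical choices made for the example.

```python
import math

def standardize(rows):
    """Scale each column to zero mean and unit (population) standard deviation."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    # A constant column has zero spread; fall back to 1.0 to avoid dividing by zero.
    stds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) or 1.0
            for c, m in zip(cols, means)]
    return [[(v - m) / s for v, m, s in zip(r, means, stds)] for r in rows]

def weighted_kmeans(rows, centroids, weights, iters=100):
    """Lloyd-style k-means with a per-variable weight in the squared distance."""
    centroids = [list(c) for c in centroids]  # work on a copy of the seeds

    def dist2(a, b):
        return sum(w * (x - y) ** 2 for w, x, y in zip(weights, a, b))

    labels = None
    for _ in range(iters):
        # Allocation step: each unit goes to its nearest centroid.
        new_labels = [min(range(len(centroids)), key=lambda k: dist2(r, centroids[k]))
                      for r in rows]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for k in range(len(centroids)):
            members = [r for r, lab in zip(rows, labels) if lab == k]
            if members:  # keep the old centroid if a cluster empties
                centroids[k] = [sum(c) / len(members) for c in zip(*members)]
    return labels, centroids
```

Calling `weighted_kmeans` twice on the same standardized data with different `weights` shows how shifting weight toward a variable changes which units end up grouped together.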

Clustering-based Monitoring and Fault detection in Hot Strip Roughing Mill (군집기반 열간조압연설비 상태모니터링과 진단)

  • SEO, MYUNG-KYO;YUN, WON YOUNG
    • Journal of Korean Society for Quality Management / v.45 no.1 / pp.25-38 / 2017
  • Purpose: A hot strip rolling mill consists of many mechanical and electrical units. During condition monitoring and diagnosis, various units can fail for unknown reasons. In this study, we propose an effective method for the early detection of units in abnormal status in order to minimize system downtime. Methods: The early warning problem for multiple units is defined. The K-means and PAM algorithms, with Euclidean and Manhattan distances, were applied to detect abnormal status. In addition, the performance of the proposed algorithm is investigated through field data analysis. Results: PAM with Manhattan distance (PAM_ManD) showed better results than the K-means algorithm with Euclidean distance (K-means_ED). In addition, multivariate field data analysis indicated that the system reliability of a hot strip rolling mill can be increased by detecting abnormal status early. Conclusion: This paper proposes a clustering-based monitoring and fault detection algorithm using Manhattan distance. Experiments are performed to study the benefit of PAM with Manhattan distance over K-means with Euclidean distance.
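One way to see why a medoid-based method with Manhattan distance might behave differently from K-means on data containing abnormal readings is to compare the two kinds of cluster center on a toy sample. The sketch below covers only the center computation, not the full PAM swap procedure or the paper's monitoring algorithm; the `readings` values and all function names are hypothetical.

```python
def manhattan(a, b):
    """Sum of absolute coordinate differences."""
    return sum(abs(x - y) for x, y in zip(a, b))

def centroid(points):
    """K-means style center: the coordinate-wise mean."""
    return [sum(c) / len(points) for c in zip(*points)]

def medoid(points, dist):
    """PAM style center: the actual data point with the smallest total distance to the rest."""
    return min(points, key=lambda p: sum(dist(p, q) for q in points))

# Four normal readings plus one abnormal reading (made-up values).
readings = [[1.0, 1.0], [1.0, 2.0], [2.0, 1.0], [2.0, 2.0], [50.0, 50.0]]

center_mean = centroid(readings)             # pulled toward the abnormal reading
center_medoid = medoid(readings, manhattan)  # stays inside the normal group
```

Because the medoid must be one of the observed points, a single extreme reading cannot drag it away from the bulk of the data, whereas the mean shifts noticeably.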

K-Means-Based Polynomial-Radial Basis Function Neural Network Using Space Search Algorithm: Design and Comparative Studies (공간 탐색 최적화 알고리즘을 이용한 K-Means 클러스터링 기반 다항식 방사형 기저 함수 신경회로망: 설계 및 비교 해석)

  • Kim, Wook-Dong;Oh, Sung-Kwun
    • Journal of Institute of Control, Robotics and Systems / v.17 no.8 / pp.731-738 / 2011
  • In this paper, we introduce an advanced architecture of K-Means clustering-based polynomial Radial Basis Function Neural Networks (p-RBFNNs) designed with the aid of SSOA (Space Search Optimization Algorithm) and develop a comprehensive design methodology supporting their construction. To design the optimized p-RBFNNs, the center value of each receptive field is first determined by running the K-Means clustering algorithm, and then the center value and the width of the corresponding receptive field are optimized through SSOA. The connections (weights) of the proposed p-RBFNNs are of functional character and are realized by considering three types of polynomials. In addition, WLSE (Weighted Least Square Estimation) is used to estimate the coefficients of the polynomials (serving as functional connections of the network) of each node from the output node. As a result, the local learning capability and the interpretability of the proposed model are improved. The proposed model is illustrated using a nonlinear function and NOx, a so-called Machine Learning dataset. A comparative analysis reveals that the proposed model exhibits higher accuracy and better predictive capability than some previous models available in the literature.

A Study On Predicting Stock Prices Of Hallyu Content Companies Using Two-Stage k-Means Clustering (2단계 k-평균 군집화를 활용한 한류컨텐츠 기업 주가 예측 연구)

  • Kim, Jeong-Woo
    • Journal of the Korea Convergence Society / v.12 no.7 / pp.169-179 / 2021
  • This study shows that a two-stage k-means clustering method can improve performance in predicting stock prices. To this end, it introduces the two-stage k-means clustering algorithm and tests its prediction performance against various machine learning techniques. The method selects the cluster closest to the prediction target from an initial k-means clustering, then reapplies k-means to that cluster to find a cluster closer to the actual value. As a result, the predicted values of this method are shown to be closer to the actual stock price than those of other machine learning techniques. Furthermore, the method yields relatively stable predictions despite using a relatively small cluster. Accordingly, it can improve both the accuracy and the stability of prediction, and it can be regarded as a new clustering method useful for small data. Future work is needed to extend two-stage k-means clustering to large-scale data.
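The select-then-recluster idea in this abstract can be sketched as below. It is one possible reading of the description, not the paper's implementation: the deterministic seeding, the use of the mean target value of the final cluster as the prediction, and the names `kmeans`, `nearest_cluster`, and `two_stage_predict` are assumptions made for illustration.

```python
def sqdist(a, b):
    """Squared Euclidean distance."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(points, k, iters=50):
    """Plain Lloyd k-means; seeds are k evenly spaced points (a simplification)."""
    step = max(len(points) // k, 1)
    centroids = [list(points[min(i * step, len(points) - 1)]) for i in range(k)]
    labels = []
    for _ in range(iters):
        labels = [min(range(k), key=lambda j: sqdist(p, centroids[j])) for p in points]
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = [sum(c) / len(members) for c in zip(*members)]
    return labels, centroids

def nearest_cluster(features, targets, query, k):
    """Cluster the features and keep only the members of the cluster closest to the query."""
    labels, centroids = kmeans(features, k)
    best = min(sorted(set(labels)), key=lambda j: sqdist(query, centroids[j]))
    keep = [i for i, lab in enumerate(labels) if lab == best]
    return [features[i] for i in keep], [targets[i] for i in keep]

def two_stage_predict(features, targets, query, k1=2, k2=2):
    """Stage 1 narrows to the nearest cluster; stage 2 reclusters inside it."""
    f1, t1 = nearest_cluster(features, targets, query, min(k1, len(features)))
    f2, t2 = nearest_cluster(f1, t1, query, min(k2, len(f1)))
    return sum(t2) / len(t2)  # assumed predictor: mean target of the final cluster
```

In this sketch the second stage only ever sees the members of the first-stage cluster, which is why the final prediction rests on a relatively small group of observations.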

Real-Time Traffic Sign Detection Using K-means Clustering and Neural Network (K-means Clustering 기법과 신경망을 이용한 실시간 교통 표지판의 위치 인식)

  • Park, Jung-Guk;Kim, Kyung-Joong
    • Proceedings of the Korean Information Science Society Conference / 2011.06a / pp.491-493 / 2011
  • Traffic sign detection belongs to the domain of automatic driver assistance systems. Prior work has used color information for traffic sign detection; however, color-based methods face ill-posed conditions, and extracting the region of interest is difficult. We propose a method for traffic sign detection that uses k-means clustering, a back-propagation neural network, and projection histogram features, and that is robust to ill-posed conditions. Using the color information of traffic signs enables the k-means algorithm to cluster the region of interest efficiently for detection. At each clustering step, a cluster is verified by the neural network so that it accurately represents the location of a traffic sign. The proposed method is practical and robust to unexpected regions of interest and to multiple detections.

K-means Algorithm in outside weight region of convergence for initial iteration learning (초기 반복학습 시 수렴영역을 벗어난 가중치에 의한 K-means 알고리즘)

  • Park SoHee;Cho CheHwang
    • Proceedings of the Acoustical Society of Korea Conference / autumn / pp.143-146 / 2001
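  • This paper proposes a K-means algorithm that generates the initial codebook by random initialization and, during the initial training iterations, uses a weight of 2 or more, which lies outside the region of convergence. Exploiting the facts that the conventional K-means algorithm is only locally optimal and that the weight has a large influence during the initial iterations, the proposed method sets the weight in the initial iterations to a large value outside the region of convergence and fixes the weight in later iterations to a value inside the region of convergence when designing the codebook. In addition, because a randomly generated initial codebook is used without an additional procedure such as the Splitting method, the proposed algorithm has a simple structure, and the resulting codebook was confirmed to perform well.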

A Study on K-Means Clustering

  • Bae, Wha-Soo;Roh, Se-Won
    • Communications for Statistical Applications and Methods / v.12 no.2 / pp.497-508 / 2005
  • This paper studies K-means clustering, focusing on initialization, which affects the clustering results in K-means cluster analysis. Four methods (the MA method, the KA method, the Max-Min method, and the Space Partition method) were compared. The clustering results differed among these methods: in particular, the MA method sometimes leads to incorrect clustering because of inappropriate initialization, depending on the type of data, while the Max-Min method is shown to be more effective than the other methods, especially when the data size is large.
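As a rough illustration of what a max-min style initialization can look like, the sketch below implements farthest-first seeding: each new seed is the point whose distance to its nearest already chosen seed is largest. This is a common reading of a max-min rule and an assumption here; the variant compared in the paper may differ, and starting from the first data point is an arbitrary choice for the example.

```python
def sqdist(a, b):
    """Squared Euclidean distance."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def maxmin_seeds(points, k):
    """Farthest-first seeding: maximize the minimum distance to the seeds chosen so far."""
    seeds = [points[0]]  # assumption: start from the first point
    while len(seeds) < k:
        # For each point, find its distance to the closest seed, then take the
        # point for which that distance is largest.
        nxt = max(points, key=lambda p: min(sqdist(p, s) for s in seeds))
        seeds.append(nxt)
    return seeds
```

The returned seeds would then serve as the starting centroids for the usual allocation and update iterations.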

Color vision test using k-Means clustering (k-Means 클러스터링을 활용한 색각 검사 방안)

  • Lee, Hye-Jin;Park, Young-Ho
    • Proceedings of the Korea Information Processing Society Conference / 2019.05a / pp.360-362 / 2019
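  • This paper studies a color vision test method based on color-based image extraction using k-Means clustering. To this end, images in the RGB color space are converted into a particular color space, and k-Means clustering is then applied according to the distribution of color patterns in experiments that extract images of various forms. In these experiments, clustering a single image by its color distribution pattern and extracting images from it made it possible to distinguish people with normal color vision from people with color vision deficiency. The results confirmed that, for extracted images of various shapes and colors, the image seen by a person with normal color vision differs from the image seen by a person with color vision deficiency.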