• Title/Summary/Keyword: K means clustering

The Efficient Feature Extraction of Handwritten Numerals in GLVQ Clustering Network (GLVQ클러스터링을 위한 필기체 숫자의 효율적인 특징 추출 방법)

  • Jeon, Jong-Won;Min, Jun-Yeong
    • The Transactions of the Korea Information Processing Society / v.2 no.6 / pp.995-1001 / 1995
  • A typical pattern recognition system consists of pre-processing, feature extraction, and classification or recognition. In classification, when widely varying patterns exist within the same category, clustering is needed to group similar patterns. Clustering algorithms follow two approaches: statistical approaches, such as k-means and ISODATA, and neural network approaches, such as T. Kohonen's LVQ (Learning Vector Quantization). Nikhil R. Pal et al. proposed GLVQ (Generalized LVQ) in 1993. This paper suggests efficient feature extraction methods for handwritten numerals in a GLVQ clustering network. We use handwritten numeral data from 21 writers (i.e., 200 patterns) and compare the proportion of misclassified patterns for each feature extraction method. With the projection combination method, the classification rate is 98.5%.

Dynamic Clustering based Optimization Technique and Quality Assessment Model of Mobile Cloud Computing (동적 클러스터링 기반 모바일 클라우드 컴퓨팅의 최적화 기법 및 품질 평가 모델)

  • Kim, Dae Young;La, Hyun Jung;Kim, Soo Dong
    • KIPS Transactions on Software and Data Engineering / v.2 no.6 / pp.383-394 / 2013
  • To augment the constrained resources of mobile devices, such as CPU and memory, many works have proposed mobile cloud computing (MCC), in which mobile devices use the remote resources of cloud services or PCs. Typically, an MCC environment contains many nodes with different operating systems and platforms and diverse mobile applications or services, and a central manager autonomously performs several management tasks to maintain a consistent level of overall MCC quality. However, as the number of nodes, mobile applications, and services subscribed to by those applications grows and their interactions increase sharply, the traditional management method reveals a fundamental problem: overall performance degrades because management tasks overload the central manager, i.e., a bottleneck. Therefore, in this paper, we propose a clustering-based optimization method to solve performance-related problems in large-scale MCC and to stabilize its overall quality. With the proposed method, management overload can be minimized and the quality of MCC stabilized in an active and autonomous way.

Detection of an Invariant Direction using K-means Clustering (K-means 클러스터링을 이용한 불변 방향 검출)

  • Kim, Dal-Hyoun;Lee, Woo-Ram;Jun, Byoung-Min
    • Proceedings of the KAIS Fall Conference / 2011.05a / pp.389-392 / 2011
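  • This paper proposes an improved algorithm that uses K-means clustering to detect the invariant direction, the key element of an intrinsic image, in order to achieve color constancy. First, the RGB image is partitioned into multiple clusters by K-means clustering, with Euclidean distance as the distance measure between clusters. Next, only the cluster containing the most colors is plotted in the x-chromaticity space, and the corresponding candidate invariant direction is computed. The candidate invariant direction is the direction whose projected histogram has the fewest bins containing three or more projected data points. Candidate invariant directions are then computed for the other clusters, and the direction that occurs most frequently is chosen as the final invariant direction of the image. In the experiments, the dataset proposed by Ebner was used for the test images, and a color constancy measure was used as the evaluation metric. The results show that the proposed method is better suited to the fluorescent dataset, which contains fluorescent surfaces, and achieves color constancy more than 1.5 times higher than the entropy method.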
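
The first two steps described in the abstract above (partition the RGB pixels with K-means under Euclidean distance, then keep the cluster with the most colors) can be sketched in plain Python. This is a minimal illustration, not the authors' implementation: the function names, the farthest-first initialization, the toy pixel list, and the reading of "most colors" as "most distinct RGB triples" are all assumptions, and the invariant-direction search itself is not shown.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def farthest_first(points, k):
    """Deterministic initialization (an assumption, not from the abstract):
    start at the first point, then repeatedly add the point that is
    farthest from all centers chosen so far."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(sq_dist(p, c) for c in centers)))
    return centers


def kmeans(points, k, iters=50):
    """Plain Lloyd's k-means; returns (centers, labels)."""
    centers = farthest_first(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest center by Euclidean distance.
        new_labels = [min(range(k), key=lambda c: sq_dist(p, centers[c])) for p in points]
        # Update step: move each center to the mean of its members.
        for c in range(k):
            members = [p for p, l in zip(points, new_labels) if l == c]
            if members:  # keep the old center if a cluster is empty
                centers[c] = tuple(sum(v) / len(members) for v in zip(*members))
        if new_labels == labels:
            break
        labels = new_labels
    return centers, labels


def cluster_with_most_colors(points, labels, k):
    """Return the pixels of the cluster holding the most distinct colors."""
    groups = [[p for p, l in zip(points, labels) if l == c] for c in range(k)]
    return max(groups, key=lambda group: len(set(group)))


# Toy "image": five reddish pixels (four distinct colors), three bluish ones.
pixels = [
    (250, 10, 10), (240, 20, 15), (245, 5, 20), (235, 15, 5), (250, 10, 10),
    (10, 10, 240), (20, 5, 250), (15, 15, 245),
]
centers, labels = kmeans(pixels, k=2)
dominant = cluster_with_most_colors(pixels, labels, k=2)
```

In a fuller pipeline, `dominant` would be the set of pixels passed on to the chromaticity projection step; the choice of `k` here is arbitrary.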

Efficient Data Clustering using Fast Choice for Number of Clusters (빠른 클러스터 개수 선정을 통한 효율적인 데이터 클러스터링 방법)

  • Kim, Sung-Soo;Kang, Bum-Su
    • Journal of Korean Society of Industrial and Systems Engineering / v.41 no.2 / pp.1-8 / 2018
  • The K-means algorithm is one of the most popular and widely used clustering methods because it is easy to implement and very efficient. However, it can only be used with a fixed number of clusters, because it evaluates clustering solutions by intra-cluster distance alone. The silhouette is a useful and stable validity index for choosing a clustering solution and its number of clusters for unsupervised data, because it considers both intra- and inter-cluster distances. However, this index carries a high computational burden because it computes a quality measure for every data object. The objective of this paper is to propose a fast and simple speed-up method that overcomes this limitation so that the silhouette can be used for effective large-scale data clustering. In the first step, the proposed method calculates and saves the distances between data objects once. In the second step, this distance matrix is used to calculate the relative distance rate ($V_j$) of each data object j, and this rate is used to choose a suitable number of clusters without much computation time. In the third step, an efficient heuristic algorithm (group search optimization, GSO, in this paper) searches for the global optimum while saving computation by starting from good initial solutions chosen probabilistically using $V_j$. Experiments and analysis on the Ruspini, Iris, Wine, and Breast Cancer datasets from the UCI Machine Learning Repository show that the proposed method saves significant computation time compared with the original silhouette alone. In particular, the proposed method performs much better than the previous method on larger datasets.
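
As a rough illustration of the baseline this abstract starts from, the sketch below computes the distance matrix once and reuses it to score candidate clusterings with the standard silhouette index. It does not implement the paper's relative distance rate $V_j$ or GSO, whose definitions are not given here. The function names, the toy points, and the hand-written candidate labelings (which would normally come from k-means runs) are assumptions.

```python
import math


def distance_matrix(points):
    """Pairwise Euclidean distances, computed once and reused for every k."""
    n = len(points)
    return [[math.dist(points[i], points[j]) for j in range(n)] for i in range(n)]


def silhouette(dist, labels):
    """Mean silhouette width from a precomputed distance matrix."""
    clusters = {}
    for i, label in enumerate(labels):
        clusters.setdefault(label, []).append(i)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")
    total = 0.0
    for i, label in enumerate(labels):
        own = clusters[label]
        if len(own) == 1:
            continue  # a singleton contributes s(i) = 0 by convention
        # a: mean distance to the other members of the same cluster.
        a = sum(dist[i][j] for j in own if j != i) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(dist[i][j] for j in other) / len(other)
            for other_label, other in clusters.items()
            if other_label != label
        )
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(labels)


# Two well-separated groups of three points each.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
dist = distance_matrix(points)

# Hypothetical candidate labelings for k = 2 and k = 3.
candidates = {
    2: [0, 0, 0, 1, 1, 1],
    3: [0, 0, 1, 2, 2, 2],
}
scores = {k: silhouette(dist, labels) for k, labels in candidates.items()}
best_k = max(scores, key=scores.get)
```

Every point contributes one `a` and one `b` term built from a full row of the distance matrix, which is the per-object cost the abstract describes as the silhouette's computational burden.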

Color vision test using k-Means clustering (k-Means 클러스터링을 활용한 색각 검사 방안)

  • Lee, Hye-Jin;Park, Young-Ho
    • Proceedings of the Korea Information Processing Society Conference / 2019.05a / pp.360-362 / 2019
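  • This paper studies a color vision test based on color-based image extraction using k-Means clustering. To this end, images in the RGB color space are converted to a particular color space, and k-Means clustering is applied according to the color pattern distribution to extract images of various forms. In this experiment, clustering a single image by its color distribution pattern and extracting images from it made it possible to distinguish people with normal color vision from people with color vision deficiency. The results confirm that, for extracted images with various shapes and colors, the image seen by a person with normal color vision differs from the image seen by a person with color vision deficiency.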

New Optimization Algorithm for Data Clustering (최적화에 기반 한 데이터 클러스터링 알고리즘)

  • Kim, Ju-Mi
    • Journal of Intelligence and Information Systems / v.13 no.3 / pp.31-45 / 2007
  • Handling large data is one of the critical issues facing the data mining community. This is particularly true for computationally intense tasks such as data clustering. Random sampling of instances is one possible means of handling large data, but a pervasive problem with this approach is how to deal with the noise in the evaluation of the learning algorithm. This paper develops a new optimization-based clustering approach using an algorithm specifically designed for noisy performance. Numerical results show that this algorithm performs better than other algorithms such as PAM and CLARA. Moreover, by using partial data, the algorithm achieves substantial savings in computational time without sacrificing solution quality.

A Study on Three Phase Partial Discharge Pattern Classification with the Aid of Optimized Polynomial Radial Basis Function Neural Networks (최적화된 pRBF 뉴럴 네트워크에 이용한 삼상 부분방전 패턴분류에 관한 연구)

  • Oh, Sung-Kwun;Kim, Hyun-Ki;Kim, Jung-Tae
    • The Transactions of The Korean Institute of Electrical Engineers / v.62 no.4 / pp.544-553 / 2013
  • In this paper, we propose a Radial Basis Function Neural Network (RBFNN) pattern classifier for the diagnosis of 3-phase partial discharge. Conventional methods map the partial discharge/noise data onto a 3-PARD map and decide whether partial discharge occurs at one of the three phases or at the neutral point. However, this decision depends on the subjective knowledge of a skilled expert. To solve this problem, both the mapping of data and the classification of phases are handled using the general 3-PARD map and the PA method, and the phase on which partial discharge or noise discharge occurs is identified. Subsequently, the type of partial discharge occurring on an arbitrary phase is classified and identified by a fuzzy clustering-based polynomial RBFNN classifier. The RBFNN classifier is optimized by using the PSO algorithm to determine the learning rate, the momentum coefficient, and the fuzzification coefficient of FCM fuzzy clustering. Simulated data and experimental data acquired in the field are used to evaluate the performance of the 3-phase partial discharge pattern classifier.

A Fault Detection System for Wind Power Generator Based on Intelligent Clustering Method (지능형 클러스터링 기법에 기반한 풍력발전 고장 검출 시스템)

  • Moon, Dae-Sun;Kim, Seon-Kook;Kim, Sung-Ho
    • Journal of Institute of Control, Robotics and Systems / v.19 no.1 / pp.27-33 / 2013
  • Nowadays, renewable energy sources such as wind energy are considered among the most effective means of generating massive amounts of electricity. This is evident in the rapid worldwide increase in wind farms, which comprise huge numbers of wind turbines. However, a drawback of wind turbines is that they require maintenance, which can be costly. To keep wind turbines in pristine condition and reduce downtime, a CMS (Condition Monitoring System) and an FDS (Fault Detection System) are mandatory. The efficiency and accuracy of these systems are crucial in deciding when to carry out maintenance. In this paper, a fault detection system based on an intelligent clustering method is proposed. Using SCADA data, the clustering model was trained and its accuracy evaluated through rigorous simulations. Results show that the proposed approach accurately detects the deteriorating condition of a wind turbine as it nears a downtime period.

Recognition and Tracking of Moving Objects Using Label-merge Method Based on Fuzzy Clustering Algorithm (퍼지 클러스터링 알고리즘 기반의 라벨 병합을 이용한 이동물체 인식 및 추적)

  • Lee, Seong Min;Seong, Il;Joo, Young Hoon
    • The Transactions of The Korean Institute of Electrical Engineers / v.67 no.2 / pp.293-300 / 2018
  • We propose a moving object extraction and tracking method to improve animal identification and tracking technology. First, we propose a method of merging separated moving objects into a single moving object using the FCM (Fuzzy C-Means) clustering algorithm, to solve the problem of moving object loss caused by the extraction process. In addition, we propose a method of extracting data from a moving object and a method of counting moving objects to determine the number of clusters, both of which are needed to satisfy the conditions for running the FCM clustering algorithm. Then, we propose a method to continuously track the merged moving objects. In the proposed method, color histograms are extracted from the feature information of each moving object and continuously accumulated so that they do not react sensitively to noise or changes; their average is then computed and stored. Thereafter, when multiple moving objects overlap and then separate, the stored color histograms are compared with each other to correctly recognize each moving object. Finally, we demonstrate the feasibility and applicability of the proposed algorithms through experiments.

An Improved Clustering Method with Cluster Density Independence (클러스터 밀도에 무관한 향상된 클러스터링 기법)

  • Yoo, Byeong-Hyeon;Kim, Wan-Woo;Heo, Gyeongyong
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2015.10a / pp.248-249 / 2015
  • Clustering is one of the most important unsupervised learning methods; it groups data into homogeneous clusters. However, cluster centers tend to lean toward high-density clusters because clustering is based on the distances between data points and cluster centers. This paper introduces a modified clustering method that forces cluster centers apart by adding a center-scattering term to the Fuzzy C-Means objective function. The proposed method converges closer to the real centers in fewer iterations than the original method. These strengths are verified by experimental results.
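
For reference, the unmodified Fuzzy C-Means iteration that the abstract above takes as its starting point can be sketched as follows. This is the standard algorithm only; the center-scattering term is not included because its form is not given in the abstract. The function name, the fuzzifier default `m=2.0`, the initial centers, and the toy one-dimensional data are assumptions.

```python
import math


def fcm(points, centers, m=2.0, iters=100, tol=1e-6):
    """Standard Fuzzy C-Means. Returns (centers, memberships), where
    memberships[j][i] is the degree to which point j belongs to cluster i,
    computed from the centers of the last iteration performed."""
    centers = [tuple(c) for c in centers]
    expo = 2.0 / (m - 1.0)
    dims = len(points[0])
    u = []
    for _ in range(iters):
        # Membership update: u_ij = 1 / sum_k (d_ij / d_kj)^(2 / (m - 1)).
        u = []
        for p in points:
            # Clamp distances so a point sitting on a center cannot divide by zero.
            d = [max(math.dist(p, c), 1e-12) for c in centers]
            u.append([1.0 / sum((di / dk) ** expo for dk in d) for di in d])
        # Center update: mean of the points weighted by u_ij^m.
        new_centers = []
        for i in range(len(centers)):
            w = [row[i] ** m for row in u]
            total = sum(w)
            new_centers.append(tuple(
                sum(wj * p[dim] for wj, p in zip(w, points)) / total
                for dim in range(dims)
            ))
        shift = max(math.dist(a, b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift < tol:
            break
    return centers, u


# A dense group near 0 (eight points) and a sparse group near 10 (two points).
data = [(0.0,), (0.1,), (0.2,), (-0.1,), (-0.2,), (0.05,), (-0.05,), (0.15,),
        (10.0,), (10.5,)]
final_centers, memberships = fcm(data, centers=[(1.0,), (9.0,)])
```

Every point has a nonzero membership in every cluster, so each center is a weighted mean over the whole dataset; this coupling is what lets a dense group influence the center of a neighboring sparse one when the groups are close together.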