• Title/Summary/Keyword: and clustering

Search Result 5,621, Processing Time 0.033 seconds

A Study on Performance Evaluation of Clustering Algorithms using Neural and Statistical Method (클러스터링 성능평가: 신경망 및 통계적 방법)

  • 윤석환;신용백
    • Journal of the Korean Professional Engineers Association
    • /
    • v.29 no.2
    • /
    • pp.71-79
    • /
    • 1996
  • This paper evaluates the clustering performance of a neural network and a statistical method. Algorithms which are used in this paper are the GLVQ(Generalized Loaming vector Quantization) for a neural method and the k -means algorithm for a statistical clustering method. For comparison of two methods, we calculate the Rand's c statistics. As a result, the mean of c value obtained with the GLVQ is higher than that obtained with the k -means algorithm, while standard deviation of c value is lower. Experimental data sets were the Fisher's IRIS data and patterns extracted from handwritten numerals.

  • PDF

Normal Mixture Model with General Linear Regressive Restriction: Applied to Microarray Gene Clustering

  • Kim, Seung-Gu
    • Communications for Statistical Applications and Methods
    • /
    • v.14 no.1
    • /
    • pp.205-213
    • /
    • 2007
  • In this paper, the normal mixture model subjected to general linear restriction for component-means based on linear regression is proposed, and its fitting method by EM algorithm and Lagrange multiplier is provided. This model is applied to gene clustering of microarray expression data, which demonstrates it has very good performances for real data set. This model also allows to obtain the clusters that an analyst wants to find out in the fashion that the hypothesis for component-means is represented by the design matrices and the linear restriction matrices.

Customer Behavior Pattern Discovery by Adaptive Clustering Based on Swarm Intelligence

  • Dai, Weihui
    • Journal of Information Technology Applications and Management
    • /
    • v.17 no.1
    • /
    • pp.127-139
    • /
    • 2010
  • Customer behavior pattern discovery is the fundament for conducting customer oriented services and the services management. But, the composition, need, interest and experience of customers may be continuously changing, thereof lead to the difficulty in refining a stable description of their consistent behavior pattern. This paper presented a new method for the behavior pattern discovery from a changing collection of customers. It was originally inspired from the swarm intelligence of ant colony. By the adaptive clustering, some typical behavior patterns which reflect the characteristics of related customer clusters can extracted dynamically and adaptively.

  • PDF

The Design of GA-based TSK Fuzzy Classifier and Its application (GA기반 TSK 퍼지 분류기의 설계 및 응용)

  • 곽근창;김승석;유정웅;전명근
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2001.12a
    • /
    • pp.233-236
    • /
    • 2001
  • In this paper, we propose a TSK-type fuzzy classifier using PCA(Principal Component Analysis), FCM(Fuzzy C-Means) clustering and hybrid GA(genetic algorithm). First, input data is transformed to reduce correlation among the data components by PCA. FCM clustering is applied to obtain a initial TSK-type fuzzy classifier. Parameter identification is performed by AGA(Adaptive Genetic Algorithm) and RLSE(Recursive Least Square Estimate). we applied the proposed method to Iris data classification problems and obtained a better performance than previous works.

  • PDF

Clustering Observations for Detecting Multiple Outliers in Regression Models

  • Seo, Han-Son;Yoon, Min
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.3
    • /
    • pp.503-512
    • /
    • 2012
  • Detecting outliers in a linear regression model eventually fails when similar observations are classified differently in a sequential process. In such circumstances, identifying clusters and applying certain methods to the clustered data can prevent a failure to detect outliers and is computationally efficient due to the reduction of data. In this paper, we suggest to implement a clustering procedure for this purpose and provide examples that illustrate the suggested procedure applied to the Hadi-Simonoff (1993) method, reverse Hadi-Simonoff method, and Gentleman-Wilk (1975) method.

Analysis of Document Clustering Varing Cluster Centroid Decisions (클러스터 중심 결정 방법에 따른 문서 클러스터링 성능 분석)

  • 오형진;변동률;이신원;박순철;정성종;안동언
    • Proceedings of the IEEK Conference
    • /
    • 2002.06c
    • /
    • pp.99-102
    • /
    • 2002
  • K-means clustering algorithm is a very popular clustering technique, which is used in the field of information retrieval. In this paper, We deal with the problem of K-means Algorithm from the view of creating the centroids and suggest a method reflecting document feature and considering the context of each document to determine the new centroids during the process of forming new centroids. For experiment, We used the automatic document summarizer to summarize the Reuter21578 newslire test dataset and achieved 20% improved results to the recall metrics.

  • PDF

A new identification method of a fuzzy system via double clustering (이중 클러스터링 기법을 이용한 퍼지 시스템의 새로운 동정법)

  • 김은태;이기철;이희진;박민용
    • Journal of the Korean Institute of Telematics and Electronics C
    • /
    • v.35C no.7
    • /
    • pp.92-100
    • /
    • 1998
  • In this paper, we suggest a new identification method for sugeno-type fuzzy model via new data clustering strategy. The suggested algorithm is much simpelr than the original identification strategy adopted in. The algorithm suggested in this paper is somewhat similar to that of [2] and [6], that is the algorithm suggested in this paper consists of two steps: coarse tuning and fine tuning. In this paper, double clustering strategy is proposed for coarse tunign. Finally, the resutls of computer simulation are given to demonstrate the validity of this algorithm.

  • PDF

A Study on the Development of Web Based Database System with Clustering for Ship Hull Form and Basic Design (클러스터링을 이용한 웹기반 선형/기본설계 지원 데이터베이스 시스템 구축에 관한 연구)

  • 권영중;이정준
    • Proceedings of the Korea Committee for Ocean Resources and Engineering Conference
    • /
    • 2001.05a
    • /
    • pp.38-42
    • /
    • 2001
  • An engineering database system is presented for ship hull form and hasic design. Web i, nd clustering systems are adopted to develope the system. It is appeared that the system is efficient and useful for integrated ship design.

  • PDF

Multiple Peak Detection Using the Extended Fuzzy Clustering (확장된 퍼지 클러스터링 알고리즘을 이용한 다중 첨두 검출)

  • 김수환;조창호;강경진;이태원
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.29B no.1
    • /
    • pp.102-112
    • /
    • 1992
  • We have already proposed an extended fuzzy clustering algorithm which considers the importance of the data to be classified in a previous paper. In this paper, we suggest the extended fuzzy clustering algorithm based new method to slove a multiple peak detection problem, and prove experimently that this algorithm can detect the multiple peak adaptively to the noise and the shape of peaks.

  • PDF

Comparision of Clustering Methods in 2D Image for the Atomstion of Dangerous Machine (위험기계의 자동화를 위한 2차원 영상의 군집화 기법 비교 연구)

  • 이지용;이병곤
    • Journal of the Korean Society of Safety
    • /
    • v.11 no.1
    • /
    • pp.39-45
    • /
    • 1996
  • In this study, clustering of black BADUK stones' image were performed to recognize the individual stone from its closely spaced and partially occluded Image. And the clustering perfomance was compared between the classical methods and fuzzy C-means method. As a result, 2 BADUK stones' image was segmented precisely in every methods, but more than 3 stones the segmentation was depended on its shape. Fuzzy C-means method could be segmented correctly to 4 stones regardless of its shape, and It could be applied to the unknown number of clusters.

  • PDF