• Title/Summary/Keyword: Cluster index

Search Result 539, Processing Time 0.022 seconds

A Cluster validity Index for Fuzzy Clustering

  • Lee, Haiyoung
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.9 no.6
    • /
    • pp.621-626
    • /
    • 1999
  • In this paper a new cluster validation index which is heuristic but able to eliminate the monotonically decreasing tendency occurring in which the number of cluster c gets very large and close to the number of data points n is proposed. We review the FCM algorithm and some conventional cluster validity criteria discuss on the limiting behavior of the proposed validity index and provide some numerical examples showing the effectiveness of the proposed cluster validity index.

  • PDF

Comparison of the Cluster Validation Methods for High-dimensional (Gene Expression) Data (고차원 (유전자 발현) 자료에 대한 군집 타당성분석 기법의 성능 비교)

  • Jeong, Yun-Kyoung;Baek, Jang-Sun
    • The Korean Journal of Applied Statistics
    • /
    • v.20 no.1
    • /
    • pp.167-181
    • /
    • 2007
  • Many clustering algorithms and cluster validation techniques for high-dimensional gene expression data have been suggested. The evaluations of these cluster validation techniques have, however, seldom been implemented. In this paper we compared various cluster validity indices for low-dimensional simulation data and real gene expression data, and found that Dunn's index is the most effective and robust, Silhouette index is next and Davies-Bouldin index is the bottom among the internal measures. Jaccard index is much more effective than Goodman-Kruskal index and adjusted Rand index among the external measures.

PROPERTIES AND SPECTRAL BEHAVIOUR OF CLUSTER RADIO HALOS

  • FERETTI L.;BRUNETTI G.;GIOVANNINI G.;KASSIM N.;ORRU E.;SETTI G.
    • Journal of The Korean Astronomical Society
    • /
    • v.37 no.5
    • /
    • pp.315-322
    • /
    • 2004
  • Several arguments have been presented in the literature to support the connection between radio halos and cluster mergers. The spectral index distributions of the halos in A665 and A2163 provide a new strong confirmation of this connection, i.e. of the fact that the cluster merger plays an important role in the energy supply to the radio halos. Features of the spectral index (flattening and patches) are indication of a complex shape of the radiating electron spectrum, and are therefore in support of electron reacceleration models. Regions of flatter spectrum are found to be related to the recent merger. In the undisturbed cluster regions, instead, the spectrum steepens with the distance from the cluster center. The plot of the integrated spectral index of a sample of halos versus the cluster temperature indicates that clusters at higher temperature tend to host halos with flatter spectra. This correlation provides further evidence of the connection between radio emission and cluster mergers.

The Classification of Forest Communities by Cluster Analysis in Mt. Seokbyung Experimental Forest of Gangwon-Do

  • Chung, Sang-Hoon;Kim, Ji-Hong
    • Journal of Korean Society of Forest Science
    • /
    • v.99 no.5
    • /
    • pp.736-743
    • /
    • 2010
  • This study examined the ecological attributes of classified forest community by cluster analysis in the mixed forest of Mt. Seokbyung Experimental Forest of Gangwon-Do. The vegetation data were collected in randomly established 51 sample plots (2.04 ha) and analysis adopted the cluster analysis, importance value index, and Shannon's diversity index. Main results were as follows; 1) the study area was classified into 4 clusters (A, B, C and D). 2) The cluster A was dominated by Pinus densiflora with an importance value of 71.6%. The most dominant species in the cluster B and cluster C were Larix leptolepis (57.1%) and Quercus mongolica (40.2%), respectively. Finally, The cluster D was dominated by P. densiflora (30.6%) and Q. mongolica (31.0%) with the mixed forest. 3) In the P. densiflora community (cluster A), distribution of DBH class showed a reverse J-shaped curve. In the L. leptolepis community (cluster B), individuals of dominant species had the bell-shaped distribution. Oak species indicated uniform distribution of DBH class (under 25 cm) in the mixed P. densiflora - Q. mongolica community (cluster D). 4) The species diversity index of the communities in descending order were: Pinus densiflora - Q. mongolica community > Larix leptolepis community > Pinus densiflora community > Quercus mongolica community.

Fast Search Algorithm for Determining the Optimal Number of Clusters using Cluster Validity Index (클러스터 타당성 평가기준을 이용한 최적의 클러스터 수 결정을 위한 고속 탐색 알고리즘)

  • Lee, Sang-Wook
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.9
    • /
    • pp.80-89
    • /
    • 2009
  • A fast and efficient search algorithm to determine an optimal number of clusters in clustering algorithms is presented. The method is based on cluster validity index which is a measure for clustering optimality. As the clustering procedure progresses and reaches an optimal cluster configuration, the cluster validity index is expected to be minimized or maximized. In this Paper, a fast non-exhaustive search method for finding the optimal number of clusters is designed and shown to work well in clustering. The proposed algorithm is implemented with the k-mean++ algorithm as underlying clustering techniques using CB and PBM as a cluster validity index. Experimental results show that the proposed method provides the computation time efficiency without loss of accuracy on several artificial and real-life data sets.

Development of a New Cluster Index for Semiconductor Wafer Defects and Simulation - Based Yield Prediction Models (변동계수를 이용한 반도체 결점 클러스터 지표 개발 및 수율 예측)

  • Park, Hang-Yeob;Jun, Chi-Hyuck;Hong, Yu-Shin;Kim, Soo-Young
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.21 no.3
    • /
    • pp.371-385
    • /
    • 1995
  • The yield of semiconductor chips is dependent not only on the average defect density but also on the distribution of defects over a wafer. The distribution of defects leads to consider a cluster index. This paper briefly reviews the existing yield prediction models ad proposes a new cluster index, which utilizes the information about the defect location on a wafer in terms of the coefficient of variation. An extensive simulation is performed under a variety of defect distributions and a yield prediction model is derived through the regression analysis to relate the yield with the proposed cluster index and the average number of defects per chip. The performance of the proposed simulation-based yield prediction model is compared with that of the well-known negative binomial model.

  • PDF

A Variable Selection Procedure for K-Means Clustering

  • Kim, Sung-Soo
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.3
    • /
    • pp.471-483
    • /
    • 2012
  • One of the most important problems in cluster analysis is the selection of variables that truly define cluster structure, while eliminating noisy variables that mask such structure. Brusco and Cradit (2001) present VS-KM(variable-selection heuristic for K-means clustering) procedure for selecting true variables for K-means clustering based on adjusted Rand index. This procedure starts with the fixed number of clusters in K-means and adds variables sequentially based on an adjusted Rand index. This paper presents an updated procedure combining the VS-KM with the automated K-means procedure provided by Kim (2009). This automated variable selection procedure for K-means clustering calculates the cluster number and initial cluster center whenever new variable is added and adds a variable based on adjusted Rand index. Simulation result indicates that the proposed procedure is very effective at selecting true variables and at eliminating noisy variables. Implemented program using R can be obtained on the website "http://faculty.knou.ac.kr/sskim/nvarkm.r and vnvarkm.r".

METALLICITY DETERMINATION FOR A GLOBULAR CLUSTER BY SPECTRAL INDICES

  • LEE SANG-GAK
    • Journal of The Korean Astronomical Society
    • /
    • v.29 no.2
    • /
    • pp.157-170
    • /
    • 1996
  • In order to determine the metallicity of a globuar cluster, M3,by using the spectral indices, a kind of index grid has been establshed by stars in globular clusters, M3, M15, M71 and old open cluster, NGC 188. The indices were measured from the medium resolution spectra of about $2{\AA}$. The summed indices were used to determine metallicity in order to increase signals. It is found that the core depth index is measured more accurately and leads result more accurate than the pseudo-equivalent width index. This method can be further improved by including many more calibration globular clusters of various metallicity to make finer grids. By this method, the metallicity of M3 is determined as $[Fe/H] = -1.46\pm0.15$.

  • PDF

Evaluation on Development Performances of E-Commerce for 50 Major Cities in China (중국 주요 50개 도시의 전자상거래 발전성과에 대한 평가)

  • Jeong, Dong-Bin;Wang, Qiang
    • Journal of Distribution Science
    • /
    • v.14 no.1
    • /
    • pp.67-74
    • /
    • 2016
  • Purpose - In this paper, the degree of similarity and dissimilarity between pairs of 50 major cities in China can be shown on the basis of three evaluation variables(internet businessman index, internet shopping index and e-commerce development index). Dissimilarity distance matrix is used to analyze both similarity and dissimilarity between each fifty city in China by calculating dissimilarity as distance. Higher value signifies higher degree of dissimilarity between two cities. Cluster analysis is exploited to classify 50 cities into a number of different groups such that similar cities are placed in the same group. In addition, multidimensional scaling(MDS) technique can obtain visual representation for exploring the pattern of proximities among 50 major cities in China based on three development performance attributes. Research design, data, and methodology - This research is performed by the 2013 report provided with AliResearch in China(1/1/2013~11/30/2013) and utilized multivariate methods such as dissimilarity distance matrix, cluster analysis and MDS by using CLUSTER, KMEANS, PROXIMITIES and ALSCAL procedures in SPSS 21.0. Results - This research applies two types of cluster analysis and MDS on three development performances based on the 2013 report of Aliresearch. As a result, it is confirmed that grouping is possible by categorizing the types into four clusters which share similar characteristics. MDS is exploited to carry out positioning of both grouped locations of cluster and 50 major cities belonging to each cluster. Since all the values corresponding to Shenzhen, Guangzhou and Hangzhou(which belong to cluster 1 among 50 major cities) are very large, these cities are superior to other cities in all three evaluation attributes. Twelve cities(Beijing, ShangHai, Jinghua, ZhuHai, XiaMen, SuZhou, NanJing, DongWan, ZhangShan, JiaXing, NingBo and FoShan), which belong to cluster 3, are inferior to those of cluster 1 in terms of all three attributes, but they can be expected to be the next e-commerce revolution. The rest of major cities, in particular, which belong to cluster 4 are relatively inferior in all three attributes, so that this automatically evokes creative innovation, which leads to e-commerce development as a whole in China. In terms of internet businessman index, on the other hand, Tainan, Taizhong, and Gaoxiong(which belong to cluster 2) are situated superior to others. However, these three cities are inferior to others in an internet shopping index sense. The rest of major cities, in particular, which belong to cluster 4 are relatively inferior in all three evaluation attributes, so that this automatically evokes innovation and entrepreneurship, which leads to e-commerce development as a whole in China. Conclusions - This study suggests the implications to help e-governmental officers and companies make strategies in both Korea and China. This is expected to give some useful information in understanding the recent situation of e-commerce in China, by looking over development performances of 50 major cities. Therefore, we should develop marketing, branding and communication relevant to online Chinese consumers. One of these efforts will be incentives like loyalty points and coupons that can encourage consumers and building in-house logistics networks.

Analysis of Change Transitions in Regional Types in Emergency Department Patient Flows of in Jeonlado (2014-2018) (전라지역 응급실 환자의 유출입 분석 및 지역유형 변화 추이)

  • Lee, Jae-Hyeon;Lee, Sung-Min;Kim, Seongjung;Oh, Mi-Ra
    • Journal of Convergence for Information Technology
    • /
    • v.10 no.12
    • /
    • pp.126-131
    • /
    • 2020
  • This study analyzed the inflow and outflow patterns of emergency department patients, to identify changes in regional types in cities, counties, and districts in Jeonlado, Korea. Data of areas in Jeonlado for 2014 to 2018 were extracted from the National Emergency Department Information System. The extracted data includes the patients' and emergency medical institution addresses, which were used to calculate the relevance index (RI) and commitment index (CI). The calculated indices were classified into regional types by applying cluster analysis. A non-parametric method, Kruskal-Wallis test, was employed to examine the differences between years for RI and CI by regional types. The results of cluster analysis using the relevance and commitment indices revealed three regional types. Regions in cluster 1 were classified as outflow type, in cluster 2 as inflow type, and in cluster 3 as self-sufficient. RI and CI were calculated for each cluster or regional type. There were no significant differences between years in cluster 2 (inflow type) and cluster 3 (self-sufficient type). In cluster 1 (outflow type), there were no significant differences in CI between the years; however, there were significant differences in RI between 2014 and 2017, and 2014 and 2018. It is difficult to see that the emergency medical environment has improved due to the increased concentration of emergency medical care.