• Title/Summary/Keyword: and clustering

Search Result 5,621, Processing Time 0.032 seconds

A practical application of cluster analysis using SPSS

  • Kim, Dae-Hak
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.6
    • /
    • pp.1207-1212
    • /
    • 2009
  • Basic objective in cluster analysis is to discover natural groupings of items or variables. In general, clustering is conducted based on some similarity (or dissimilarity) matrix or the original input text data. Various measures of similarities (or dissimilarities) between objects (or variables) are developed. We introduce a real application problem of clustering procedure in SPSS when the distance matrix of the objects (or variables) is only given as an input data. It will be very helpful for the cluster analysis of huge data set which leads the size of the proximity matrix greater than 1000, particularly. Syntax command for matrix input data in SPSS for clustering is given with numerical examples.

  • PDF

An Optimization Approach to Data Clustering

  • Kim, Ju-Mi;Olafsson, Sigurdur
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 2005.05a
    • /
    • pp.621-628
    • /
    • 2005
  • Scalability of clustering algorithms is critical issues facing the data mining community. This is particularly true for computationally intense tasks such as data clustering. Random sampling of instances is one possible means of achieving scalability but a pervasive problem with this approach is how to deal with the noise that this introduces in the evaluation of the learning algorithm. This paper develops a new optimization based clustering approach using an algorithms specifically designed for noisy performance. Numerical results illustrate that with this algorithm substantial benefits can be achieved in terms of computational time without sacrificing solution quality.

  • PDF

Clustering Scheme for (m,k)-Firm Streams in Wireless Sensor Networks

  • Kim, Ki-Il
    • Journal of information and communication convergence engineering
    • /
    • v.14 no.2
    • /
    • pp.84-88
    • /
    • 2016
  • As good example of potential application-specific requirement, (m,k)-firm real-time streams have been recently introduced to deliver multimedia data efficiently in wireless sensor networks. In addition to stream model, communication protocols to meet specific (m,k)-firm real-time streams have been newly developed or extended from existing protocols. However, since the existing schemes for an (m,k)-firm stream have been proposed under typical flat architecture, the scalability problem remains unsolved when the number of real-time flows increases in the networks. To solve this problem, in this paper, we propose a new clustering scheme for an (m,k)-firm stream. The two different clustering algorithms are performed according to either the (m,k)-firm requirement or the deadline. Simulation results are presented to demonstrate the suitability of the proposed scheme under hierarchical architecture by showing that its performance is acceptable irrespective of the increase in the number of flows.

The reduction of computer time in small-signal stability analysis in power systems : with clustering technique (전력계통의 미소신호 안정도 해석에서 계산시간 단축에 관한 연구 : 크러스터링 기법에 대하여)

  • Kwon, Sae-Hyuk;Kim, Deok-Young
    • Proceedings of the KIEE Conference
    • /
    • 1992.07a
    • /
    • pp.138-140
    • /
    • 1992
  • This paper represents how to reduce the computer time in small signal stability analysis by selecting the dominant oscillation modes with frequency of 0.5 to 1.2 Hz using the clustering technique. Clustering technique links the buses which are expected to be similar with zero-impedance lines and the voltage variations of these buses are regarded to be identical. The computer time was reduced remarkably with this technique and the effect of clustering will be powerful in the analysis of large-scale power systems.

  • PDF

Fuzzy Controller Modeling for Electromagnetic Levitation Systems based on Clustering Algorithm (클러스터링에 기초한 자기부상시스템의 퍼지제어기 모델링)

  • Kim, Min-Soo;Byun, Yeun-Sub;Lee, Kwan-Sup
    • Proceedings of the KSR Conference
    • /
    • 2006.11a
    • /
    • pp.145-159
    • /
    • 2006
  • This paper describes the development of a clustering based fuzzy controller of an electromagnetic suspension vehicle using gain scheduling method and Kalman filter for a simplified single magnet system. Electromagnetic suspension vehicle systems are highly nonlinear and essentially unstable systems For achieving the levitation control of the DC electromagnetic suspension system, we considered a fuzzy system modeling method based on clustering algorithm which a set of input/output data is collected from the well defined Linear Quadratic Gaussian(LQG) controller. Simulation results show that the proposed clustering based fuzzy controller methodology robustly yields uniform performance with adequate gap response over the mass variation range.

  • PDF

Distributed Recommendation System Using Clustering-based Collaborative Filtering Algorithm (클러스터링 기반 협업 필터링 알고리즘을 사용한 분산 추천 시스템)

  • Jo, Hyun-Je;Rhee, Phill-Kyu
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.14 no.1
    • /
    • pp.101-107
    • /
    • 2014
  • This paper presents an efficient distributed recommendation system using clustering collaborative filtering algorithm in distributed computing environments. The system was built based on Hadoop distributed computing platform, where distributed Min-hash clustering algorithm is combined with user based collaborative filtering algorithm to optimize recommendation performance. Experiments using Movie Lens benchmark data show that the proposed system can reduce the execution time for recommendation compare to sequential system.

Projection Pursuit K-Means Visual Clustering

  • Kim, Mi-Kyung;Huh, Myung-Hoe
    • Journal of the Korean Statistical Society
    • /
    • v.31 no.4
    • /
    • pp.519-532
    • /
    • 2002
  • K-means clustering is a well-known partitioning method of multivariate observations. Recently, the method is implemented broadly in data mining softwares due to its computational efficiency in handling large data sets. However, it does not yield a suitable visual display of multivariate observations that is important especially in exploratory stage of data analysis. The aim of this study is to develop a K-means clustering method that enables visual display of multivariate observations in a low-dimensional space, for which the projection pursuit method is adopted. We propose a computationally inexpensive and reliable algorithm and provide two numerical examples.

The Document Clustering using LSI of IR (LSI를 이용한 문서 클러스터링)

  • 고지현;최영란;유준현;박순철
    • Proceedings of the Korea Society for Industrial Systems Conference
    • /
    • 2002.06a
    • /
    • pp.330-335
    • /
    • 2002
  • The most critical issue in information retrieval system is to have adequate results corresponding to user requests. When all documents related with user inquiry retrieve, it is not easy not only to find correct document what user wants but is limited. Therefore, clustering method that grouped by corresponding documents has widely used so far. In this paper, we cluster on the basis of the meaning rather than the index term in the existing document and a LSI method is applied by this reason. Furthermore, we distinguish and analyze differences from the clustering using widely-used K-Means algorithm for the document clustering.

  • PDF

A Study of Germanium Substrate Vacancy Clustering Formation using Monte Carlo Method (Monte Carlo방법을 이용한 Germanium 기판의 결공형 클러스터링 형성에 대한 연구)

  • Lee, Jun-Ha
    • Journal of the Semiconductor & Display Technology
    • /
    • v.10 no.2
    • /
    • pp.45-48
    • /
    • 2011
  • In this paper, vacancy clustering formation and diffusion of germanium substrate was studied. The analysis method was adopted Monte Carlo method. At temperatures higher than melting point, fewer clusters formed, but there was less variation in the number of clusters than at lower temperatures, as the time increased. Equilibrium diffusivities in the clustering region were $10^2$ lower than those of free vacancies in the initial stage of kinetic lattice Monte Carlo simulations. They were expressed according to three temperature regimes: at temperatures above 1,100 K, at temperatures of 1,100-900 K, and at temperatures below 900 K. The effective mean migration energy, 1.1 eV, closely coincided with that of the 1.0-1.2 eV in experiments.

Clustering Algorithm Using Hashing in Classification of Multispectral Satellite Images

  • Park, Sung-Hee;Kim, Hwang-Soo;Kim, Young-Sup
    • Korean Journal of Remote Sensing
    • /
    • v.16 no.2
    • /
    • pp.145-156
    • /
    • 2000
  • Clustering is the process of partitioning a data set into meaningful clusters. As the data to process increase, a laster algorithm is required than ever. In this paper, we propose a clustering algorithm to partition a multispectral remotely sensed image data set into several clusters using a hash search algorithm. The processing time of our algorithm is compared with that of clusters algorithm using other speed-up concepts. The experiment results are compared with respect to the number of bands, the number of clusters and the size of data. It is also showed that the processing time of our algorithm is shorter than that of cluster algorithms using other speed-up concepts when the size of data is relatively large.