• Title/Summary/Keyword: and cluster set.

Search Result 605, Processing Time 0.029 seconds

Improved Multidimensional Scaling Techniques Considering Cluster Analysis: Cluster-oriented Scaling (클러스터링을 고려한 다차원척도법의 개선: 군집 지향 척도법)

  • Lee, Jae-Yun
    • Journal of the Korean Society for information Management
    • /
    • v.29 no.2
    • /
    • pp.45-70
    • /
    • 2012
  • There have been many methods and algorithms proposed for multidimensional scaling to mapping the relationships between data objects into low dimensional space. But traditional techniques, such as PROXSCAL or ALSCAL, were found not effective for visualizing the proximities between objects and the structure of clusters of large data sets have more than 50 objects. The CLUSCAL(CLUster-oriented SCALing) technique introduced in this paper differs from them especially in that it uses cluster structure of input data set. The CLUSCAL procedure was tested and evaluated on two data sets, one is 50 authors co-citation data and the other is 85 words co-occurrence data. The results can be regarded as promising the usefulness of CLUSCAL method especially in identifying clusters on MDS maps.

Creating Generic Cluster Indicators based upon an Agent-centred Cluster Framework (행위주체 중심 클러스터 사고 틀에 기반한 클러스터 지표 개발에 관한 연구)

  • Jeong, Jun-Ho;Kim, Hag-Soo
    • Journal of the Economic Geographical Society of Korea
    • /
    • v.13 no.3
    • /
    • pp.416-441
    • /
    • 2010
  • This paper attempts to develop a framework articulating a suite of indicators of cluster development, based upon existing work on the economic geography of clusters, cluster frameworks and indicators and cluster policies. Unlike other work the framework adopted here emphasizes adaptive and proactive roles played by agents, whether individual or collective, within the cluster when understanding it as a learning environment to capture an implication made from adopting the cluster perspective. Some possible indicators are operationalized and suggested even if they are not definitive and exhaustive. The conceptual framework and the specific indicators suggested can provide policy-makers and key stakeholders in clusters with a proper set of tools for measuring the level of cluster development, maneuvering a broader strategic planning exercise for successful cluster development.

  • PDF

On the clustering of huge categorical data

  • Kim, Dae-Hak
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.6
    • /
    • pp.1353-1359
    • /
    • 2010
  • Basic objective in cluster analysis is to discover natural groupings of items. In general, clustering is conducted based on some similarity (or dissimilarity) matrix or the original input data. Various measures of similarities between objects are developed. In this paper, we consider a clustering of huge categorical real data set which shows the aspects of time-location-activity of Korean people. Some useful similarity measure for the data set, are developed and adopted for the categorical variables. Hierarchical and nonhierarchical clustering method are applied for the considered data set which is huge and consists of many categorical variables.

Density-based Outlier Detection in Multi-dimensional Datasets

  • Wang, Xite;Cao, Zhixin;Zhan, Rongjuan;Bai, Mei;Ma, Qian;Li, Guanyu
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.12
    • /
    • pp.3815-3835
    • /
    • 2022
  • Density-based outlier detection is one of the hot issues in data mining. A point is determined as outlier on basis of the density of points near them. The existing density-based detection algorithms have high time complexity, in order to reduce the time complexity, a new outlier detection algorithm DODMD (Density-based Outlier Detection in Multidimensional Datasets) is proposed. Firstly, on the basis of ZH-tree, the concept of micro-cluster is introduced. Each leaf node is regarded as a micro-cluster, and the micro-cluster is calculated to achieve the purpose of batch filtering. In order to obtain n sets of approximate outliers quickly, a greedy method is used to calculate the boundary of LOF and mark the minimum value as LOFmin. Secondly, the outliers can filtered out by LOFmin, the real outliers are calculated, and then the result set is updated to make the boundary closer. Finally, the accuracy and efficiency of DODMD algorithm are verified on real dataset and synthetic dataset respectively.

A Cluster modeling using New Convergence properties (새로운 수렴특성을 이용한 클러스터 모델링)

  • Kim, Sung-Suk;Baek, Chan-Soo;Kim, Sung-Soo;Ryu, Joeng-Woong
    • Proceedings of the KIEE Conference
    • /
    • 2004.11c
    • /
    • pp.382-384
    • /
    • 2004
  • In this parer, we propose a clustering that perform algorithm using new convergence properties. For detection and optimization of cluster, we use to similarity measure with cumulative probability and to inference the its parameters with MLE. A merits of using the cumulative probability in our method is very effectiveness that robust to noise or unnecessary data for inference the parameters. And we adopt similarity threshold to converge the number of cluster that is enable to past convergence and delete the other influence for this learning algorithm. In the simulation, we show effectiveness of our algorithm for convergence and optimization of cluster in riven data set.

  • PDF

The Image Compression Using the Central Vectors of Clusters (Cluster의 중심벡터를 이용하는 영상 압축)

  • Cho, Che-Hwang
    • The Journal of the Acoustical Society of Korea
    • /
    • v.14 no.1
    • /
    • pp.5-12
    • /
    • 1995
  • In the case where the set of training vectors constitute clusters, the codevectors of the codebook which is used to compression for speech and images in the vector quantization are regarded as the central vectors of the clusters constituted by given training vectors. In this work, we consider the distribution of Euclidean distance obtaining in the process of searching for the minimum distance between vectors, and propose the method searching for the proper number of and the central vectors of clusters. And then, the proposed method shows more than the about 4[dB] SNR than the LBG algorithm and the competitive learning algorithm

  • PDF

Cluster Model of Multilingual Training of University Students: Theory and Practice of Engineering Education

  • Suvorova, Svetlana;Khilchenko, Tatyana;Gnatyshina, Elena;Uvarina, Natalia;Savchenkov, Alexey
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.10
    • /
    • pp.107-112
    • /
    • 2022
  • Nowadays clusters are recognized as an important instrument for promoting industrial development, innovation, competitiveness and growth. An educational cluster is a set of interrelated vocational educational institutions of various levels that are united by industry with each other and are connected by partnership with industry enterprises. This article attempts to develop and describe cluster model of university students' multilingual training. The purpose of this study is to describe multilingual training of university students and their polycultural competencies formation and to define the process of multilingual training in form of a cluster. The authors consider clusters as an integral part of the educational campus within the concept framework of Shadrinsk State Pedagogical University. To determine the essence of the concept of a cluster model of university students' multilingual training, theoretical, empirical, observational, and diagnostic methods were implemented, such as a review of scientific literature, a compilation of best practices, observation, statistical methods, etc. The authors analyzed the programs of partner universities and organized international webinars and internships for bachelors and masters abroad and developed online courses "Foreign language for undergraduate students and masters". Experimental data obtained during the implementation of cluster training show the effectiveness of the formation of students' polycultural competencies.

Integrated Linux Cluster System Administration Tool (리눅스 클러스터 시스템 통합 관리 도구)

  • 김은회;김지연;박용관;권성주;최재영
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.8 no.6
    • /
    • pp.639-646
    • /
    • 2002
  • In this paper, we discuss the system configuration and the design issues of CATS-i, a set of installation and administration tools for Linux cluster systems. CATS-i enables users to manage cluster systems easily, quickly, and safety. It integrates many functions, ranging from installing operating systems and applications to real-time monitoring and management of various important resources. In addition, batch job submission and management are included. These functions support a single system image. Finally, a powerful graphic user interface based on Java lets users quickly understand the cluster status and conveniently use the CATS-i on the Web.

Clustering Technique for Multivariate Data Analysis

  • Lee, Jin-Ki
    • Journal of the military operations research society of Korea
    • /
    • v.6 no.2
    • /
    • pp.89-127
    • /
    • 1980
  • The multivariate analysis techniques of cluster analysis are examined in this article. The theory and applications of the techniques and computer software concerning these techniques are discussed and sample jobs are included. A hierarchical cluster analysis algorithm, available in the IMSL software package, is applied to a set of data extracted from a group of subjects for the purpose of partitioning a collection of 26 attributes of a weapon system into six clusters of superattributes. A nonhierarchical clustering procedure were applied to a collection of data of tanks considering of twenty-four observations of ten attributes of tanks. The cluster analysis shows that the tanks cluster somewhat naturally by nationality. The principal componant analysis and the discriminant analysis show that tank weight is the single most important discriminator among nationality although they are not shown in this article because of the space restriction. This is a part of thesis for master's degree in operations research.

  • PDF

Facture Simulation using Molecular Dynamics on a PC Cluster (PC 클러스터 상에서 분자동역학을 이용한 파괴 모사)

  • Choi, Deok-Kee;Ryu, Han-Kyu
    • Proceedings of the KSME Conference
    • /
    • 2001.11a
    • /
    • pp.252-257
    • /
    • 2001
  • With the help of newly arrived technology such as PC clustering, molecular dynamics (MD) seems to be promising for large-scale materials simulations. A cost-effective cluster is set up using commodity PCs connected over Ethernet with fast switching devices and free software Linux. Executing MD simulations in the parallel sessions makes it possible to carry out large-scale materials simulations at acceptable computation time and costs. In this study, the MD computer code for fracture simulation is modified to comply with MPI (Message Passing Interface) specification, and runs on the PC cluster in parallel mode flawlessly. It is noted that PC clusters can provide a rather inexpensive high-performance computing environment comparing to supercomputers, if properly arranged.

  • PDF