• Title/Summary/Keyword: Optimal Number of Clusters

Search Result 79, Processing Time 0.038 seconds

The Classification of Forest Cover Types by Consecutive Application of Multivariate Statistical Analysis in the Natural Forest of Western Mt. Jiri (다변량 통계 분석법의 연속 적용에 의한 서부 지리산 천연림의 산림 피복형 분류)

  • Chung, Sang Hoon;Kim, Ji Hong
    • Journal of Korean Society of Forest Science
    • /
    • v.102 no.3
    • /
    • pp.407-414
    • /
    • 2013
  • This study was conducted to classify forest cover types using the multivariate statistical analysis in the natural forest of western Mt. Jiri. On the basis of the vegetation data by point quarter sampling, the adopted analytical methods were species-area curve (SAC), hierarchical cluster analysis (HCA), indicator species analysis (ISA), and multiple discriminant analysis (MDA). SAC selected the outlier tree species which was likely to have no influence on the classification of forest cover types, excluded from all analytical process. Based on forest vegetative information, HCA classified the study area into 2 to 10 clusters and ISA indicated that the optimal number of clusters were seven. MDA was taken to test the clusters that classified with HCA and ISA. The seven clusters were classified appropriately as overall classification success were 91.3%. The classified forest cover types were named by the ratio of the dominant species in the upper layer of each cluster. They were (1) Quercus mongolica Pure forest, (2) Mixed mesophytic forest, (3) Q. mongolica - Q. serrata forest, (4) Abies koreana - Q. mongolica forest, (5) Fraxinus mandshurica forest, (6) Q. serrata forest, and (7) Carpinus laxiflora forest.

Suggestions for Multi-Layer Planting Model in Seoul Area Based on a Cluster Analysis and Interspecific Association (식생 군집분석과 종간친화력 분석을 통한 서울형 다층구조 식재모델 제안)

  • Kim, Min-Kyung;Sim, Woo-Kyung
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.38 no.4
    • /
    • pp.106-127
    • /
    • 2010
  • Although multi-layer planting methods are more widely used as a method for clustered planting and environmental programs such as plant remediation, difficulties have been faced in applying those to planting design. This study develops a basic planting model that can be applied to multi-layer planting in basis on an analysis of forest structures in the Seoul area. An optimal number of clusters was determined through the ISA (Indicator Species Analysis), and 7 basic clusters were found through a cluster analysis by using PC ORD 4.0 software specifically developed for ecological analysis. The 7 basic clusters include the following communities: the Quercus acutissima Community, Sorbus alnifolia-Quercus mongolica Community, Pinus rigida-Pinus densifiora Community, Rododendron mucronulatum var. mucronulatum-Quercus mongolica Community, Juniperus rigida-Quercus mongolica Community, Rododendron mucronulatum var. mucronulatum-Pinus densiflora Community, and Rododendron sclippenbachii-Quercus mongolica Community. The study also selected 57 species with at least a 10% frequency among the plant species existing in the Seoul area and suggested both a companion species and available similar alternative species by conducting an additional interspecific association analysis. This study may help to enhance usefulness of the model in architectural planting design. In addition, the two results named above were synthesized to develop a multi-layer planting model that can be utilized in landscape planting design by selecting similar alternative species through the interspecific association analysis, which includes 7 clusters of natural plants. The multi-layer planting model can be widely applied to design planting because the model has an average target cover range based on the average value of a transformed likelihood.

A Study for an Optimal Load Balancing Algorithm based on the Real-Time Server Monitor of a Real Server (리얼 서버의 실시간 서버 모니터에 의한 최적 로드 밸런싱 알고리즘에 관한 연구)

  • Han, Il-Seok;Kim, Wan-Yong;Kim, Hag-Bae
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2003.11a
    • /
    • pp.201-204
    • /
    • 2003
  • At a consequence of WWW large popularity, the internet has suffered from various performance problems, such as network congestion and overloaded servers. These days, it is not uncommon to find servers refusing connections because they are overloaded. Web server performance has always been a key issue in the design and operation of on-line systems. With regard to Internet, performance is also critical, because users want fast and easy access to all objects (e.g., documents, graphics, audio, and video) available on the net. To solve this problem, a number of companies are exploring the benefits of having multiple geographically or locally distributed Internet sites. This requires a comprehensive scheme for traffic management, which includes the principle of an optimal load balancing of client requests across multiple clusters of real servers. This paper focuses on the performance analysis of Web server and we apply these results to load balancing in clustering web server. It also discusses the mam steps needed to carry out a WWW performance analysis effort and shows relations between the workload characteristics and system resource usage. Also, we will introduce an optimal load balancing algorithm base on the RTSM (Real-Time Server Monitor) and Fuzzy Inference Engine for the local status of a real server, and the benefits is provided with of the suggested method.

  • PDF

An optimal feature selection algorithm for the network intrusion detection system (네트워크 침입 탐지를 위한 최적 특징 선택 알고리즘)

  • Jung, Seung-Hyun;Moon, Jun-Geol;Kang, Seung-Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2014.10a
    • /
    • pp.342-345
    • /
    • 2014
  • Network intrusion detection system based on machine learning methods is quite dependent on the selected features in terms of accuracy and efficiency. Nevertheless, choosing the optimal combination of features from generally used features to detect network intrusion requires extensive computing resources. For instance, the number of possible feature combinations from given n features is $2^n-1$. In this paper, to tackle this problem we propose a optimal feature selection algorithm. Proposed algorithm is based on the local search algorithm, one of representative meta-heuristic algorithm for solving optimization problem. In addition, the accuracy of clusters which obtained using selected feature components and k-means clustering algorithm is adopted to evaluate a feature assembly. In order to estimate the performance of our proposed algorithm, comparing with a method where all features are used on NSL-KDD data set and multi-layer perceptron.

  • PDF

HI gas kinematics of paired galaxies in the cluster environment from ASKAP pilot observations

  • Kim, Shin-Jeong;Oh, Se-Heon;Kim, Minsu;Park, Hye-Jin;Kim, Shinna
    • The Bulletin of The Korean Astronomical Society
    • /
    • v.46 no.2
    • /
    • pp.70.1-70.1
    • /
    • 2021
  • We examine the HI gas kinematics and distributions of galaxy pairs in group or cluster environments from high-resolution Australian Square Kilometer Array Pathfinder (ASKAP) WALLABY pilot observations. We use 32 well-resolved close pair galaxies from the Hydra, Norma, and NGC 4636, two clusters and a group of which are identified by their spectroscopy information and additional visual inspection. We perform profile decomposition of HI velocity profiles of the galaxies using a new tool, BAYGAUD which allows us to separate a line-of-sight velocity profile into an optimal number of Gaussian components based on Bayesian MCMC techniques. Then, we construct super profiles via stacking of individual HI velocity profiles after aligning their central velocities. We fit a model which consists of double Gaussian components to the super profiles, and classify them as kinematically cold and warm HI gas components with respect to their velocity dispersions, narrower or wider 𝜎, respectively. The kinematically cold HI gas reservoir (M_cold/M_HI) of the paired galaxies is found to be relatively higher than that of unpaired control samples in the clusters and the group, showing a positive correlation with the HI mass in general. Additionally, we quantify the gravitational instability of the HI gas disk of the sample galaxies using their Toomre Q parameters and HI morphological disturbances. While no significant difference is found for the Q parameter values between the paired and unpaired galaxies, the paired galaxies tend to have larger HI asymmetry values which are derived using their moment0 map compared to those of the non-paired control sample galaxies in the distribution.

  • PDF

An Optimal Allocation Mechanism of Location Servers in A Linear Arrangement of Base Stations (선형배열 기지국을 위한 위치정보 서버의 최적할당 방식)

  • Lim, Kyung-Shik
    • The Transactions of the Korea Information Processing Society
    • /
    • v.7 no.2
    • /
    • pp.426-436
    • /
    • 2000
  • Given a linear arrangement of n base stations which generate multiple types of traffic among themselves, we consider the problem of finding a set of disjoint clusters to cover n base statons so that a cluster is assigned a location server. Our goal is to minimize the total communication cost for the entire network where the cost of intra-cluster communication is usually lower than that of intercluster communication for each type of traffic. The optimization problem is transformed into an equivavalent problem using the concept of relative cost, which generates the difference of communication costs between intracluster and intercluster communications. Using the relative cost matrix, an efficient algorithm of O($mm^2$), where m is the number of clusters in a partition, is designed by dynamic programming. The algorithm also finds all thevalid partitions in the same polynomial time, given the size constraint on a cluster, and the total allowable communication cost for the entire network.

  • PDF

A Study on Type of Location Characteristics of Transfer Stations Using Data on Traffic Cards - Focused on Daegu City - (교통카드자료를 이용한 환승정류장의 유형별 입지특성에 관한 연구 - 대구시를 중심으로 -)

  • Kim, Ki-Hyuk;Lee, Seung-Cheol
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.31 no.4D
    • /
    • pp.519-526
    • /
    • 2011
  • In this study, the characteristics of transfer station are analyzed using the data of public transportation. The traffic card of Daegu does not include boarding information. The boarding data are calculated using traffic card data and the BMS data. It is found that transfer has increased by the distance from CBD and the numbers of routes, and decresed by the waiting time. Oneway ANNOVA are carried out to find the optimal number of clusters. Three clusters are chosen in this study. The center of the first cluster shows 2.99, so it has a characteristic of CBD. The second is 6.73, the outskirts of town, and the third is 12.78, the outlying areas.

A Study on Measuring the Similarity Among Sampling Sites in Lake Yongdam with Water Quality Data Using Multivariate Techniques (다변량기법을 활용한 용담호 수질측정지점 유사성 연구)

  • Lee, Yosang;Kwon, Sehyug
    • Journal of Environmental Impact Assessment
    • /
    • v.18 no.6
    • /
    • pp.401-409
    • /
    • 2009
  • Multivariate statistical approaches to classify sampling sites with measuring their similarity by water quality data and understand the characteristics of classified clusters have been discussed for the optimal water quality monitering network. For empirical study, data of two years (2005, 2006) at the 9 sampling sites with the combination of 2 depth levels and 7 important variables related to water quality is collected in Yongdam reservoir. The similarity among sampling sites is measured with Euclidean distances of water quality related variables and they are classified by hierarchical clustering method. The clustered sites are discussed with principal component variables in the view of the geographical characteristics of them and reducing the number of measuring sites. Nine sampling sites are clustered as follows; One cluster of 5, 6, and 7 sampling sites shows the characteristic of low water depth and main stream of water. The sites of 2 and 4 are clustered into the same group by characteristics of hydraulics which come from that of main stream. But their changing pattern of water quality looks like different since the site of 2 is near to dam. The sampling sites of 3, 8, and 9 are individually positioned due to the different tributary.

Optimizing the maximum reported cluster size for normal-based spatial scan statistics

  • Yoo, Haerin;Jung, Inkyung
    • Communications for Statistical Applications and Methods
    • /
    • v.25 no.4
    • /
    • pp.373-383
    • /
    • 2018
  • The spatial scan statistic is a widely used method to detect spatial clusters. The method imposes a large number of scanning windows with pre-defined shapes and varying sizes on the entire study region. The likelihood ratio test statistic comparing inside versus outside each window is then calculated and the window with the maximum value of test statistic becomes the most likely cluster. The results of cluster detection respond sensitively to the shape and the maximum size of scanning windows. The shape of scanning window has been extensively studied; however, there has been relatively little attention on the maximum scanning window size (MSWS) or maximum reported cluster size (MRCS). The Gini coefficient has recently been proposed by Han et al. (International Journal of Health Geographics, 15, 27, 2016) as a powerful tool to determine the optimal value of MRCS for the Poisson-based spatial scan statistic. In this paper, we apply the Gini coefficient to normal-based spatial scan statistics. Through a simulation study, we evaluate the performance of the proposed method. We illustrate the method using a real data example of female colorectal cancer incidence rates in South Korea for the year 2009.

Spatial Focalization of Zen-Meditation Brain Based on EEG

  • Liu, Chuan-Yi;Lo, Pei-Chen
    • Journal of Biomedical Engineering Research
    • /
    • v.29 no.1
    • /
    • pp.17-24
    • /
    • 2008
  • The aim of this paper is to report our preliminary results of investigating the spatial focalization of Zen-meditation EEG (electroencephalograph) in alpha band (8-13 Hz). For comparison, the study involved two groups of subjects, practitioners (experimental group) and non-practitioners (control group). To extract EEG alpha rhythm, wavelet analysis was applied to multi-channel EEG signals. Normalized alpha-power vectors were then constructed from spatial distribution of alpha powers, that were classified by Fuzzy C-means based algorithm to explore various brain spatial characteristics during meditation (or, at rest). Optimal number of clusters was determined by correlation coefficients of the membership-value vectors of each cluster center. Our results show that, in the experimental group, the incidence of frontal alpha activity varied in accordance with the meditation stage. The results demonstrated three different spatiotemporal modules consisting with three distinctive meditation stages normally recognized by meditation practitioners. The frontal alpha activity in two groups decreased in different ways. Particularly, monotonic decline was observed in the control group, and the experimental group showed increasing results. The phenomenon might imply various mechanisms employed by meditation and relaxation in modulating parietal alpha.