• Title/Summary/Keyword: Hierarchical cluster analysis

Search Result 302, Processing Time 0.023 seconds

Charaterization of Cities in Seoul Metropolitan Area by Cluster Analysis (군집분석을 이용한 수도권 도시의 유형화에 관한 연구)

  • Song, Min-Kyung;Chang, Hoon
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.18 no.1
    • /
    • pp.83-88
    • /
    • 2010
  • This paper has analyzed Seoul metropolitan area on the basis of cluster characteristics and it is to understand the traits of each clusters. In order to modelize the area, 10 different indicators were selected among components of a city such as population, activities, land and facilities. Also through principal component analysis, similar characteristics or congenialities of the variables were derived as a common factor. The result was organized by factor score from hierarchical clustering method and as a final result, metropolitan area was clustered into five areas.

Cluster analysis by month for meteorological stations using a gridded data of numerical model with temperatures and precipitation (기온과 강수량의 수치모델 격자자료를 이용한 기상관측지점의 월별 군집화)

  • Kim, Hee-Kyung;Kim, Kwang-Sub;Lee, Jae-Won;Lee, Yung-Seop
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.5
    • /
    • pp.1133-1144
    • /
    • 2017
  • Cluster analysis with meteorological data allows to segment meteorological region based on meteorological characteristics. By the way, meteorological observed data are not adequate for cluster analysis because meteorological stations which observe the data are located not uniformly. Therefore the clustering of meteorological observed data cannot reflect the climate characteristic of South Korea properly. The clustering of $5km{\times}5km$ gridded data derived from a numerical model, on the other hand, reflect it evenly. In this study, we analyzed long-term grid data for temperatures and precipitation using cluster analysis. Due to the monthly difference of climate characteristics, clustering was performed by month. As the result of K-Means cluster analysis is so sensitive to initial values, we used initial values with Ward method which is hierarchical cluster analysis method. Based on clustering of gridded data, cluster of meteorological stations were determined. As a result, clustering of meteorological stations in South Korea has been made spatio-temporal segmentation.

The Habitat Classification of mammals in Korea based on the National Ecosystem Survey (전국자연환경조사를 활용한 포유류 서식지 유형의 분류)

  • Lee, Hwajin;Ha, Jeongwook;Cha, Jinyeol;Lee, Junghyo;Yoon, Heenam;Chung, Chulun;Oh, Hongshik;Bae, Soyeon
    • Journal of Environmental Impact Assessment
    • /
    • v.26 no.2
    • /
    • pp.160-170
    • /
    • 2017
  • The purpose of this study is to perform clustering of the habitat types and to identify the characteristics of species in the habitat types using mammal data (70,562) of the 3rd National Ecosystem Survey conducted from 2006 to 2012. The 15 habitat types recorded in the field-paper of the 3rd National ecosystem survey were reclassified, which was followed by the statistical analysis of mammal habitat types. In the habitat types cluster analysis, non-hierarchical cluster analysis (k-means cluster analysis), hierarchical cluster analysis, and non-metric multidimensional scaling method were applied to 14 habitat types recorded more than 30 times. A total of 7 Orders, 16 Families, and 39 Species of mammals were identified in the 3rd National Ecosystem Survey collected nationwide. When 11 clusters were classified by habitat types, the simple structure index was the highest (ssi = 0.07). As a result of the similarities and hierarchies between habitat types suggested by the hierarchical clustering analysis, the residential areas were the most different habitat types for mammals; the next following type was a cluster together with rivers and coasts. The results of the non-metric multidimensional scaling analysis demonstrated that both Mus musculus and Rattus norvegicus restrictively appeared in a residential area, which is the most discriminating habitat type. Lutra lutra restrictively appeared in coastal and river areas. In summary, according to our results, the mammalian habitat can be divided into the following four types: (1) the forest type (using forest as the main habitat and migration route); (2) the river type (using water as the main habitat); (3) the residence habitat (living near residential area); and (4) the lowland type (consuming grain or seeds as the main feeding resource).

The Comparative Analysis of the Reasons for Decreases in Marin Fishery Resources Based on AHP & duster Analysis (AHP - 군집분석을 이용한 주요어종의 자원감소 원인 비교분석에 관한 연구)

  • Park, Cheol-Hyung;Lee, Sang-Go
    • The Journal of Fisheries Business Administration
    • /
    • v.40 no.3
    • /
    • pp.127-146
    • /
    • 2009
  • This study is to estimate the factor weights of the reasons for decreases in marine fishery resources using the Analytical Hierarchy Process. Furthermore, it classifies 20 fishes under a fishery resource recovery plan into various groups of fishes according to these factor weights using the non-hierarchial cluster analysis. The factors of decreases in marine fishery resources are identified as bio-ecological, technology-system, economic-business, and fishing village-society factors. Two of the most important factors of decreases in resource are turned out to be the economic-business and bio-ecological factors, estimated as 31% and 30% respectively. The technology-system and fishing village-society factors are estimated as 21% and 18% respectively. The study utilizes non-hierarchical cluster analysis in order to classify 20 fishes into 2, 3, and 4 groups. K-means cluster analysis is applied for grouping in conjunction with ANOVA to identify statistical differences in factors. Once again, the economic-business and bio-economic factors play main role in grouping 2-groups of fishes case. The third group of fishes in addition to the previous 2 groups of fishes appears as those 4 factors of decrease evenly play about the same role at a 3-groups of fishes case. Finally, the economic-business and bio-economic factors are turned out to be evenly important in the 4th group once there are 4-groups of fishes.

  • PDF

Agglomerative Hierarchical Clustering Analysis with Deep Convolutional Autoencoders (합성곱 오토인코더 기반의 응집형 계층적 군집 분석)

  • Park, Nojin;Ko, Hanseok
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.1
    • /
    • pp.1-7
    • /
    • 2020
  • Clustering methods essentially take a two-step approach; extracting feature vectors for dimensionality reduction and then employing clustering algorithm on the extracted feature vectors. However, for clustering images, the traditional clustering methods such as stacked auto-encoder based k-means are not effective since they tend to ignore the local information. In this paper, we propose a method first to effectively reduce data dimensionality using convolutional auto-encoder to capture and reflect the local information and then to accurately cluster similar data samples by using a hierarchical clustering approach. The experimental results confirm that the clustering results are improved by using the proposed model in terms of clustering accuracy and normalized mutual information.

PCR Analysis for Genetic Distances of Two Charybdis Crab Populations

  • Yoon, Jong-Man
    • Development and Reproduction
    • /
    • v.26 no.2
    • /
    • pp.91-98
    • /
    • 2022
  • Genomic DNA (gDNA) set apart from two populations of Korean Charybdis crab (Charybdis japonica) was augmented by PCR experiments. The five oligonucleotides primers (ONT-primers) were spent to yield the number of unique loci shared to each crab population (ULSECP) and number of loci shared by the two crab populations (LSTCP). 305 fragments (FRAGs) were identified in the Charybdis crab population A (CCPA), and 344 in the Charybdis crab population B (CCPB): 44 number of ULSECP (14.43%) in the CCPA and 110 (31.98%) in the CCPB. 44 number of LSTCP, with an average of 8.8 per primer, were detected in the two crab populations. The bandsharing (BS) value between entity's no. 01 and no. 10 was the lowest (0.371) between the two CCPs. The average bandsharing (ABS) values of individuals in the CCPA (0.575±0.014) were lesser than in those originated from the CCPB (0.705±0.011) (p < 0.05). The polar hierarchical dendrogram (PHD) achieved by the five ONT-primers denotes three genetic clusters (GCs): cluster I (CHARYBCRAB 01, 04, 05, 06, and 08), cluster II (CHARYBCRAB 02, 03, 07, 09, 10, and 11) and cluster III (CHARYBCRAB 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, and 22). The shortest genetic distance (GD) displaying significant molecular difference (MD) was between individuals CHARYBCRAB no. 18 and CHARYBCRAB no. 17 (0.055).

Evaluation of Shopping Items: Focused on Purchase of Foreign Tourists in South Korea

  • Jeong, Dong-Bin
    • East Asian Journal of Business Economics (EAJBE)
    • /
    • v.7 no.2
    • /
    • pp.21-30
    • /
    • 2019
  • Purpose - In this work, we categorize the 21 shopping items which foreign tourists purchase in South Korea and monitor the level of dissimilarity (or similarity) between each item by utilizing distance matrix, and both hierarchical and k-means cluster analyses, respectively, based on several purpose of visit attributes in 2017. In addition, multidimensional scaling (MDS) method is applied for mining visual appearance of proximities among shopping items based on purpose of visit attributes. Research design and methodology - This study is carried out in 2017 by Ministry of Culture, Sports and Tourism and conduct a face-to-face survey of foreign tourists from 20 countries who purchase shopping items in South Korea. CLUSTER, PROXIMITIES and ALSCAL modules in IBM SPSS 23.0 are used to perform this work. Results - We ascertain that 21 shopping items can be classified into five similar groups which have homogeneous traits by going through two-step cluster analysis. We can position homogeneous places of cluster and shopping items joining each cluster. Conclusions - We can relatively assess patterns and characteristics of each shopping item, come by useful information in activating shopping tour based on the actual state of recognition of foreign tourists and practically apply to each tourism industry on underlying results.

The Studies on the Statistical Reliability and Significancy of the Questionnaire for the Sasang Constitution (사상체질판별(四象體質判別) 검사지(檢査紙) 문항(問項)의 타당성(妥當性)과 신뢰성(信賴性) 및 응답자(應答者) 개체분석(應答者)에 관한 연구(硏究))

  • Lee, Hwa-Seop;An, Taek-Won
    • Journal of Haehwa Medicine
    • /
    • v.12 no.2
    • /
    • pp.177-197
    • /
    • 2004
  • 1. The values of Cronbach's alpha for the Taeyang, Taeum, Soyang and Soeum questionnaire were 0.7955, 0.7776, 0.8545, and 0.8601 respectively. These results indicate a highly satisfactory level of internal consistency for the questionnaire. 2. If the deletion of an item increases Cronbach's alpha then what that means is that the deletion of that item improves reliability. Therefore, any items that result in substantially greater values of alpha than the overall alpha may need to be deleted from the questionnaire to improve its reliability. 3. Factor analysis was performed on the 81 questionnaires. Based on the scree plot and the number and decrement of eigen values greater than one, three to four factor solution was most significant. 4. The hierarchical cluster analysis was performed on the 81 Sasang constitution questionnaire. These results suggested that two or four clusters identified with homogeneous groups 5. The hierarchical cluster analysis was performed on the 1046 responders. These results suggested that two, three, or four clusters might identified with homogeneous groups. Furthemore, there were statistically significant difference among the each group by ANOVA(P<0.0001).

  • PDF

Market Segmentation Based on Types of Motivations to Visit Coffee Shops (커피전문점 방문동기유형에 따른 시장세분화)

  • Lee, Yong-Sook;Kim, Eun-Jung;Park, Heung-Jin
    • The Korean Journal of Franchise Management
    • /
    • v.7 no.1
    • /
    • pp.21-29
    • /
    • 2016
  • Purpose - The primary purpose of this study is to employ effective marketing methods using market segmentation of coffee shops by determining how motivations to visit coffee shops have different impacts on demographic profile of visitors and characteristics of coffee shop visits, so as to draw out a better understanding of customers of coffee market. Research design, data, and methodology - Data were collected using surveys of self-administered questionnaires toward coffee shop users in Daejeon, Korea. A number of samples used in data analysis were 253 excluding unusable responses. The data were analyzed through frequency, reliability, and factor analysis using SPSS 20.0. Factor analysis was conducted through the principal component analysis and varimax rotation method to derive factors of one or more eigen values. In addition, the cluster analysis, multivariate ANOVA, and cross-tab analysis were used for the market segmentation based on the types of motivation for coffee shop visits. The process of the cluster analysis is as follows. Four clusters were derived through hierarchical clustering, and k-means cluster analysis was then carried out using mean value of the four clusters as the initial seed value. Result - The factor analysis delineated four dimensions of motivation to visit coffee shops: ostentation motivation, hedonic motivation, esthetic motivation, utility motivation. The cluster analysis yielded four clusters: utility and esthetic seekers, hedonic seekers, utility seekers, ostentation seekers. In order to further specify the profile of four clusters, each cluster was cross tabulated with socio-demographics and characteristics of coffee shop visits. Four clusters are significantly different from each other by four types of motivations for coffee shop visits. Conclusions - This study has empirically examined the difference in demographic profile of visitors and characteristics of coffee shop visits by motivation to visit coffee shops. There are significant differences according to age, education background, marital status, occupation and monthly income. In addition, coffee shops use pattern characterization in frequency of visits to coffee shops, relationships with companion, purpose of visit, information sources, brand type, average expense per visit, important elements of selection attribute were significantly different depending on motivations for coffee shop visits.

Classification of Bodytype on Adult Male for the Apparel Sizing System (Part 3) -Bodytype of Trunk from the Photoqraphic Data- (남성복의 치수규격을 위한 체형분류(제3보) -사진자료에 의한 동체부의 분류-)

  • 김구자
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.19 no.6
    • /
    • pp.924-932
    • /
    • 1995
  • Concept of the comfort and fitness has become a major concern in the basic function of the ready.made clothes. Until now ready-made clothes were not made by on the basis of the bodytype, but by the body size only This research was performed to classify and characterize the bodytypes of Korean adult males. Sample size was 1290 subjects and their age range was from 19 to 54 years old. 25 variables from the photographic data were applied to analyze the bodytype of trunk. Data were analyzed by the multivariate method, especially factor and cluster analysis. The groups forming a cluster can be subdivided into 5 sets by crosstabulation extracted by the hierarchical cluster analysis. 5 bodytypes classified by the photographic sources could be combined with the anthropometric data and were demonstrated with 5 silhouette. Type 3 and 4 in trunk were dominant and were composed of the majority of 55.6% of the subjects. Bodytypes of Korean males were influenced by the degree of posture erectness and of curvature of the front side of the body in waist and abdomen.

  • PDF