• Title/Summary/Keyword: 결정군집

Search Result 507, Processing Time 0.032 seconds

Fast VQ Codebook Design by Sucessively Bisectioning of Principle Axis (주축의 연속적 분할을 통한 고속 벡터 양자화 코드북 설계)

  • Kang, Dae-Seong;Seo, Seok-Bae;Kim, Dai-Jin
    • Journal of KIISE:Software and Applications
    • /
    • v.27 no.4
    • /
    • pp.422-431
    • /
    • 2000
  • This paper proposes a new codebook generation method, called a PCA-Based VQ, that incorporates the PCA (Principal Component Analysis) technique into VQ (Vector Quantization) codebook design. The PCA technique reduces the data dimensions by transforming input image vectors into the feature vectors. The cluster of feature vectors in the transformed domain is bisectioned into two subclusters by an optimally chosen partitioning hyperplane. We expedite the searching of the optimal partitioning hyperplane that is the most time consuming process by considering that (1) the optimal partitioning hyperplane is perpendicular to the first principal axis of the feature vectors, (2) it is located on the equilibrium point of the left and right cluster's distortions, and (3) the left and right cluster's distortions can be adjusted incrementally. This principal axis bisectioning is successively performed on the cluster whose difference of distortion between before and after bisection is the maximum among the existing clusters until the total distortion of clusters becomes as small as the desired level. Simulation results show that the proposed PCA-based VQ method is promising because its reconstruction performance is as good as that of the SOFM (Self-Organizing Feature Maps) method and its codebook generation is as fast as that of the K-means method.

  • PDF

Data Mining Analysis of Educational and Research Achievements of Korean Universities Using Public Open Data Services (정보공시 자료를 이용한 교육/연구성과 영향요인 추출 및 대학의 군집 분석)

  • Shin, Sun Mi;Kim, Hyeon Cheol
    • The Journal of Korean Association of Computer Education
    • /
    • v.17 no.1
    • /
    • pp.117-130
    • /
    • 2014
  • The purpose of this study is to provide useful knowledge for improving indicators that represent competitiveness and educational competency of the university by deriving a new pattern or the meaningful results from the data of information disclosure of universities using statistical analysis and data mining techniques. To achieve this, a model of decision tree was made and various factors that affect education/research performance such as employment rate, the number of technology transfer and papers per full-time faculty were explored. In addition to this, the cluster analysis of universities was conducted using attributes related to evaluation of university. According to the analysis, common factors affecting higher education/research performance are following indicators ; incoming student recruitment rate, enrollment rate, and the number of students per full-time faculty. In the cluster analysis, when performed by the entire university, the size, location of the university respectively, clusters are mainly formed by well-known universities, art physical non-science and engineering religious leaders training universities, and others. The main influencing factors of this cluster are higher education/research performance indicators such as employment rate and the number of technology transfer.

  • PDF

Speech Synthesis using Diphone Clustering and Improved Spectral Smoothing (다이폰 군집화와 개선된 스펙트럼 완만화에 의한 음성합성)

  • Jang, Hyo-Jong;Kim, Kwan-Jung;Kim, Gye-Young;Choi, Hyung-Il
    • The KIPS Transactions:PartB
    • /
    • v.10B no.6
    • /
    • pp.665-672
    • /
    • 2003
  • This paper describes a speech synthesis technique by concatenating unit phoneme. At that time, a major problem is that discontinuity is happened from connection part between unit phonemes, especially from connection part between unit phonemes recorded by different persons. To solve the problem, this paper uses clustered diphone, and proposes a spectral smoothing technique, not only using formant trajectory and distribution characteristic of spectrum but also reflecting human's acoustic characteristic. That is, the proposed technique performs unit phoneme clustering using distribution characteristic of spectrum at connection part between unit phonemes and decides a quantity and a scope for the smoothing by considering human's acoustic characteristic at the connection part of unit phonemes, and then performs the spectral smoothing using weights calculated along a time axes at the border of two diphones. The proposed technique removes the discontinuity and minimizes the distortion which can be occurred by spectrum smoothing. For the purpose of the performance evaluation, we test on five hundred diphones which are extracted from twenty sentences recorded by five persons, and show the experimental results.

Regional and Temporal Characteristics of Aquatic Organism Communities in Rice Paddy Fields, using Submerged Funnel Trap (수중트랩으로 채집된 논 수서생물 군집의 지역 및 시기별 특성)

  • Yoon, Sung-Soo;Kim, Myung-Hyun;Eo, Jinu;Kwon, Soon-Ik;Nam, Hyung-Kyu;Song, Young-Ju
    • Korean Journal of Environmental Biology
    • /
    • v.36 no.2
    • /
    • pp.99-106
    • /
    • 2018
  • Extensive monitoring of aquatic organisms in rice paddy fields has been difficult due to laborious sampling methods such as quadrat sampling using a hand net. This study aimed to analyze temporal and regional community compositions of aquatic organisms collected with a less time-consuming sampling method. This method involved using submerged funnel traps in rice paddy fields. Submerged funnel traps were useful for capturing taxa containing species that are indicative of environmental changes and highly mobile species that feed on waterbirds. Fifteen taxa including Ampullariidae, Cobitidae, Chironomidae, Hydrophilidae and Dytiscidae determined the community compositions. Among the major taxa, only Chironomidae resistant to environmental disturbances represented temporal variations of aquatic organism communities in rice paddy fields. Ampullariidae, Dytiscidae, and Hydrophilidae, which are prone to be affected by anthropogenic activities, differed among regions.

Comparison of the Similarity Among the Plant Communities of the Grazing Pasture by the Cluster-Analysis (군집분석을 이용한 방목초지 식물군락의 유사성 비교)

  • Park, Geun-Je;Spatz, G.
    • Journal of The Korean Society of Grassland and Forage Science
    • /
    • v.24 no.4
    • /
    • pp.293-300
    • /
    • 2004
  • This study was carried out to investigate the ecological behaviour forage value and similarity among the plant communities of the grazing pasture near Witzenhausen in middle part of Germany. Sixteen plant communities of the different grazing pasture were mostly the Molinio-Arrhenatheretea and Festuco-Brometea, and those were named the class of plant sociological nomenclature. The ecological behaviour and forage value of the communities except mesobromion(half dry grassland community) were relatively good for forage production. The correlation coefficient between class No. 14 and 12 of plant communities was highest, and the similarity among the communities were greatly affected by botanical composition. The resemblance measure of the cluster-analysis by complete-linkage-method for the similarity among plant communities was better the euclidean distance than those of others. The clustering analysis showed that the communities of relatively similar botanical composition were closely grouped.

Estimation of urban drinking water consumption patterns based on smart water grid monitoring data by k-means clustering in Vietnam (k-means 군집화 기법을 이용한 베트남 스마트워터그리드 계측 데이터 기반 도시 물 사용 패턴 추정)

  • Koo, Kang Min;Han, Kuk Heon;Lee, Gyumin;Jun, Kyung Soo;Yum, Kyung Taek
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2021.06a
    • /
    • pp.419-419
    • /
    • 2021
  • 수자원 관리 패러다임은 공급 위주에서 수요관리로 전환되고 있다. 가용한 수자원은 한정적이나 급속한 인구증가와 도시화로 인한 물 수요의 증가로 수요관리의 효율성이 중시되고 있기 때문이다. 기존 상수도시스템은 노후화로 가동효율이 점차 낮아지고 있으며, 인력으로 월 또는 격월로 소비자의 물 사용량을 검침해 실시간 관리가 불가능하여 수요와 공급의 불균형을 초래한다. 이러한 문제를 해결할 대안으로 IT 기술과 전통적인 물관리 기술을 접목한 Smart Water Grid는 양방향 통신장치를 이용해 실시간으로 소비자의 물 사용량을 모니터링한다. 물 사용 특성을 잘 파악하면 보다 정확한 물 수요 예측이 가능하다. 특히 소비자들의 시간별, 평일, 주말, 그리고 주별 물 사용 특성을 파악하면 미래 물 수요 예측에 도움이 된다. 예측된 물 수요량에 따라 물 공급 배분 계획을 수립하여 운영 효율성을 높일 수 있다. 물 수요예측 방법 중 k-mean 군집분석은 시간별 물 사용량을 이용해 서로 유사한 여러 개의 부분집합으로 할당하여 분류하는 Machine learing 방법으로 물 사용의 유사성을 파악할 수 있다. SWG 연구단은 2019년 Vietnam Hai Duong province에 SWG Pilot plant를 구축하고 27개의 Smart water meter를 설치하여 운영하고 있다. 이에 본 연구에서는 소비자의 물 사용 특성을 분석하기 위해 27개 SWM로부터 수신된 2019년 11월 14일부터 2020년 12월 3일까지 1시간 단위의 물 사용량 데이터를 수집하였다. 그리고 k-mean 군집 방법을 이용해 시간별, 평일, 주말, 그리고 주별 물 사용 특성을 분석하였다. 이 때 최적의 군집 개수 결정을 위해 Elbow 방법을 적용하였다. 분석 결과 각 소비자의 물 사용량 특성에 따라 평균 물 수요패턴 추정이 가능하며, 향후 물 수요 예측에 도움이 될 것으로 사료된다.

  • PDF

The Difference of Growth Environment and High School Students' Career Decision Making (고등학생의 성장환경에 따른 진로의사결정의 유형과 자기효능감의 차이)

  • Kim, Jin-Hee;Paik, Sun-Ah
    • Journal of Korean Home Economics Education Association
    • /
    • v.25 no.1
    • /
    • pp.1-14
    • /
    • 2013
  • The purpose of the study was investigated on that whether there was any statistical difference or not in terms of the career decision making according to the growth environment of High school students. The researcher surveyed 600 questionnaires for high school students and was using 539 questionnaires in the final analysis. They were analyzed by cluster analyses according to the growth environment scale and it classified into two cluster groups. The cluster group A and B had a significant difference on scores of the growth environment scale. The group A had more interested in activities such as political, social, intellectual, and cultural ones and participated at social and leisure activities. Moreover, the group was emphasis on moral and religious values. The group B got the higher score than group A about the score of the rational type of the career decision making: the group A got the higher score than the group B to the score of the dependent type. On the matter of the job information collection sub-scale, the group A got the higher score than the group B: the group A scored higher points than group B about the goal establishment and the self-estimate sub-scales. Therefore, in order to do the career-guidance of youths, each family had to provide positive growth environment which required various stimuli and affluent in experience settings for them to mature.

  • PDF

Clustering and classification of residential noise sources in apartment buildings based on machine learning using spectral and temporal characteristics (주파수 및 시간 특성을 활용한 머신러닝 기반 공동주택 주거소음의 군집화 및 분류)

  • Jeong-hun Kim;Song-mi Lee;Su-hong Kim;Eun-sung Song;Jong-kwan Ryu
    • The Journal of the Acoustical Society of Korea
    • /
    • v.42 no.6
    • /
    • pp.603-616
    • /
    • 2023
  • In this study, machine learning-based clustering and classification of residential noise in apartment buildings was conducted using frequency and temporal characteristics. First, a residential noise source dataset was constructed . The residential noise source dataset was consisted of floor impact, airborne, plumbing and equipment noise, environmental, and construction noise. The clustering of residential noise was performed by K-Means clustering method. For frequency characteristics, Leq and Lmax values were derived for 1/1 and 1/3 octave band for each sound source. For temporal characteristics, Leq values were derived at every 6 ms through sound pressure level analysis for 5 s. The number of k in K-Means clustering method was determined through the silhouette coefficient and elbow method. The clustering of residential noise source by frequency characteristic resulted in three clusters for both Leq and Lmax analysis. Temporal characteristic clustered residential noise source into 9 clusters for Leq and 11 clusters for Lmax. Clustering by frequency characteristic clustered according to the proportion of low frequency band. Then, to utilize the clustering results, the residential noise source was classified using three kinds of machine learning. The results of the residential noise classification showed the highest accuracy and f1-score for data labeled with Leq values in 1/3 octave bands, and the highest accuracy and f1-score for classifying residential noise sources with an Artificial Neural Network (ANN) model using both frequency and temporal features, with 93 % accuracy and 92 % f1-score.

Community structure of sessile organisms on PVC plates according to different submerged timings and durations in Jangmok Bay, Korea (남해 장목만에서 PVC판 투입시기와 투입기간에 따른 부착생물군집 구조)

  • Park, So-Hyun;Seo, Jin-Young;Choi, Jin-Woo
    • The Korean Journal of Malacology
    • /
    • v.27 no.2
    • /
    • pp.99-105
    • /
    • 2011
  • This study was conducted to compare the species composition of sessile organisms on the artificial substrates of PVC submerged at different time intervals and duration in Jangmok Bay, Geoje Island, southern coast of Korea. Three PVC plates were submerged at one month interval from March to October and retrieved in November, 2007. A mussel, Mytilus galloprovincialis exclusively occupied the artificial substrates submerged from March to April and occurred as a dominant species to July. An ascidian, Styela plicata occurred as a dominant sessile species from May to August. Balanus amphtrite, Bugula sp., and hydrozoans occurred as dominant species on the plates submerged from July to September. There was a mis-match between the peak time of settlement and dominance of sessile organisms due to the interspecies competitions when the PVC plates were retrieved in November. There was no clear relationship between submerged duration and the abundance of sessile organisms due to the different settlement period. M. galloprovincialis seemed to be a strong competitor which could exclude the previous recruiters of macroalgae by overgrowth and occupy the substrate surface and maintain its high population density by preventing the settlement of other species until late autumn. These results suggested that the composition of sessile organisms in vacant hard substrates could be determined by the combined effects of supply-timing of larvae and post-settlement competitions.

Selection of Optimal Variables for Clustering of Seoul using Genetic Algorithm (유전자 알고리즘을 이용한 서울시 군집화 최적 변수 선정)

  • Kim, Hyung Jin;Jung, Jae Hoon;Lee, Jung Bin;Kim, Sang Min;Heo, Joon
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.22 no.4
    • /
    • pp.175-181
    • /
    • 2014
  • Korean government proposed a new initiative 'government 3.0' with which the administration will open its dataset to the public before requests. City of Seoul is the front runner in disclosure of government data. If we know what kind of attributes are governing factors for any given segmentation, these outcomes can be applied to real world problems of marketing and business strategy, and administrative decision makings. However, with respect to city of Seoul, selection of optimal variables from the open dataset up to several thousands of attributes would require a humongous amount of computation time because it might require a combinatorial optimization while maximizing dissimilarity measures between clusters. In this study, we acquired 718 attribute dataset from Statistics Korea and conducted an analysis to select the most suitable variables, which differentiate Gangnam from other districts, using the Genetic algorithm and Dunn's index. Also, we utilized the Microsoft Azure cloud computing system to speed up the process time. As the result, the optimal 28 variables were finally selected, and the validation result showed that those 28 variables effectively group the Gangnam from other districts using the Ward's minimum variance and K-means algorithm.