• Title/Abstract/Keywords: k-Means Clustering

Search results: 1,119 items (processing time: 0.037 s)

Assessment through Statistical Methods of Water Quality Parameters(WQPs) in the Han River in Korea

  • Kim, Jae Hyoun
    • 한국환경보건학회지 / Vol. 41, No. 2 / pp. 90-101 / 2015
  • Objective: This study was conducted to develop a chemical oxygen demand (COD) regression model using water quality monitoring data (January 2014) obtained from the Han River auto-monitoring stations. Methods: Surface water quality data from 198 sampling stations in the six major areas were assembled and analyzed to determine the spatial distribution and clustering of monitoring stations based on 18 WQPs, and to build a regression model from selected parameters. Statistical techniques, including combined genetic algorithm-multiple linear regression (GA-MLR), cluster analysis (CA) and principal component analysis (PCA), were used to build a COD model from the water quality data. Results: The best GA-MLR model yielded a 5-descriptor COD model with satisfactory statistical results ($r^2 = 92.64$, $Q^2_{LOO} = 91.45$, $Q^2_{Ext} = 88.17$). This approach includes variable selection among the WQPs in order to find the most important factors affecting water quality. Additionally, ordination techniques such as PCA and CA were used to classify monitoring stations. The biplot based on the first two principal components (PCs) of the PCA model identified three distinct groups of stations that also differ in their correlation with the WQPs, which enables better interpretation of the water quality characteristics at particular stations as of January 2014. Conclusion: This data analysis procedure appears to provide an efficient means of modelling water quality by interpreting and defining its most essential variables, such as TOC and BOD. The water parameters selected in the COD model as contributing most to environmental health and water pollution can be used in water quality management strategies. At present, the river is under threat from anthropogenic disturbances during festival periods, especially in upstream areas.
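
The abstract above reports both a fitted $r^2$ and a leave-one-out $Q^2_{LOO}$. As a minimal sketch of how the two differ, the following uses a single-predictor least-squares line rather than the paper's 5-descriptor GA-MLR model. The function names and the sample numbers are hypothetical, and $Q^2_{LOO}$ is taken here as $1 - \mathrm{PRESS}/SS_{tot}$ with the overall mean in $SS_{tot}$, which is one common convention and may not be the one the authors used.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x; returns (a, b)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b


def total_sum_of_squares(ys):
    my = sum(ys) / len(ys)
    return sum((y - my) ** 2 for y in ys)


def r_squared(xs, ys):
    """Coefficient of determination of the line fitted to all points."""
    a, b = fit_line(xs, ys)
    ss_res = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))
    return 1.0 - ss_res / total_sum_of_squares(ys)


def q2_loo(xs, ys):
    """Leave-one-out Q^2: refit without point i, then predict point i."""
    press = 0.0
    for i in range(len(xs)):
        a, b = fit_line(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        press += (ys[i] - (a + b * xs[i])) ** 2
    return 1.0 - press / total_sum_of_squares(ys)


# Hypothetical illustration data (not from the study).
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]
print(r_squared(xs, ys), q2_loo(xs, ys))
```

Because each left-out point is predicted by a model that never saw it, $Q^2_{LOO}$ cannot exceed $r^2$ for a least-squares fit, which matches the ordering of the values reported in the abstract.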

Estimation of Drought Rainfall by Regional Frequency Analysis using L and LH-Moments (I) - On the Method of L-Moments -

  • 이순혁;윤성수;맹승진;류경식;주호길
    • 한국농공학회지 / Vol. 45, No. 5 / pp. 97-109 / 2003
  • This study derives design drought rainfall for consecutive durations using probability weighted moments of rainfall in a regional drought frequency analysis. The aim is to suggest optimal design drought rainfall for hydraulic structures, reflecting water requirements and the frequency of drought occurrence, for the safety of water utilization. First, the study derived an optimal regionalization of the precipitation data into climatologically and geographically homogeneous regions covering all of Korea except Cheju and Ulreung islands. Five homogeneous regions, in terms of topography and climate, were delineated by the K-means clustering method. Using the L-moment ratio diagram and the Kolmogorov-Smirnov test, the generalized extreme value distribution was confirmed as the best-fitting distribution among those applied. At-site and regional parameters of the generalized extreme value distribution were estimated by the method of L-moments. Design drought rainfalls for the consecutive durations were derived using L-moments by at-site and regional analysis of the observed data and of data simulated by Monte Carlo techniques. Relative root-mean-square error (RRMSE), relative bias (RBIAS) and relative reduction (RR) in RRMSE for the design drought rainfall derived by at-site and regional analysis of the observed and simulated data were computed and compared. The regional frequency analysis procedure was shown to perform substantially better than at-site analysis in terms of RRMSE, RBIAS and RR in RRMSE in the prediction of design drought rainfall. Consequently, optimal design drought rainfalls by region and consecutive duration were derived by the regional frequency analysis.
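
The abstract above estimates parameters by L-moments derived from probability weighted moments and compares methods by RRMSE and RBIAS. The sketch below shows the standard unbiased sample estimators for the first three L-moments and textbook forms of the two error measures. The function names are hypothetical, and the abstract does not state the exact estimator or error definitions the authors used, so treat these as assumptions.

```python
import math


def sample_l_moments(data):
    """Return (l1, l2, t3): L-location, L-scale and L-skewness.

    Uses the unbiased probability weighted moments b0, b1, b2 of the
    sorted sample, then l1 = b0, l2 = 2*b1 - b0, l3 = 6*b2 - 6*b1 + b0.
    """
    x = sorted(data)
    n = len(x)
    if n < 3:
        raise ValueError("need at least 3 observations")
    b0 = sum(x) / n
    b1 = sum(i * x[i] for i in range(n)) / (n * (n - 1))
    b2 = sum(i * (i - 1) * x[i] for i in range(n)) / (n * (n - 1) * (n - 2))
    l1 = b0
    l2 = 2 * b1 - b0
    l3 = 6 * b2 - 6 * b1 + b0
    return l1, l2, l3 / l2


def rbias(estimates, true_value):
    """Mean relative error of repeated estimates of one quantile."""
    return sum((e - true_value) / true_value for e in estimates) / len(estimates)


def rrmse(estimates, true_value):
    """Root of the mean squared relative error of repeated estimates."""
    sq = sum(((e - true_value) / true_value) ** 2 for e in estimates)
    return math.sqrt(sq / len(estimates))


# A symmetric toy sample has zero L-skewness.
print(sample_l_moments([1, 2, 3, 4, 5]))
```

In a Monte Carlo comparison such as the one described, `estimates` would hold the quantile estimates from each simulated sample and `true_value` the quantile of the distribution the samples were drawn from.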

Clustering of Skin Colors on Korean Adult Males and Their Preference Colors

  • 김구자
    • 한국의류학회지 / Vol. 27, No. 11 / pp. 1338-1349 / 2003
  • The color of apparel is closely interdependent with the skin color of the wearer. This study was carried out to group the skin colors of Korean males into several clusters of similar skin colors and to analyze their color preferences. The skin colors were measured quantitatively and classified into several clusters that have similar hue, value and chroma in the Munsell color system, which is used internationally to communicate colors. The sample consisted of 420 Korean males. Four points of the body were measured with a JX-777 color spectrometer. All subjects were shown 40 color chips and indicated their preferred colors. Data were analyzed by K-means cluster analysis, Duncan test, frequency analysis and Chi-square test using the SPSS WIN 10 statistical package. Findings were as follows: 1. The skin colors of Korean males were a mixture of YR, R, and Y skin colors. 2. The 420 subjects who have YR color were clustered into 3 skin color groups. 3. The average face color of all subjects was 4.81YR 5.91/4.97 in the Munsell color system, with an L value of 60.74, an a value of 13.71 and a b value of 24.54. Of the 420 subjects, 136 were Type 1 (4.50YR 6.35/4.87), 192 were Type 2 (4.62YR 5.86/5.12) and 92 were Type 3 (5.67YR 5.37/4.79). 4. The average skin color of all 420 subjects was 6.26YR 6.07/4.41, with an L value of 62.33, an a value of 10.64 and a b value of 23.48. The average skin color was 6.27YR 6.44/4.27 for Type 1, 6.15YR 5.91/4.49 for Type 2 and 6.49YR 5.84/4.43 for Type 3. 5. Across the 3 groups, the most preferred colors for sport/casual wear were 2.5Y 8/16 and 7.5PB 4/16, and the most preferred colors for their skin were 7.5PB 4/16 and 7.5YR 7/16.
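
The grouping of subjects into three skin color types described above is a K-means clustering. The study ran it in SPSS; the sketch below is a plain Lloyd's algorithm written for illustration only, with hypothetical function names, made-up (L, a, b) triples that are not data from the study, and initial centroids supplied by the caller so the run is deterministic.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, initial_centroids, max_iter=100):
    """Lloyd's algorithm: alternate assignment and centroid update.

    Returns (labels, centroids). Stops when assignments no longer change.
    """
    centroids = [tuple(c) for c in initial_centroids]
    labels = None
    for _ in range(max_iter):
        # Assignment step: nearest centroid for every point.
        new_labels = [
            min(range(len(centroids)), key=lambda k: sq_dist(p, centroids[k]))
            for p in points
        ]
        if new_labels == labels:
            break
        labels = new_labels
        # Update step: each centroid moves to the mean of its members.
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # keep the old centroid if a cluster is empty
                centroids[k] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centroids


# Made-up (L, a, b) measurements for six hypothetical subjects.
colors = [(65, 10, 22), (64, 11, 23), (59, 13, 25),
          (58, 14, 24), (54, 15, 26), (53, 16, 27)]
labels, centroids = kmeans(colors, [colors[0], colors[2], colors[4]])
print(labels)
print(centroids)
```

K-means results depend on the starting centroids, so a real analysis would normally repeat the run from several initializations and compare the within-cluster sums of squares.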

Multivariate Stratification Method for the Multipurpose Sample Survey: A Case Study of the Sample Design for Fisher Production Survey

  • 박진우;김영원;이석훈;신지은
    • 한국조사연구학회지:조사연구 / Vol. 9, No. 1 / pp. 69-85 / 2008
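  • Stratification is a standard way of using auxiliary information at the sample design stage and is widely used in most nationwide sample designs. To maximize the efficiency of stratification, it is very important to choose stratification variables suited to the purpose of the survey. In a multipurpose survey, where a single sample is used to measure several variables of interest at once, devising a stratification strategy with multivariate stratification variables becomes quite complex. This study deals with a stratification strategy for multipurpose surveys with a very large number of variables of interest. The statistical tools used for stratification are multivariate techniques such as factor analysis and cluster analysis: suitable stratification variables are first selected through factor analysis, and those variables are then used to form strata through cluster analysis. Specifically, the study covers the stratification process in the sample design for the fishery production survey (어업비계통생산량조사) of the Ministry of Maritime Affairs and Fisheries (해양수산부).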

A study on the ordering of PIM family similarity measures without marginal probability

  • 박희창
    • Journal of the Korean Data and Information Science Society / Vol. 26, No. 2 / pp. 367-376 / 2015
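  • Cluster analysis, one of the data mining techniques, groups observations with diverse characteristics into homogeneous clusters on the basis of similarity and is then used to examine the characteristics shared within each cluster. In this paper, upper and lower bounds are established for the similarity measures based on probabilistic interestingness measures (PIM) that do not take marginal probabilities into account, namely the Yule I and II, Michael, Digby, Baulieu, and Dispersion measures, and the ordering among them is thereby determined. As a result, three types of ordering relations were shown to hold, which was confirmed not only by mathematical proof but also with real and simulated data. Because each measure takes values closer to those of the measures at its bounds, the upper and lower bounds of each measure serve as a tool for classifying the various measures, and knowing how the measures relate in terms of actual values may help stabilize a given algorithm.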
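
To make the idea of ordering similarity measures concrete, the sketch below computes three measures on a 2x2 contingency table [[a, b], [c, d]]. It assumes that "Yule I and II" correspond to Yule's Q and Yule's Y and that the Digby measure is Digby's H with exponent 3/4; the listing does not define them, so this mapping is an assumption. All three share the form ((ad)^p - (bc)^p) / ((ad)^p + (bc)^p) with p = 1, 1/2 and 3/4, so for a positively associated table (ad > bc) they satisfy Y <= H <= Q. That inequality is a general property of this family, shown here for illustration, and is not claimed to be one of the paper's three ordering relations.

```python
def power_family(a, b, c, d, p):
    """((ad)^p - (bc)^p) / ((ad)^p + (bc)^p) for the table [[a, b], [c, d]].

    Assumes ad and bc are not both zero.
    """
    x = float(a * d) ** p
    y = float(b * c) ** p
    return (x - y) / (x + y)


def yule_q(a, b, c, d):
    """Yule's Q (assumed here to be 'Yule I')."""
    return power_family(a, b, c, d, 1.0)


def yule_y(a, b, c, d):
    """Yule's Y (assumed here to be 'Yule II')."""
    return power_family(a, b, c, d, 0.5)


def digby_h(a, b, c, d):
    """Digby's H, the same form with exponent 3/4."""
    return power_family(a, b, c, d, 0.75)


# Hypothetical table with positive association (ad > bc).
table = (40, 10, 10, 40)
print(yule_y(*table), digby_h(*table), yule_q(*table))
```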

Region-Based Moving Object Segmentation for Video Monitoring System

  • 이경미;김종배;이창우;김항준
    • 전자공학회논문지CI / Vol. 40, No. 1 / pp. 30-38 / 2003
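  • This paper proposes a method for segmenting moving objects in video. The method, which remains stable even when objects are small, occlude one another, or when the images are noisy, consists of two stages: motion detection and motion segmentation. For motion detection, regions containing motion are extracted by analyzing the difference image between adjacent frames; an adaptive thresholding method makes the extraction stable even under illumination changes or noise. In the motion segmentation stage, the regions where motion was detected are divided into initial regions, and neighboring regions are then merged according to their motion information, so that independently moving objects are segmented. Because motion segmentation is applied only to the detected regions, the method saves a great deal of computation, and experiments on real road images showed that it is suitable for video monitoring systems.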
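
The motion detection stage described above, a difference image followed by an adaptive threshold, can be sketched as follows. The abstract does not say how the threshold adapts, so this sketch assumes a simple rule: mean plus `k` standard deviations of the absolute frame difference. The function name, the parameter `k` and the toy frames are all hypothetical.

```python
import math


def motion_mask(prev_frame, curr_frame, k=2.0):
    """Flag pixels whose absolute frame difference exceeds mean + k * std.

    Frames are equal-sized lists of rows of grey levels.
    Returns (mask, threshold), where mask holds booleans per pixel.
    """
    diff = [
        [abs(c - p) for p, c in zip(prev_row, curr_row)]
        for prev_row, curr_row in zip(prev_frame, curr_frame)
    ]
    flat = [v for row in diff for v in row]
    mean = sum(flat) / len(flat)
    std = math.sqrt(sum((v - mean) ** 2 for v in flat) / len(flat))
    # The threshold follows the statistics of the current difference image,
    # so a global brightness change raises it instead of flagging everything.
    threshold = mean + k * std
    mask = [[v > threshold for v in row] for row in diff]
    return mask, threshold


# Toy 4x4 frames: one pixel changes strongly between frames.
prev_frame = [[10] * 4 for _ in range(4)]
curr_frame = [row[:] for row in prev_frame]
curr_frame[1][2] = 200
mask, threshold = motion_mask(prev_frame, curr_frame)
print(threshold, mask[1][2])
```

The segmentation stage, splitting the detected area into initial regions and merging neighbours by motion, would then operate only on the pixels where the mask is true.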

A Study on the Effects of Industry Types and Business Characteristics on Management Performance: For Japanese Logistics Companies

  • 구경모
    • 한국항만경제학회지 / Vol. 34, No. 2 / pp. 51-68 / 2018
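  • This study compares management performance across industry types in the logistics market and analyzes differences in the business characteristics of logistics companies. It further tests how differences in industry type and business characteristics affect management performance. Analysis of variance and cluster analysis were used, and the implications are as follows. First, in the Japanese logistics market, management performance differed across industry types, with the warehousing service industry showing higher profitability and stability than the others. Second, business behavior differed by industry type: marine freight transport had higher capital intensity than the other industries, while warehousing services had higher business leadership and trade credit. Finally, the interaction of the two factors, industry type and business-characteristic cluster, on management performance was found to be significant; for profitability, business characteristics acted differently in freight transport and in warehousing services. For stability, by contrast, business characteristics that lower capital intensity and reconsider business leadership were effective across all industry types.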

Automated Training from Landsat Image for Classification of SPOT-5 and QuickBird Images

  • Kim, Yong-Min;Kim, Yong-Il;Park, Wan-Yong;Eo, Yang-Dam
    • 대한원격탐사학회지 / Vol. 26, No. 3 / pp. 317-324 / 2010
  • In recent years, many automatic classification approaches have been employed. An automatic classification method can be effective and time-saving, and can produce objective results because it excludes operator intervention. This paper proposes a classification method based on automated training for high resolution multispectral images using ancillary data. Generally, it is problematic to automatically classify high resolution images using ancillary data because of the scale difference between the high resolution image and the ancillary data. To overcome this problem, the proposed method uses the classification results of a Landsat image as a medium for automatic classification. The Landsat image is classified by maximum likelihood classification, with the attributes of the ancillary data entered as training data. For the high resolution image, K-means clustering, an unsupervised classification method, was applied and the result was compared to the classification results of the Landsat image. Subsequently, the training data for the high resolution image were automatically extracted using regular rules based on a relational matrix that shows the relation between the two results. Finally, the high resolution image was classified and updated using the extracted training data. The proposed method was applied to QuickBird and SPOT-5 images of non-accessible areas. The results showed good performance in accuracy assessments. Therefore, we expect that the method can be used effectively to automatically construct thematic maps for non-accessible areas and to update areas that do not have any attributes in a geographic information system.
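
The relational matrix mentioned above relates the K-means clusters of the high resolution image to the classes of the Landsat result. The sketch below cross-tabulates the two label sets and gives a cluster a class name only when one class clearly dominates it. The abstract does not spell out the paper's "regular rules", so the majority rule, the `min_share` parameter, the function names and the sample labels are all assumptions, and both label lists are assumed to be already aligned pixel by pixel.

```python
from collections import Counter, defaultdict


def relational_matrix(cluster_ids, class_labels):
    """Cross-tabulate cluster ids against reference class labels.

    Returns {cluster_id: Counter({class_label: pixel_count})}.
    """
    matrix = defaultdict(Counter)
    for cluster, label in zip(cluster_ids, class_labels):
        matrix[cluster][label] += 1
    return matrix


def label_clusters(matrix, min_share=0.6):
    """Assign a class to each cluster whose dominant class reaches min_share.

    Clusters without a clear majority stay unlabelled and would not
    contribute training pixels.
    """
    assigned = {}
    for cluster, counts in matrix.items():
        label, hits = counts.most_common(1)[0]
        if hits / sum(counts.values()) >= min_share:
            assigned[cluster] = label
    return assigned


# Hypothetical per-pixel labels: K-means cluster ids and Landsat classes.
clusters = [0, 0, 0, 1, 1, 1, 2, 2]
landsat = ["water", "water", "water", "forest", "forest", "urban", "urban", "forest"]
matrix = relational_matrix(clusters, landsat)
print(label_clusters(matrix))
```

Pixels belonging to the labelled clusters could then serve as automatically extracted training samples for a supervised classifier of the high resolution image.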

Delineation of Rice Productivity Projected via Integration of a Crop Model with Geostationary Satellite Imagery in North Korea

  • Ng, Chi Tim;Ko, Jonghan;Yeom, Jong-min;Jeong, Seungtaek;Jeong, Gwanyong;Choi, Myungin
    • 대한원격탐사학회지 / Vol. 35, No. 1 / pp. 57-81 / 2019
  • Satellite images can be integrated into a crop model to combine the strengths of both techniques for crop monitoring and to compensate for each other's weaknesses, an approach that can be applied systematically to monitoring inaccessible croplands. The objective of this study was to outline the productivity of paddy rice based on simulation of the yield of all paddy fields in North Korea, using a grid crop model combined with optical satellite imagery. The grid GRAMI-rice model was used to simulate paddy rice yields for inaccessible North Korea based on bidirectional reflectance distribution function-adjusted vegetation indices (VIs) and solar insolation. VIs and solar insolation for the model simulation were obtained from the Geostationary Ocean Color Imager (GOCI) and the Meteorological Imager (MI) sensors of the Communication, Ocean and Meteorological Satellite (COMS). Reanalysis data of air temperature were obtained from the Korea Local Analysis and Prediction System (KLAPS). Study results showed that the yields of paddy rice were reproduced with a statistically significant range of accuracy. The regional characteristics of crops for all of the sites in North Korea were grouped into four clusters through a spatial analysis using the K-means clustering approach. The study demonstrated the potential effectiveness of characterizing crop productivity by incorporating satellite images into a crop model, a technique that has proven consistent for monitoring crop productivity in inaccessible regions.

Motherhood Ideology and Parenting Stress according to Parenting Behavior Patterns of Married Immigrant Women with Young Children

  • 문소현;김미옥;나현
    • 대한간호학회지 / Vol. 49, No. 4 / pp. 449-460 / 2019
  • Purpose: This study aims to provide base data for designing education and counseling programs for child-raising by identifying the types, characteristics and predictors of parenting behaviors of married immigrant women. Methods: We used a self-report questionnaire to survey 126 immigrant mothers of young children, who agreed to participate and who could speak Korean, Vietnamese, Chinese, Filipino, or English, at two children's hospitals and two multicultural support centers. Statistical analysis was conducted using descriptive analysis, K-means clustering, $\chi^2$ test, Fisher's exact test, one-way ANOVA, Scheffé's test, and multinomial logistic regression. Results: We identified three clusters of parenting behaviors: 'affectionate acceptance group' (38.9%), 'active engaging group' (26.2%), and 'passive parenting group' (34.9%). The passive parenting and affectionate acceptance groups were distinguished by the conversation time between couples (p=.028, OR=5.52), ideology of motherhood (p=.032, OR=4.33), and parenting stress between parent and child (p=.049, OR=0.22). The passive parenting group was distinguished from the active engaging group by support from spouses for participating in multicultural support centers or relevant programs (p=.011, OR=2.37), and ideology of motherhood (p=.001, OR=16.65). Ideology of motherhood was also the distinguishing factor between the affectionate acceptance and active engaging groups (p=.041, OR=3.85). Conclusion: Since immigrant women's parenting type depends on their ideology of motherhood, parenting stress, and spousal relationships in terms of communication and support for their child-raising and socio-cultural adaptation, it is necessary to provide them with systematic education and support, as well as interventions across personal, family, and community levels.