• Title/Summary/Keyword: k-Means clustering

Search Result 1,107, Processing Time 0.024 seconds

Clustering of Skin Colors on Korean Adult Males and Their Preference Colors (한국 성인 남성의 피부색 분류와 선호색에 대한 연구)

  • 김구자
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.27 no.11
    • /
    • pp.1338-1349
    • /
    • 2003
  • The color of apparels has the close interdependency on the skin colors of the wearers. This study was carried out to group the skin colors of Korean males into several similar skin colors and to analyze their preference colors. The skin colors were measured quantitatively and classified into several clusters that has similar hue, value and chroma with Munsell color system that is internationally used to communicate the colors. Sample size was 420 Korean males. With color spectrometer, JX-777, 4 points of the body were measured. All subjects had been shown with 40 color chips and answered their preference colors. Data were analysed by K-means Cluster analysis, Duncan test, Frequency and Chi square test using SPSS WIN 10 statistical package. Findings were as follows: 1. The skin colors of Korean males were mixed with skin colors of YR, R, and Y. 2. 420 subjects who have YR color were clustered in 3 kinds of skin color groups. 3. The average face color of total subjects was 4.81YR 5.91/4.97 in Munsell color system, 60.74 in L value, 13.71 in a value, 24.54 in b value. 136 observations out of 420 subjects were composed of Type 1: 4.50YR 6.35/4.87 and 192 observations were composed of Type 2: 4.62YR 5.86/5.12 and 92 observations were composed of Type 3: 5.67YR 5.37/4.79. 4. The average skin color of total 420 subjects was 6.26YR 6.07/4.41 and 62.33 in L value, 10.64 in a value, 23.48 in b value. The average skin color of Type 1 was 6.27YR 6.44/4.27 and of Type 2 was 6.15YR 5.91/4.49 and of Type 3 was 6.49YR 5.84/4.43 respectively. 5. 3 groups showed that the most preference color of sport$.$casual was 2.5Y 8/16 and 7.5PB 4/16 and the most preference color to their skins was 7.5PB 4/16 and 7.5YR 7/16.

Multivariate Stratification Method for the Multipurpose Sample Survey : A Case Study of the Sample Design for Fisher Production Survey (다목적 표본조사를 위한 다변량 층화 : 어업비계통생산량조사를 위한 표본설계 사례)

  • Park, Jin-Woo;Kim, Young-Won;Lee, Seok-Hoon;Shin, Ji-Eun
    • Survey Research
    • /
    • v.9 no.1
    • /
    • pp.69-85
    • /
    • 2008
  • Stratification is a feature of the majority of field sample design. This paper considers the multivariate stratification strategy for multipurpose sample survey with several auxiliary variables. In a multipurpose survey, stratification procedure is very complicated because we have to simultaneously consider the efficiencies of stratification for several variables of interest. We propose stratification strategy based on factor analysis and cluster analysis using several stratification variables. To improve the efficiency of stratification, we first select the stratification variables by factor analysis, and then apply the K-means clustering algorithm to the formation of strata. An application of the stratification strategy in the sampling design for the Fisher Production Survey is discussed, and it turns out that the variances of estimators are significantly less than those obtained by simple random sampling.

  • PDF

A study on the ordering of PIM family similarity measures without marginal probability (주변 확률을 고려하지 않는 확률적 흥미도 측도 계열 유사성 측도의 서열화)

  • Park, Hee Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.2
    • /
    • pp.367-376
    • /
    • 2015
  • Today, big data has become a hot keyword in that big data may be defined as collection of data sets so huge and complex that it becomes difficult to process by traditional methods. Clustering method is to identify the information in a big database by assigning a set of objects into the clusters so that the objects in the same cluster are more similar to each other clusters. The similarity measures being used in the cluster analysis may be classified into various types depending on the nature of the data. In this paper, we computed upper and lower limits for probability interestingness measure based similarity measures without marginal probability such as Yule I and II, Michael, Digby, Baulieu, and Dispersion measure. And we compared these measures by real data and simulated experiment. By Warrens (2008), Coefficients with the same quantities in the numerator and denominator, that are bounded, and are close to each other in the ordering, are likely to be more similar. Thus, results on bounds provide means of classifying various measures. Also, knowing which coefficients are similar provides insight into the stability of a given algorithm.

Region-Based Moving Object Segmentation for Video Monitoring System (비디오 감시시스템을 위한 영역 기반의 움직이는 물체 분할)

  • 이경미;김종배;이창우;김항준
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.40 no.1
    • /
    • pp.30-38
    • /
    • 2003
  • This paper presents an efficient region-based motion segmentation method for segmenting of moving objects in a traffic scene with a focus on a Video Monitoring System (VMS). The presented method consists of two phases: motion detection and motion segmentation. Using the adaptive thresholding technique, the differences between two consecutive frames are analyzed to detect the movements of objects in a scene. To segment the detected regions into meaningful objects which have the similar intensity and motion information, the regions are initially segmented using a k-means clustering algorithm and then, the neighboring regions with the similar motion information are merged. Since we deal with not the whole image, but the detected regions in the segmentation phase, the computational cost is reduced dramatically. Experimental results demonstrate robustness in the occlusions among multiple moving objects and the change in environmental conditions as well.

A Study on the Effects of Industry Types and Business Characteristics on Management Performance: For Japanese Logistics Companies (물류기업의 업종과 사업특성이 경영성과에 미치는 영향에 관한 연구 -일본 물류기업을 대상으로-)

  • Koo, Kyoung-Mo
    • Journal of Korea Port Economic Association
    • /
    • v.34 no.2
    • /
    • pp.51-68
    • /
    • 2018
  • This paper compares the differences in management performance in the logistics market and analyzes the differences in business characteristics depending on the industry types. In addition, the effects of industry types and business characteristics on management performance are examined. The analysis method used is ANOVA and K-means clustering. The implication of the study are as follows. First, in the logistics market in Japan, there was a difference in management performance among the types of industry. The warehousing service type had the highest profitability and stability among all the industry types. Second, differences in business characteristics by industry types were tested. It was found that offshore cargo transportation type was more capital intensive than the other types. In addition, warehousing service type had higher business leadership and credit transaction than others. Third, industry types and clusters based on business characteristics had a significant impact on management performance through interaction effects. For the profitability variables in detail, other clusters had a significant effect between transportation types(onshore and offshore cargo) and warehousing service type. On the other hand, in stability variables, one cluster was effective in all types, which is a characteristic that lowers both capital intensity and business leadership.

Automated Training from Landsat Image for Classification of SPOT-5 and QuickBird Images

  • Kim, Yong-Min;Kim, Yong-Il;Park, Wan-Yong;Eo, Yang-Dam
    • Korean Journal of Remote Sensing
    • /
    • v.26 no.3
    • /
    • pp.317-324
    • /
    • 2010
  • In recent years, many automatic classification approaches have been employed. An automatic classification method can be effective, time-saving and can produce objective results due to the exclusion of operator intervention. This paper proposes a classification method based on automated training for high resolution multispectral images using ancillary data. Generally, it is problematic to automatically classify high resolution images using ancillary data, because of the scale difference between the high resolution image and the ancillary data. In order to overcome this problem, the proposed method utilizes the classification results of a Landsat image as a medium for automatic classification. For the classification of a Landsat image, a maximum likelihood classification is applied to the image, and the attributes of ancillary data are entered as the training data. In the case of a high resolution image, a K-means clustering algorithm, an unsupervised classification, was conducted and the result was compared to the classification results of the Landsat image. Subsequently, the training data of the high resolution image was automatically extracted using regular rules based on a RELATIONAL matrix that shows the relation between the two results. Finally, a high resolution image was classified and updated using the extracted training data. The proposed method was applied to QuickBird and SPOT-5 images of non-accessible areas. The result showed good performance in accuracy assessments. Therefore, we expect that the method can be effectively used to automatically construct thematic maps for non-accessible areas and update areas that do not have any attributes in geographic information system.

Delineation of Rice Productivity Projected via Integration of a Crop Model with Geostationary Satellite Imagery in North Korea

  • Ng, Chi Tim;Ko, Jonghan;Yeom, Jong-min;Jeong, Seungtaek;Jeong, Gwanyong;Choi, Myungin
    • Korean Journal of Remote Sensing
    • /
    • v.35 no.1
    • /
    • pp.57-81
    • /
    • 2019
  • Satellite images can be integrated into a crop model to strengthen the advantages of each technique for crop monitoring and to compensate for weaknesses of each other, which can be systematically applied for monitoring inaccessible croplands. The objective of this study was to outline the productivity of paddy rice based on simulation of the yield of all paddy fields in North Korea, using a grid crop model combined with optical satellite imagery. The grid GRAMI-rice model was used to simulate paddy rice yields for inaccessible North Korea based on the bidirectional reflectance distribution function-adjusted vegetation indices (VIs) and the solar insolation. VIs and solar insolation for the model simulation were obtained from the Geostationary Ocean Color Imager (GOCI) and the Meteorological Imager (MI) sensors of the Communication Ocean and Meteorological Satellite (COMS). Reanalysis data of air temperature were achieved from the Korea Local Analysis and Prediction System (KLAPS). Study results showed that the yields of paddy rice were reproduced with a statistically significant range of accuracy. The regional characteristics of crops for all of the sites in North Korea were successfully defined into four clusters through a spatial analysis using the K-means clustering approach. The current study has demonstrated the potential effectiveness of characterization of crop productivity based on incorporation of a crop model with satellite images, which is a proven consistent technique for monitoring of crop productivity in inaccessible regions.

Motherhood Ideology and Parenting Stress according to Parenting Behavior Patterns of Married Immigrant Women with Young Children (유아기 자녀를 둔 결혼이주여성의 양육행위 유형별 모성이데올로기 및 양육스트레스)

  • Moon, So-Hyun;Kim, Miok;Na, Hyeun
    • Journal of Korean Academy of Nursing
    • /
    • v.49 no.4
    • /
    • pp.449-460
    • /
    • 2019
  • Purpose: This study aims to provide base data for designing education and counseling programs for child-raising by identifying the types, characteristics and predictors of parenting behaviors of married immigrant women. Methods: We used a self-report questionnaire to survey 126 immigrant mothers of young children, who agreed to participate, and who could speak Korean, Vietnamese, Chinese, Filipino, or English, at two children's hospitals and two multicultural support centers. Statistical analysis was conducted using descriptive analysis, K-means clustering, ${\chi}^2$ test, Fisher's exact test, one-way ANOVA, $Sch{\acute{e}}ffe^{\prime}s$ test, and multinominal logistic regression. Results: We identified three clusters of parenting behaviors: 'affectionate acceptance group' (38.9%), 'active engaging group' (26.2%), and 'passive parenting group' (34.9%). Passive parenting and affectionate acceptance groups were distinguished by the conversation time between couples (p=.028, OR=5.52), ideology of motherhood (p=.032, OR=4.33), and parenting stress between parent and child (p=.049, OR=0.22). Passive parenting was distinguished from active engaging group by support from spouses for participating in multicultural support centers or relevant programs (p=.011, OR=2.37), and ideology of motherhood (p=.001, OR=16.65). Ideology of motherhood was also the distinguishing factor between affectionate acceptance and active engaging groups (p=.041, OR=3.85). Conclusion: Since immigrant women's parenting type depends on their ideology of motherhood, parenting stress, and spousal relationships in terms of communication and support to help their child-raising and socio-cultural adaptation, it is necessary to provide them with systematic education and support, as well as interventions across personal, family, and community levels.

Implementation of CNN-based classification model for flood risk determination (홍수 위험도 판별을 위한 CNN 기반의 분류 모델 구현)

  • Cho, Minwoo;Kim, Dongsoo;Jung, Hoekyung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.3
    • /
    • pp.341-346
    • /
    • 2022
  • Due to global warming and abnormal climate, the frequency and damage of floods are increasing, and the number of people exposed to flood-prone areas has increased by 25% compared to 2000. Floods cause huge financial and human losses, and in order to reduce the losses caused by floods, it is necessary to predict the flood in advance and decide to evacuate quickly. This paper proposes a flood risk determination model using a CNN-based classification model so that timely evacuation decisions can be made using rainfall and water level data, which are key data for flood prediction. By comparing the results of the CNN-based classification model proposed in this paper and the DNN-based classification model, it was confirmed that it showed better performance. Through this, it is considered that it can be used as an initial study to determine the risk of flooding, determine whether to evacuate, and make an evacuation decision at the optimal time.

Factors Influencing Medical Care Utilization according to Decline of Region: Urban Decline Index and Medical Vulnerability Index as Indicators (지역쇠퇴 유형별 의료이용행태 영향요인: 도시쇠퇴 지표와 의료취약지 지표를 활용하여)

  • Jeong, Ji Yun;Jeong, Jae Yeon;Yoon, In Hye;Choi, Hwa Young;Lee, Hae Jong
    • Health Policy and Management
    • /
    • v.32 no.2
    • /
    • pp.205-215
    • /
    • 2022
  • Background: The purpose of this study is to identify the factors infecting the medical care utilization from a new perspective by newly classifying the categories of administrative districts using the urban decline index and medical vulnerability index as indicators. Methods: This study targeted 150,940 people who used medical services using the 2015 cohort database (DB), 2010-2015 urban regeneration analysis index DB, and 2014-2015 public health and medical statistics DB. The decline of the region was classified using the urban decline index typed using k-means clustering and the medical vulnerability index typed using the quantile score calculation. Regression analysis was performed 3 times with medical expenditure, length of stay, and the number of outpatient visits as dependent variables. Results: There were 37 stable region (47.4%), 29 health vulnerable region (37.2%), and 12 decline region (15.4%). The health vulnerable region had lower medical expenditure, fewer outpatient visits, and a higher length of stay than the stable region. The decline region was all higher than the stable region but had no significant effect. Conclusion: The factors that cause the health disparity between regions are not only factors related to individual health behavior but also environmental factors of the local community. Therefore, there is a need for a systematic alternative that properly considers the resources within the community and reflects the characteristics of the population.