• Title/Abstract/Keywords: discriminant function analysis

248 search results (processing time: 0.028 s)

Derivation and Application of Influence Function in Discriminant Analysis for Three Groups

  • 이혜정;김홍기
    • 응용통계연구, Vol. 24, No. 5, pp. 941-949, 2011
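  • This paper aims to identify outliers that affect the misclassification probability computed when discriminant analysis is applied to exactly three groups, and presents a simple influence function formula that is easy to apply. Using the proposed formula, facial data were classified into the three Sasang constitution types, and the influence function of each observation on the misclassification probability was computed. The results confirm that using the influence function for the misclassification probability is an efficient way to remove outliers and rerun the discriminant analysis.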
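The idea of ranking observations by their effect on the misclassification rate can be illustrated with a brute-force case-deletion check. The sketch below is not the closed-form influence function from the paper; it assumes a simple nearest-centroid rule for three groups and made-up one-dimensional data, and refits the rule after dropping each observation.

```python
from statistics import mean

def centroids(data):
    """Per-group mean vector for data given as [(features, label), ...]."""
    groups = {}
    for x, g in data:
        groups.setdefault(g, []).append(x)
    return {g: [mean(col) for col in zip(*xs)] for g, xs in groups.items()}

def classify(x, cents):
    """Assign x to the group with the nearest centroid (squared Euclidean)."""
    return min(cents, key=lambda g: sum((a - b) ** 2 for a, b in zip(x, cents[g])))

def error_rate(data):
    """Apparent (resubstitution) misclassification rate on the given data."""
    cents = centroids(data)
    return sum(classify(x, cents) != g for x, g in data) / len(data)

def deletion_influence(data):
    """Drop in apparent error rate when each observation is removed in turn."""
    base = error_rate(data)
    return [base - error_rate(data[:i] + data[i + 1:]) for i in range(len(data))]

# Hypothetical data: three well-separated groups plus one "A" point
# (index 3) sitting inside group "B".
data = [([0], "A"), ([1], "A"), ([2], "A"), ([13], "A"),
        ([10], "B"), ([11], "B"), ([12], "B"),
        ([20], "C"), ([21], "C"), ([22], "C")]

influences = deletion_influence(data)
most_influential = influences.index(max(influences))
```

With these toy values, the observation whose removal lowers the error rate most is the misplaced one, which is the kind of point one would inspect before rerunning the analysis.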

A Study on the Discriminant Variables of Face Skin Colors for the Korean Males

  • 김구자
    • 한국의류학회지, Vol. 29, No. 7, pp. 959-967, 2005
  • The color of apparel interacts with the face skin color of the wearer. This study was carried out to classify the face skin colors of Korean males into several groups of similar colors in order to extract favorable colors that flatter their face skin colors. A criterion also had to be decided for selecting new subjects who have the classified face skin colors. Using a JX-777 color spectrometer, the face skin colors of subjects were measured quantitatively and classified into three clusters with similar hue, value and chroma in the Munsell Color System. The sample consisted of 418 Korean males plus 15 new male subjects. Data were analyzed by K-means cluster analysis, ANOVA, Duncan's multiple range test, and stepwise discriminant analysis using SPSS Win. 12. Findings were as follows: 1. The 418 subjects, who had YR colors, were clustered into 3 face skin color groups. 2. Four discriminant variables of face skin color were identified: L value of forehead, v value of cheek, c value of forehead, and b value of cheek from standardized canonical discriminant function coefficient 1, and c value of forehead, L value of forehead, b value of cheek, and L value of cheek from standardized canonical discriminant function coefficient 2. 3. The hit ratio by the canonical discriminant function of 4 variables was 92.3% for type 1, 96.5% for type 2, and 92.6% for type 3. 4. Canonical discriminant function equations 1 and 2 were calculated with the unstandardized canonical discriminant function coefficients and constant, and the cutting score and range of scores were computed. 5. The criterion for selecting new subjects who have the classified face skin colors was decided.
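The per-type hit ratio reported in finding 3 is the share of subjects in each true group that the discriminant function assigns back to that group. A minimal sketch of the calculation follows; the label lists are invented for illustration and are not the study's data.

```python
from collections import Counter

def hit_ratios(actual, predicted):
    """Fraction of each actual group that was classified into that same group."""
    totals = Counter(actual)
    hits = Counter(a for a, p in zip(actual, predicted) if a == p)
    return {g: hits[g] / totals[g] for g in totals}

# Hypothetical cluster labels (types 1-3) and discriminant-function predictions.
actual    = [1, 1, 1, 1, 2, 2, 2, 2, 2, 3, 3, 3]
predicted = [1, 1, 1, 2, 2, 2, 2, 2, 2, 3, 3, 1]

per_type = hit_ratios(actual, predicted)
overall = sum(a == p for a, p in zip(actual, predicted)) / len(actual)
```

Note that the overall hit ratio weights each subject equally, so it generally differs from the plain average of the per-type ratios when group sizes are unequal.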

Palatability Grading Analysis of Hanwoo Beef using Sensory Properties and Discriminant Analysis

  • 조수현;서그러운달님;김동훈;김재희
    • 한국축산식품학회지, Vol. 29, No. 1, pp. 132-139, 2009
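  • This study compared discriminant analysis methods for classifying beef palatability grades, using Hanwoo beef data from 1,300 consumers who tasted and rated the beef themselves. Canonical discriminant analysis was carried out with the three main variables of Hanwoo sensory evaluation (tenderness, juiciness, and flavor), and linear discriminant analysis and nonparametric discriminant analysis were carried out using only overall acceptability, which is regarded as the representative palatability variable. When a single variable such as overall acceptability is used, both methods give similar classification rates. The linear discriminant function has the advantage of being easy to understand and use, whereas the nonparametric method requires an inconvenient choice of kernel function and bandwidth but can raise the correct classification rate when they are chosen well. However, discriminant analysis based on a single variable, even though other informative variables are available, cannot exploit the information in other important variables that affect the discrimination. Canonical discriminant analysis, on the other hand, had a misclassification rate that was not much worse than those of the univariate linear and nonparametric discriminant functions, required no special distributional assumptions and was therefore statistically undemanding, and used all three variables that matter for taste (tenderness, juiciness, and flavor), making maximal use of the palatability information. The study therefore concludes that multivariate canonical discriminant analysis including all three variables, tenderness, juiciness, and flavor, is the most appropriate method for classifying palatability grades.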
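The role of the kernel function and bandwidth in the nonparametric approach can be seen in a small single-variable sketch. It assumes a Gaussian kernel, priors proportional to group size, and invented acceptability scores for three hypothetical grades; none of these choices are taken from the study.

```python
import math

def kde(x, sample, h):
    """Gaussian kernel density estimate at x with bandwidth h."""
    norm = len(sample) * h * math.sqrt(2 * math.pi)
    return sum(math.exp(-0.5 * ((x - s) / h) ** 2) for s in sample) / norm

def kernel_classify(x, groups, h):
    """Assign x to the group with the largest prior-weighted density estimate."""
    n = sum(len(s) for s in groups.values())
    return max(groups, key=lambda g: len(groups[g]) / n * kde(x, groups[g], h))

# Hypothetical overall-acceptability scores for three palatability grades.
groups = {
    "low":  [2.0, 2.5, 3.0],
    "mid":  [4.0, 4.5, 5.0],
    "high": [6.0, 6.5, 7.0],
}

bandwidth = 0.5  # assumed value; in practice this has to be tuned
grades = [kernel_classify(x, groups, bandwidth) for x in (2.4, 4.6, 6.8)]
```

A smaller bandwidth makes the rule follow individual training scores more closely, while a larger one smooths the group densities toward each other, which is why the choice matters for the classification rate.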

Principal Discriminant Variate (PDV) Method for Classification of Multicollinear Data: Application to Diagnosis of Mastitic Cows Using Near-Infrared Spectra of Plasma Samples

  • Jiang, Jian-Hui;Tsenkova, Roumiana;Yu, Ru-Qin;Ozaki, Yukihiro
    • 한국근적외분광분석학회:학술대회논문집, 한국근적외분광분석학회 2001 NIR-2001, pp. 1244-1244, 2001
  • In linear discriminant analysis there are two important properties concerning the effectiveness of discriminant function modeling. The first is the separability of the discriminant function for different classes. The separability reaches its optimum by maximizing the ratio of between-class to within-class variance. The second is the stability of the discriminant function against noise present in the measurement variables. One can optimize the stability by exploring the discriminant variates in a principal variation subspace, i.e., the directions that account for a majority of the total variation of the data. An unstable discriminant function will exhibit inflated variance in the prediction of future unclassified objects and is exposed to a significantly increased risk of erroneous prediction. Therefore, an ideal discriminant function should not only separate different classes with a minimum misclassification rate for the training set, but also possess good stability, such that the prediction variance for unclassified objects is as small as possible. In other words, an optimal classifier should find a balance between separability and stability. This is of special significance for multivariate spectroscopy-based classification, where multicollinearity always leads to discriminant directions located in low-spread subspaces. A new regularized discriminant analysis technique, the principal discriminant variate (PDV) method, has been developed for effectively handling the multicollinear data commonly encountered in multivariate spectroscopy-based classification. The motivation behind this method is to seek a sequence of discriminant directions that not only optimize the separability between different classes, but also account for a maximized variation present in the data. Three different formulations of the PDV method are suggested, and an effective computing procedure is proposed for a PDV method.
Near-infrared (NIR) spectra of blood plasma samples from mastitic and healthy cows have been used to evaluate the behavior of the PDV method in comparison with principal component analysis (PCA), discriminant partial least squares (DPLS), soft independent modeling of class analogies (SIMCA) and Fisher linear discriminant analysis (FLDA). The results demonstrate that the PDV method exhibits improved stability in prediction without significant loss of separability. The NIR spectra of blood plasma samples from mastitic and healthy cows are clearly discriminated by the PDV method. Moreover, the proposed method provides superior performance to PCA, DPLS, SIMCA and FLDA, indicating that PDV is a promising tool in discriminant analysis of spectra-characterized samples with only small compositional differences, thereby providing a useful means for spectroscopy-based clinical applications.
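The separability criterion mentioned in the abstract, the ratio of between-class to within-class variance, can be made concrete for two classes in two dimensions. The sketch below is ordinary Fisher linear discriminant analysis (the FLDA baseline), not the PDV method itself, and the data points are made up.

```python
def mean_vec(xs):
    """Mean of a list of 2-D points."""
    n = len(xs)
    return [sum(x[0] for x in xs) / n, sum(x[1] for x in xs) / n]

def scatter(xs, m):
    """2x2 scatter matrix: sum over points of (x - m)(x - m)^T."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    for x in xs:
        d = [x[0] - m[0], x[1] - m[1]]
        for i in range(2):
            for j in range(2):
                s[i][j] += d[i] * d[j]
    return s

def fisher_direction(a, b):
    """Direction w = Sw^{-1} (mean_b - mean_a) maximizing the Fisher ratio."""
    ma, mb = mean_vec(a), mean_vec(b)
    sa, sb = scatter(a, ma), scatter(b, mb)
    sw = [[sa[i][j] + sb[i][j] for j in range(2)] for i in range(2)]
    det = sw[0][0] * sw[1][1] - sw[0][1] * sw[1][0]
    diff = [mb[0] - ma[0], mb[1] - ma[1]]
    return [(sw[1][1] * diff[0] - sw[0][1] * diff[1]) / det,
            (-sw[1][0] * diff[0] + sw[0][0] * diff[1]) / det]

def fisher_ratio(a, b, w):
    """Squared gap between projected class means over within-class scatter."""
    pa = [w[0] * x[0] + w[1] * x[1] for x in a]
    pb = [w[0] * x[0] + w[1] * x[1] for x in b]
    ma, mb = sum(pa) / len(pa), sum(pb) / len(pb)
    within = sum((p - ma) ** 2 for p in pa) + sum((p - mb) ** 2 for p in pb)
    return (ma - mb) ** 2 / within

# Hypothetical two-class data with correlated variables.
class_a = [(1, 2), (2, 3), (3, 3), (4, 5)]
class_b = [(2, 0), (3, 1), (4, 1), (5, 3)]

w = fisher_direction(class_a, class_b)
best = fisher_ratio(class_a, class_b, w)
along_x = fisher_ratio(class_a, class_b, [1, 0])
along_y = fisher_ratio(class_a, class_b, [0, 1])
```

When the within-class scatter matrix is nearly singular, as with multicollinear spectra, the division by its determinant becomes unstable; that is the stability problem the abstract says regularized approaches such as PDV are meant to address.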


PRINCIPAL DISCRIMINANT VARIATE (PDV) METHOD FOR CLASSIFICATION OF MULTICOLLINEAR DATA WITH APPLICATION TO NEAR-INFRARED SPECTRA OF COW PLASMA SAMPLES

  • Jiang, Jian-Hui;Yuqing Wu;Yu, Ru-Qin;Yukihiro Ozaki
    • 한국근적외분광분석학회:학술대회논문집, 한국근적외분광분석학회 2001 NIR-2001, pp. 1042-1042, 2001
  • In linear discriminant analysis there are two important properties concerning the effectiveness of discriminant function modeling. The first is the separability of the discriminant function for different classes. The separability reaches its optimum by maximizing the ratio of between-class to within-class variance. The second is the stability of the discriminant function against noise present in the measurement variables. One can optimize the stability by exploring the discriminant variates in a principal variation subspace, i.e., the directions that account for a majority of the total variation of the data. An unstable discriminant function will exhibit inflated variance in the prediction of future unclassified objects and is exposed to a significantly increased risk of erroneous prediction. Therefore, an ideal discriminant function should not only separate different classes with a minimum misclassification rate for the training set, but also possess good stability, such that the prediction variance for unclassified objects is as small as possible. In other words, an optimal classifier should find a balance between separability and stability. This is of special significance for multivariate spectroscopy-based classification, where multicollinearity always leads to discriminant directions located in low-spread subspaces. A new regularized discriminant analysis technique, the principal discriminant variate (PDV) method, has been developed for effectively handling the multicollinear data commonly encountered in multivariate spectroscopy-based classification. The motivation behind this method is to seek a sequence of discriminant directions that not only optimize the separability between different classes, but also account for a maximized variation present in the data. Three different formulations of the PDV method are suggested, and an effective computing procedure is proposed for a PDV method.
Near-infrared (NIR) spectra of blood plasma samples from daily monitoring of two Japanese cows have been used to evaluate the behavior of the PDV method in comparison with principal component analysis (PCA), discriminant partial least squares (DPLS), soft independent modeling of class analogies (SIMCA) and Fisher linear discriminant analysis (FLDA). The results demonstrate that the PDV method exhibits improved stability in prediction without significant loss of separability. The NIR spectra of blood plasma samples from the two cows are clearly discriminated by the PDV method. Moreover, the proposed method provides superior performance to PCA, DPLS, SIMCA and FLDA, indicating that PDV is a promising tool in discriminant analysis of spectra-characterized samples with only small compositional differences.


A Study on the Discriminant Variables of Face Skin Colors for the Korean Females

  • 김구자;정혜원
    • 한국의류학회지, Vol. 29, No. 7, pp. 978-986, 2005
  • The color of apparel products has a close relationship with the face skin colors of consumers. In order to extract favorable colors that flatter consumers' face skin colors, this study was carried out to classify the face skin colors of Korean females. Criteria also had to be decided for selecting new subjects who have the classified face skin colors. Using a JX-777 color spectrometer, the face skin colors of subjects were measured and classified into three clusters with similar hue, value and chroma in the Munsell Color System. The sample consisted of 324 Korean females plus 10 new female college students. Data were analyzed by K-means cluster analysis, ANOVA, Duncan's multiple range test, and stepwise discriminant analysis using SPSS Win. 12. Findings were as follows: 1. The 324 subjects, who had YR colors, were clustered into 3 face skin color groups. 2. Five discriminant variables of face skin color were identified by standardized canonical discriminant function coefficient 1: b value of cheek, V value of forehead, L value of cheek, C value of forehead and H value of cheek. 3. The hit ratio by the canonical discriminant function of 5 variables was 96.8% for type 1, 94.9% for type 2, and 100.0% for type 3, with a mean hit ratio of 96.9%. 4. Canonical discriminant function equations 1 and 2 were calculated with the unstandardized canonical discriminant function coefficients and constant, and the cutting score and range of scores of the classified types were computed. The criteria for selecting new subjects were decided.

A Study of Discriminant Analysis about Korean Quick Response System Adoption

  • 고은주
    • 패션비즈니스, Vol. 4, No. 3, pp. 103-114, 2000
  • The purpose of this study was to test the discriminant analysis model of the Quick Response system and to examine the detailed relationship between each discriminant factor and Quick Response adoption. In this discriminant analysis model, firm size, strategic type, product category, fashion trend, selling time and Quick Response benefits were included as discriminant factors. One hundred and two subjects were randomly selected for the survey, and discriminant analysis, descriptive analysis, t-test, and chi-square test were used for the data analysis. The results of this study were: 1. Wilks' lambda and the F value support the discriminant analysis model: taken together, firm size, strategic type, product category, fashion trend, selling time and Quick Response benefits significantly help to explain Quick Response adoption. 2. The order of importance in discriminant ability was firm size, Quick Response benefits, women's wear, fashion trend, analyzer, selling time, reactor, defender and men's wear. 3. The discriminant function had a high hit ratio, so it can be used effectively to classify Quick Response adoption and nonadoption.


Study on Classification Function into Sasang Constitution Using Data Mining Techniques

  • 김규곤;김종원;이의주;김종열;최선미
    • 동의생리병리학회지, Vol. 18, No. 6, pp. 1938-1944, 2004
  • In this study, data mining techniques are applied to seek a classification function that improves the accuracy of constitution diagnosis using QSCC II (Questionnaire of Sasang Constitution Classification). The data analyzed are the questionnaires of 1051 patients who had been treated at Dong Eui Oriental Medical Hospital and Kyung Hee Oriental Medical Hospital. The criteria for data cleansing are the response pattern in opposite questionnaire items and the positive proportion of specific items in each constitution. The criteria for variable selection are the test of homogeneity in frequency analysis and the coefficients in the linear discriminant function. A discriminant analysis model and a decision tree model are applied to seek the classification function into Sasang constitution. The accuracy on the learning sample is similar for the two models, while the discriminant analysis model achieves higher accuracy on the test sample.

Local Influence Assessment of the Misclassification Probability in Multiple Discriminant Analysis

  • Jung, Kang-Mo
    • Journal of the Korean Statistical Society, Vol. 27, No. 4, pp. 471-483, 1998
  • The influence of observations on the misclassification probability in multiple discriminant analysis under the equal covariance assumption is investigated by the local influence method. Under an appropriate perturbation, information about influential observations and outliers can be obtained by studying the curvatures and the associated direction vectors of the perturbation-formed surface of the misclassification probability. We show that the influence function method gives essentially the same information as the direction vector of the maximum slope. An illustrative example demonstrates the effectiveness of the local influence method.


A Study on the Upper Body Shapes of Late Elementary Schoolgirls

  • 장정아
    • 한국의류산업학회지, Vol. 8, No. 1, pp. 107-112, 2006
  • This study classifies the upper body shapes of late elementary schoolgirls. The sample consisted of 11- to 12-year-old girls residing in Busan and Kyungnam. Based on their somatometric characteristics, 33 anthropometric and 7 photographic measurements were acquired from every girl. These data were analyzed statistically using factor analysis, cluster analysis, and discriminant analysis. The factor analysis showed that 79.95% of the total variance can be explained by 8 factors. The cluster analysis yielded 3 types of upper body shape: Type I has average horizontal size, large vertical size and a strongly protruding chest; Type III has large horizontal size, average vertical size, and a large upper angle of the back; Type II has small horizontal and vertical size and long surface length of the upper body. In the discriminant analysis, the items with large coefficient values were upper chest circumference, arm length and waist front length in discriminant function I, and waist depth, front length, back breadth, nipple-to-nipple breadth and upper chest circumference in discriminant function II.