• Title/Summary/Keyword: discriminant function analysis

Derivation and Application of Influence Function in Discriminant Analysis for Three Groups (세 집단 판별분석 상황에서의 영향함수 유도 및 그 응용)

  • Lee, Hae-Jung;Kim, Hong-Gie
    • The Korean Journal of Applied Statistics / v.24 no.5 / pp.941-949 / 2011
  • The influence function is used to develop criteria for detecting outliers in discriminant analysis. We derive the influence function of observations on the estimated misclassification probability in discriminant analysis for three groups. The proposed measures are applied to facial image data to identify outliers, and the discriminant analysis is then redone with the outliers excluded. The study shows that the derived influence function is more efficient than the discriminant probability approach.
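
The idea of measuring how much a single observation changes the estimated misclassification probability can be sketched in a much simpler setting than the one in the abstract. The example below is only an illustration under stated assumptions: two groups instead of three, one variable, a pooled variance, and case deletion in place of the derived influence function. The data and function names are hypothetical.

```python
import math

def misclassification_probability(g1, g2):
    """Phi(-delta/2) for two univariate normal groups with a pooled variance."""
    m1, m2 = sum(g1) / len(g1), sum(g2) / len(g2)
    ss = sum((x - m1) ** 2 for x in g1) + sum((x - m2) ** 2 for x in g2)
    pooled_sd = math.sqrt(ss / (len(g1) + len(g2) - 2))
    delta = abs(m1 - m2) / pooled_sd  # standardized distance between means
    # Standard normal CDF evaluated at -delta/2
    return 0.5 * (1.0 + math.erf(-delta / 2.0 / math.sqrt(2.0)))

def deletion_influence(g1, g2):
    """Change in the estimated probability when each observation is left out."""
    base = misclassification_probability(g1, g2)
    result = []
    for i, x in enumerate(g1):
        p = misclassification_probability(g1[:i] + g1[i + 1:], g2)
        result.append(("group1", x, p - base))
    for i, x in enumerate(g2):
        p = misclassification_probability(g1, g2[:i] + g2[i + 1:])
        result.append(("group2", x, p - base))
    return result

# Hypothetical data: 4.0 is far from the rest of group1.
group1 = [1.0, 1.2, 0.8, 1.1, 0.9, 4.0]
group2 = [3.0, 3.2, 2.8, 3.1, 2.9]

influences = deletion_influence(group1, group2)
most_influential = max(influences, key=lambda t: abs(t[2]))
print(most_influential[:2])
```

An observation whose removal changes the estimate far more than any other is a candidate outlier, after which the analysis can be rerun without it.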

A Study on the Discriminant Variables of Face Skin Colors for the Korean Males (한국 남성의 얼굴 피부색 판별을 위한 색채 변수에 관한 연구)

  • Kim, Ku-Ja
    • Journal of the Korean Society of Clothing and Textiles / v.29 no.7 s.144 / pp.959-967 / 2005
  • The colors of apparel interact with the face skin colors of the wearers. This study was carried out to classify the face skin colors of Korean males into several groups of similar colors, in order to extract favorable colors that flatter each group. A criterion for selecting new subjects who have the classified face skin colors also had to be decided. Using a JX-777 color spectrometer, the face skin colors of the subjects were measured quantitatively and classified into three clusters with similar hue, value and chroma in the Munsell Color System. The sample comprised 418 Korean males and 15 additional new male subjects. Data were analyzed by K-means cluster analysis, ANOVA, Duncan's multiple range test, and stepwise discriminant analysis using SPSS Win. 12. Findings were as follows: 1. The 418 subjects, who had YR colors, were clustered into 3 face skin color groups. 2. The discriminant variables of face skin colors were 4 variables: L value of forehead, v value of cheek, c value of forehead, and b value of cheek from standardized canonical discriminant function coefficient 1, and c value of forehead, L value of forehead, b value of cheek, and L value of cheek from standardized canonical discriminant function coefficient 2. 3. The hit ratio was $92.3\%$ for type 1, $96.5\%$ for type 2 and $92.6\%$ for type 3 using the canonical discriminant function of 4 variables. 4. Canonical discriminant function equations 1 and 2 were calculated from the unstandardized canonical discriminant function coefficients and constants, and the cutting scores and score ranges were computed. 5. The criterion for selecting new subjects who have the classified face skin colors was decided.
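
The per-type hit ratios reported in this abstract are the fraction of subjects in each actual group that the discriminant function assigns to that same group. A minimal sketch of that calculation follows; the labels are made-up illustrative values, not the study's data, and `hit_ratios` is a hypothetical helper name.

```python
from collections import Counter

def hit_ratios(actual, predicted):
    """Return (per-group hit ratio, overall hit ratio)."""
    totals = Counter(actual)
    hits = Counter(a for a, p in zip(actual, predicted) if a == p)
    per_group = {g: hits[g] / totals[g] for g in totals}
    overall = sum(hits.values()) / len(actual)
    return per_group, overall

# Hypothetical cluster types (actual) and discriminant-function assignments.
actual = [1, 1, 1, 1, 2, 2, 2, 3, 3, 3]
predicted = [1, 1, 1, 2, 2, 2, 2, 3, 3, 1]

per_group, overall = hit_ratios(actual, predicted)
print(per_group, overall)
```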

Palatability Grading Analysis of Hanwoo Beef using Sensory Properties and Discriminant Analysis (관능특성 및 판별함수를 이용한 한우고기 맛 등급 분석)

  • Cho, Soo-Hyun;Seo, Gu-Reo-Un-Dal-Nim;Kim, Dong-Hun;Kim, Jae-Hee
    • Food Science of Animal Resources / v.29 no.1 / pp.132-139 / 2009
  • The objective of this study was to identify the most effective analysis method for palatability grading of Hanwoo beef by comparing the results of discriminant analyses of sensory data. The sensory data were obtained from sensory testing in which 1,300 consumers evaluated the tenderness, juiciness, flavor-likeness and overall acceptability of Hanwoo beef samples prepared by boiling, roasting and grilling. For the discriminant analysis with one factor, overall acceptability, the linear discriminant functions and the non-parametric discriminant function with the Gaussian kernel were estimated. The linear discriminant functions were simple and easy to understand, while the non-parametric discriminant functions were not explicit and required a choice of kernel function and bandwidth. With the three palatability factors (tenderness, juiciness and flavor-likeness), canonical discriminant analysis was used, and classification ability was assessed by the accurate classification rate and the error rate. The canonical discriminant analysis did not need specific distributional assumptions and used only the principal components and canonical correlations. It also incorporated all 3 factors, and its accurate classification rate was similar to that of the other discriminant methods. Therefore, canonical discriminant analysis was the most appropriate method for analyzing the palatability grading of Hanwoo beef.
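
To make the bandwidth issue mentioned in this abstract concrete, the following is a rough sketch of a one-variable non-parametric discriminant rule with a Gaussian kernel: a score is assigned to the group with the larger prior-weighted kernel density estimate. The scores, group names and default bandwidth `h` are assumptions chosen for illustration, not values from the study.

```python
import math

def gaussian_kde(x, sample, h):
    """Gaussian kernel density estimate at x with bandwidth h."""
    norm = len(sample) * h * math.sqrt(2.0 * math.pi)
    return sum(math.exp(-0.5 * ((x - s) / h) ** 2) for s in sample) / norm

def kernel_classify(x, groups, h=0.5):
    """Assign x to the group with the largest prior-weighted density."""
    n = sum(len(sample) for sample in groups.values())
    scores = {
        name: (len(sample) / n) * gaussian_kde(x, sample, h)
        for name, sample in groups.items()
    }
    return max(scores, key=scores.get)

# Hypothetical overall-acceptability scores for two palatability grades.
groups = {
    "low": [2.0, 2.5, 3.0, 3.2],
    "high": [4.5, 5.0, 5.5, 6.0],
}

print(kernel_classify(2.8, groups), kernel_classify(5.2, groups))
```

Unlike a linear discriminant function, this rule has no closed-form boundary, and results near the boundary can shift when `h` changes, which is the selection problem the abstract refers to.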

Principal Discriminant Variate (PDV) Method for Classification of Multicollinear Data: Application to Diagnosis of Mastitic Cows Using Near-Infrared Spectra of Plasma Samples

  • Jiang, Jian-Hui;Tsenkova, Roumiana;Yu, Ru-Qin;Ozaki, Yukihiro
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference / 2001.06a / pp.1244-1244 / 2001
  • In linear discriminant analysis there are two important properties concerning the effectiveness of discriminant function modeling. The first is the separability of the discriminant function for different classes. The separability reaches its optimum by maximizing the ratio of between-class to within-class variance. The second is the stability of the discriminant function against noise present in the measurement variables. One can optimize the stability by exploring the discriminant variates in a principal variation subspace, i.e., the directions that account for a majority of the total variation of the data. An unstable discriminant function will exhibit inflated variance in the prediction of future unclassified objects, which are then exposed to a significantly increased risk of erroneous prediction. Therefore, an ideal discriminant function should not only separate different classes with a minimum misclassification rate for the training set, but also be stable enough that the prediction variance for unclassified objects is as small as possible. In other words, an optimal classifier should find a balance between separability and stability. This is of special significance for multivariate spectroscopy-based classification, where multicollinearity always leads to discriminant directions located in low-spread subspaces. A new regularized discriminant analysis technique, the principal discriminant variate (PDV) method, has been developed for effectively handling multicollinear data commonly encountered in multivariate spectroscopy-based classification. The motivation behind this method is to seek a sequence of discriminant directions that not only optimize the separability between different classes, but also account for a maximized variation present in the data. Three different formulations of the PDV method are suggested, and an effective computing procedure is proposed for a PDV method.
Near-infrared (NIR) spectra of blood plasma samples from mastitic and healthy cows have been used to evaluate the behavior of the PDV method in comparison with principal component analysis (PCA), discriminant partial least squares (DPLS), soft independent modeling of class analogies (SIMCA) and Fisher linear discriminant analysis (FLDA). The results demonstrate that the PDV method exhibits improved stability in prediction without significant loss of separability. The NIR spectra of blood plasma samples from mastitic and healthy cows are clearly discriminated by the PDV method. Moreover, the proposed method provides superior performance to PCA, DPLS, SIMCA and FLDA, indicating that PDV is a promising tool in discriminant analysis of spectra-characterized samples with only small compositional differences, thereby providing a useful means for spectroscopy-based clinical applications.
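
The separability criterion described above (the ratio of between-class to within-class variance) can be illustrated with the classical two-class Fisher direction. The sketch below covers only that FLDA baseline on made-up, strongly correlated 2-D data; it is not an implementation of the PDV method, and all names and values are hypothetical.

```python
def mean(rows):
    n = len(rows)
    return [sum(r[j] for r in rows) / n for j in range(2)]

def scatter(rows, m):
    """2x2 scatter matrix: sum of outer products of centered rows."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    for r in rows:
        d = [r[0] - m[0], r[1] - m[1]]
        for i in range(2):
            for j in range(2):
                s[i][j] += d[i] * d[j]
    return s

def fisher_direction(a, b):
    """w = Sw^{-1} (mean_b - mean_a) for two classes in two dimensions."""
    ma, mb = mean(a), mean(b)
    sa, sb = scatter(a, ma), scatter(b, mb)
    sw = [[sa[i][j] + sb[i][j] for j in range(2)] for i in range(2)]
    det = sw[0][0] * sw[1][1] - sw[0][1] * sw[1][0]
    diff = [mb[0] - ma[0], mb[1] - ma[1]]
    return [(sw[1][1] * diff[0] - sw[0][1] * diff[1]) / det,
            (-sw[1][0] * diff[0] + sw[0][0] * diff[1]) / det]

def fisher_ratio(w, a, b):
    """Squared mean difference over within-class scatter along direction w."""
    pa = [w[0] * r[0] + w[1] * r[1] for r in a]
    pb = [w[0] * r[0] + w[1] * r[1] for r in b]
    ma, mb = sum(pa) / len(pa), sum(pb) / len(pb)
    within = sum((p - ma) ** 2 for p in pa) + sum((p - mb) ** 2 for p in pb)
    return (mb - ma) ** 2 / within

# Hypothetical collinear data: both variables rise together in each class.
class_a = [(1.0, 2.0), (2.0, 3.1), (3.0, 3.9), (4.0, 5.2)]
class_b = [(1.5, 1.0), (2.5, 2.1), (3.5, 2.8), (4.5, 4.1)]

w = fisher_direction(class_a, class_b)
print(fisher_ratio(w, class_a, class_b))
print(fisher_ratio([1.0, 0.0], class_a, class_b))
print(fisher_ratio([0.0, 1.0], class_a, class_b))
```

On data like these the Fisher direction separates the classes far better than either original variable, but it points along a direction in which the data vary little, which is the kind of low-spread direction the abstract associates with instability.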

PRINCIPAL DISCRIMINANT VARIATE (PDV) METHOD FOR CLASSIFICATION OF MULTICOLLINEAR DATA WITH APPLICATION TO NEAR-INFRARED SPECTRA OF COW PLASMA SAMPLES

  • Jiang, Jian-Hui;Yuqing Wu;Yu, Ru-Qin;Yukihiro Ozaki
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference / 2001.06a / pp.1042-1042 / 2001
  • In linear discriminant analysis there are two important properties concerning the effectiveness of discriminant function modeling. The first is the separability of the discriminant function for different classes. The separability reaches its optimum by maximizing the ratio of between-class to within-class variance. The second is the stability of the discriminant function against noise present in the measurement variables. One can optimize the stability by exploring the discriminant variates in a principal variation subspace, i.e., the directions that account for a majority of the total variation of the data. An unstable discriminant function will exhibit inflated variance in the prediction of future unclassified objects, which are then exposed to a significantly increased risk of erroneous prediction. Therefore, an ideal discriminant function should not only separate different classes with a minimum misclassification rate for the training set, but also be stable enough that the prediction variance for unclassified objects is as small as possible. In other words, an optimal classifier should find a balance between separability and stability. This is of special significance for multivariate spectroscopy-based classification, where multicollinearity always leads to discriminant directions located in low-spread subspaces. A new regularized discriminant analysis technique, the principal discriminant variate (PDV) method, has been developed for effectively handling multicollinear data commonly encountered in multivariate spectroscopy-based classification. The motivation behind this method is to seek a sequence of discriminant directions that not only optimize the separability between different classes, but also account for a maximized variation present in the data. Three different formulations of the PDV method are suggested, and an effective computing procedure is proposed for a PDV method.
Near-infrared (NIR) spectra of blood plasma samples from daily monitoring of two Japanese cows have been used to evaluate the behavior of the PDV method in comparison with principal component analysis (PCA), discriminant partial least squares (DPLS), soft independent modeling of class analogies (SIMCA) and Fisher linear discriminant analysis (FLDA). The results demonstrate that the PDV method exhibits improved stability in prediction without significant loss of separability. The NIR spectra of blood plasma samples from the two cows are clearly discriminated by the PDV method. Moreover, the proposed method provides superior performance to PCA, DPLS, SIMCA and FLDA, indicating that PDV is a promising tool in discriminant analysis of spectra-characterized samples with only small compositional differences.

A Study on the Discriminant Variables of Face Skin Colors for the Korean Females (한국 여성의 얼굴 피부색 판별을 위한 색채 변수에 관한 연구)

  • Kim, Ku-Ja;Chung, Hae-Won
    • Journal of the Korean Society of Clothing and Textiles / v.29 no.7 s.144 / pp.978-986 / 2005
  • The color of apparel products has a close relationship with the face skin colors of consumers. In order to extract the favorable colors that flatter consumers' face skin colors, this study was carried out to classify the face skin colors of Korean females. Criteria for selecting new subjects who have the classified face skin colors also had to be decided. Using a JX-777 color spectrometer, the face skin colors of the subjects were measured and classified into three clusters with similar hue, value and chroma in the Munsell Color System. The sample comprised 324 Korean females and 10 additional new college girls. Data were analyzed by K-means cluster analysis, ANOVA, Duncan's multiple range test, and stepwise discriminant analysis using SPSS Win. 12. Findings were as follows: 1. The 324 subjects, who had YR colors, were clustered into 3 face skin color groups. 2. The discriminant variables of face skin colors were 5 variables: b value of cheek, V value of forehead, L value of cheek, C value of forehead and H value of cheek, by standardized canonical discriminant function coefficient 1. 3. The hit ratio was $96.8\%$ for type 1, $94.9\%$ for type 2 and $100.0\%$ for type 3, with a mean hit ratio of $96.9\%$, using the canonical discriminant function of 5 variables. 4. Canonical discriminant function equations 1 and 2 were calculated from the unstandardized canonical discriminant function coefficients and constants, and the cutting scores and score ranges of the classified types were computed. The criteria for selecting new subjects were decided.

A Study of Discriminant Analysis about Korean Quick Response System Adoption (국내(國內) 신속대응(迅速對應)시스템 도입업체(導入業體)의 판별분석(判別分析) 연구(硏究))

  • Ko, Eun-Ju
    • Journal of Fashion Business / v.4 no.3 / pp.103-114 / 2000
  • The purpose of this study was to test a discriminant analysis model of Quick Response system adoption and to examine the detailed relationship between each discriminant factor and Quick Response adoption. In this model, firm size, strategic type, product category, fashion trend, selling time and the Quick Response benefits were included as discriminant factors. One hundred and two subjects were randomly selected for the survey, and discriminant analysis, descriptive analysis, t-test, and chi-square test were used for the data analysis. The results of this study were: 1. Wilks' lambda and the F value support the discriminant analysis model: taken together, firm size, strategic type, product category, fashion trend, selling time and the Quick Response benefits significantly help to explain Quick Response adoption. 2. In order of discriminant ability, the factors were firm size, the Quick Response benefits, women's wear, fashion trend, analyzer, selling time, reactor, defender and men's wear. 3. The discriminant function had a high hit ratio, so it can be used effectively to classify Quick Response adoption/nonadoption.
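
For readers unfamiliar with the statistic cited in result 1, Wilks' lambda for a single variable is the within-group sum of squares divided by the total sum of squares; in the multivariate case it is the ratio of the determinants of the within-group and total scatter matrices. Values near 0 indicate strong group separation and values near 1 indicate little. The sketch below shows only the single-variable case on hypothetical scores, not the study's data.

```python
def wilks_lambda(groups):
    """Univariate Wilks' lambda: SS_within / SS_total."""
    values = [x for g in groups for x in g]
    grand_mean = sum(values) / len(values)
    ss_total = sum((x - grand_mean) ** 2 for x in values)
    ss_within = 0.0
    for g in groups:
        group_mean = sum(g) / len(g)
        ss_within += sum((x - group_mean) ** 2 for x in g)
    return ss_within / ss_total

# Hypothetical firm-size scores for adopters and non-adopters.
adopters = [5.0, 6.0, 7.0]
non_adopters = [1.0, 2.0, 3.0]

print(wilks_lambda([adopters, non_adopters]))
```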

Study on Classification Function into Sasang Constitution Using Data Mining Techniques (데이터마이닝 기법을 이용한 사상체질 판별함수에 관한 연구)

  • Kim Kyu Kon;Kim Jong Won;Lee Eui Ju;Kim Jong Yeol;Choi Sun-Mi
    • Journal of Physiology & Pathology in Korean Medicine / v.18 no.6 / pp.1938-1944 / 2004
  • In this study, data mining techniques are applied to find a classification function that improves the accuracy of constitution diagnosis using the QSCC Ⅱ (Questionnaire of Sasang Constitution Classification). The data analyzed are the questionnaires of 1051 patients who had been treated at Dong Eui Oriental Medical Hospital and Kyung Hee Oriental Medical Hospital. The criteria for data cleansing are the response pattern in the opposite questionnaires and the positive proportion of specific questionnaires in each constitution. The criteria for variable selection are the test of homogeneity in frequency analysis and the coefficients in the linear discriminant function. A discriminant analysis model and a decision tree model are applied to find the classification function for Sasang constitution. Accuracy on the learning sample is similar for the two models, while the discriminant analysis model achieves higher accuracy on the test sample.

Local Influence Assessment of the Misclassification Probability in Multiple Discriminant Analysis

  • Jung, Kang-Mo
    • Journal of the Korean Statistical Society / v.27 no.4 / pp.471-483 / 1998
  • The influence of observations on the misclassification probability in multiple discriminant analysis under the equal covariance assumption is investigated by the local influence method. Under an appropriate perturbation we can get information about influential observations and outliers by studying the curvatures and the associated direction vectors of the perturbation-formed surface of the misclassification probability. We show that the influence function method gives essentially the same information as the direction vector of the maximum slope. An illustrative example is given for the effectiveness of the local influence method.

A Study on the Upper Body Shapes of Late Elementary Schoolgirls (학령후기 여아의 상반신 체형 연구)

  • Jang, Jeong-Ah
    • Fashion & Textile Research Journal / v.8 no.1 / pp.107-112 / 2006
  • This study classifies the upper body shapes of late elementary schoolgirls. The sample consisted of 11- to 12-year-old girls residing in Busan and Kyungnam. Based on their somatometric characteristics, 33 anthropometric and 7 photographic measurements were acquired from every girl. These data were statistically analyzed by factor analysis, cluster analysis, and discriminant analysis. The factor analysis showed that 79.95% of the total variance can be explained by 8 factors. Through the cluster analysis, 3 types of upper body shape were identified: Type I has an average horizontal size, a big vertical size and a markedly protruding chest; Type III has a big horizontal size, a mean vertical size, and a big upper angle of the back; Type II has a small horizontal and vertical size and a long surface length of the upper body. In the discriminant analysis, the items with large coefficient values were as follows: upper chest circumference, arm length and waist front length for discriminant function I, and waist depth, front length, back breadth, nipple to nipple breadth and upper chest circumference for discriminant function II.