• Title/Summary/Keyword: various multivariate statistical methods

Search Result 82, Processing Time 0.026 seconds

Bayesian Analysis of Multivariate Threshold Animal Models Using Gibbs Sampling

  • Lee, Seung-Chun;Lee, Deukhwan
    • Journal of the Korean Statistical Society
    • /
    • v.31 no.2
    • /
    • pp.177-198
    • /
    • 2002
  • The estimation of variance components or variance ratios in linear model is an important issue in plant or animal breeding fields, and various estimation methods have been devised to estimate variance components or variance ratios. However, many traits of economic importance in those fields are observed as dichotomous or polychotomous outcomes. The usual estimation methods might not be appropriate for these cases. Recently threshold linear model is considered as an important tool to analyze discrete traits specially in animal breeding field. In this note, we consider a hierarchical Bayesian method for the threshold animal model. Gibbs sampler for making full Bayesian inferences about random effects as well as fixed effects is described to analyze jointly discrete traits and continuous traits. Numerical example of the model with two discrete ordered categorical traits, calving ease of calves from born by heifer and calving ease of calf from born by cow, and one normally distributed trait, birth weight, is provided.

A Study on the Construction and Analysis of Fractional Designs by Using Arrays for Factorial Experiments (배열을 이용한 효과적인 일부실시법의 설계 및 분석방법에 관한 연구)

  • Kim, Sang-Ik
    • Journal of Korean Society for Quality Management
    • /
    • v.40 no.1
    • /
    • pp.15-24
    • /
    • 2012
  • For the construction of fractional factorial designs, the various arrays can be widely used. In this paper we review the statistical properties of fractional designs constructed by two arrays such as orthogonal array and partially balanced array, and develop a quick and easy method for analyzing unreplicated saturated designs. The proposed method can be characterized that we control the error rate by experiment-wise way and exploit the multivariate Student $t$-distribution. Especially the proposed method can be used efficiently together with some exploratory analysis methods, such as half normal probability plot method.

Fingerprinting Differentiation of Astragalus membranaceus Roots According to Ages Using 1H-NMR Spectroscopy and Multivariate Statistical Analysis

  • Shin, Yoo-Soo;Bang, Kyong-Hwan;In, Dong-Su;Sung, Jung-Sook;Kim, Seon-Young;Ku, Bon-Cho;Kim, Suk-Weon;Lee, Dong-Ho;Choi, Hyung-Kyoon
    • Biomolecules & Therapeutics
    • /
    • v.17 no.2
    • /
    • pp.133-137
    • /
    • 2009
  • The root of Astragalus membranaceus is a traditional folk medicine that has been used for many therapeutic purposes in Asia. It reportedly acts as an immunostimulant, tonic, hepatoprotective, diuretic, antidiabetic, analgesic, expectorant, sedative, and anticancer drug. In this study, metabolomic profiling was applied to the roots of A. membranaceus of different ages using NMR coupled with two multivariate statistical analysis methods: such as principal components analysis (PCA) and canonical discriminant analysis (CDA). This allowed various metabolites to be assigned in NMR spectra, including $\gamma$-aminobutyric acid (GABA), aspartic acid, succinic acid, glutamic acid, glutamine, N-acetyl aspartic acid, acetic acid, arginine, alanine, threonine, lactic acid, and valine. The score plot from PCA and also CDA allowed a clear separation between samples according to age.

Classification of Forest Cover Types in the Baekdudaegan, South Korea

  • Chung, Sang Hoon;Lee, Sang Tae
    • Journal of Forest and Environmental Science
    • /
    • v.37 no.4
    • /
    • pp.269-279
    • /
    • 2021
  • This study was carried out to introduce the forest cover types of the Baekdudaegan inhabiting the number of native tree species. In order to understand the vegetation distribution characteristics of the Baekdudaegan, a vegetation survey was conducted on the major 20 mountains of the Baekdudaegan. The vegetation data were collected from 3,959 sample points by the point-centered quarter method. Each mountain was classified into 4-7 forests by using various multivariate statistical methods such as cluster analysis, indicator species analysis, multiple discriminant analysis, and species composition analysis. The forests were classified mainly according to the relative abundance of Quercus mongolica. There was a total of 111 classified forests and these forests were integrated into the following nine forest cover types using the percentage similarity index and by clustering according to vegetation type: 1) Mongolian oak, 2) Mongolian oak and other deciduous, 3) Oaks (Mixed Quercus spp.), 4) Korean red pine, 5) Korean red pine and oaks, 6) ash, 7) mixed mesophytic, 8) subalpine zone coniferous, and 9) miscellaneous forest. Forests grouped within the subalpine zone coniferous and miscellaneous classifications were characterized by similar environmental conditions and those forests that did not fit in any other category, respectively.

Forest Type Classification and Successional Trends in the Natural Forest of Mt. Deogyu (덕유산 일대 천연림의 산림형 분류와 천이경향)

  • Hwang, Kwang Mo;Chung, Sang Hoon;Kim, Ji Hong
    • Journal of Korean Society of Forest Science
    • /
    • v.105 no.2
    • /
    • pp.157-166
    • /
    • 2016
  • This study was carried out to classify the current forest cover types and to propose the successional trends in the natural forest of Mt. Deogyu. The vegetation data were collected by the point-centered quarter method. The forest cover types were classified by various multivariate statistical analysis methods such as cluster analysis, indicator species analysis and multiple discriminant analysis. This forests were classified into five forest types by the species composition of upper layer and topographic positions: Quercus mongolica forest in the ridge, Fraxinus mandushurica-F. rhynchophylla-Cornus controversa forest and F. mandushurica forest in the valley, the Q. serrata - Pinus densiflora - Q. mongolica forest and P. densiflora forest in the low-slope. As a result of the forest successional trends depending on ecological and environmental characteristics in each forest type, the current forest types were expected that the forest succession would be proceeded toward Q. mongolica forest, F. mandshurica forest, mixed mesophytic forest, and oak-Carpinus laxiflora forest.

Principal selected response reduction in multivariate regression (다변량회귀에서 주선택 반응변수 차원축소)

  • Yoo, Jae Keun
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.4
    • /
    • pp.659-669
    • /
    • 2021
  • Multivariate regression often appears in longitudinal or functional data analysis. Since multivariate regression involves multi-dimensional response variables, it is more strongly affected by the so-called curse of dimension that univariate regression. To overcome this issue, Yoo (2018) and Yoo (2019a) proposed three model-based response dimension reduction methodologies. According to various numerical studies in Yoo (2019a), the default method suggested in Yoo (2019a) is least sensitive to the simulated models, but it is not the best one. To release this issue, the paper proposes an selection algorithm by comparing the other two methods with the default one. This approach is called principal selected response reduction. Various simulation studies show that the proposed method provides more accurate estimation results than the default one by Yoo (2019a), and it confirms practical and empirical usefulness of the propose method over the default one by Yoo (2019a).

Functional Data Classification of Variable Stars

  • Park, Minjeong;Kim, Donghoh;Cho, Sinsup;Oh, Hee-Seok
    • Communications for Statistical Applications and Methods
    • /
    • v.20 no.4
    • /
    • pp.271-281
    • /
    • 2013
  • This paper considers a problem of classification of variable stars based on functional data analysis. For a better understanding of galaxy structure and stellar evolution, various approaches for classification of variable stars have been studied. Several features that explain the characteristics of variable stars (such as color index, amplitude, period, and Fourier coefficients) were usually used to classify variable stars. Excluding other factors but focusing only on the curve shapes of variable stars, Deb and Singh (2009) proposed a classification procedure using multivariate principal component analysis. However, this approach is limited to accommodate some features of the light curve data that are unequally spaced in the phase domain and have some functional properties. In this paper, we propose a light curve estimation method that is suitable for functional data analysis, and provide a classification procedure for variable stars that combined the features of a light curve with existing functional data analysis methods. To evaluate its practical applicability, we apply the proposed classification procedure to the data sets of variable stars from the project STellar Astrophysics and Research on Exoplanets (STARE).

A Study on Sasang Constitutional Gene Selection Using DNA Chips by Multivariate Analysis (유전자 칩 및 다변량 분석방법을 이용한 사상체질 유전자 선별에 관한 연구)

  • Kim, Pan-Joon;Seo, Eun-Hee;Lee, Jung-Hwan;Ha, Jin-Ho;Choi, Hong-Sik;Jung, Tae-Young;Goo, Deok-Mo
    • Journal of Sasang Constitutional Medicine
    • /
    • v.18 no.3
    • /
    • pp.131-144
    • /
    • 2006
  • 1. Objectives This research uses the DNA chip, which includes 16,383 gene code, and various statistic prediction way that shows objectification index for the objectification of constitution diagnosis. 2. Methods Drawing blood whose constitution is confirmed, and analyze its gene information by using 1.7k DNA chip to find the gene correlation through multivariate statistical method. 3. Results and Conclusions Distinctive genes such as AK001919, U09384, NM_001805, X99962, NM_004796, AK026738, AL050148, BC002538, AK027074, AK026219, AF087962, AL390142, NM_015372, AL157466, NM_002446, AK024523, NM_014706, NM_014746 and AL137544 were related to Taeumin; AL157448, NM_005957, NM_005656, NM_017548, AK027246, NM_003025, NM_012302 and NM_005905 were represented in Soeumin, while AK026503, AF147325, NM_002076, AF147307, AK001375, NM_003740, NM_005114, AB007890, NM_005505, NM_015900, NM_014936, Z70694, AB023154, U52076, NM_004360, NM_005835, NM_017528, AF087987, NM_014897, AK021720, NM_006420, AJ277915, AK002118 and AK021918 were for Soyangin. This study figured out the possibility to develop the prediction system by sorting each constitution's gene, and research each constitution's distinctive character of manifestation pattern.

  • PDF

Resistant Principal Factor Analysis

  • Park, Youg-Seok;Byun, Ho-Seon
    • Journal of the Korean Statistical Society
    • /
    • v.25 no.1
    • /
    • pp.67-80
    • /
    • 1996
  • Factor analysis is a multivariate technique for describing the in-terrelationship among many variables in terms of a few underlying but unobservable random variables called factors. There are various approaches for this factor analysis. In particular, principal factor analysis is one of the most popular methods. This follows the mathematical algorithm of the principal component analysis based on the singular value decomposition. But it is known that the singular value decomposition is not resistant, i.e., it is very sensitive to small changes in the input data. In this article, using the resistant singular value decomposition of Choi and Huh (1994), we derive a resistant principal factor analysis relatively little influenced by notable observations.

  • PDF

Variable Selection with Nonconcave Penalty Function on Reduced-Rank Regression

  • Jung, Sang Yong;Park, Chongsun
    • Communications for Statistical Applications and Methods
    • /
    • v.22 no.1
    • /
    • pp.41-54
    • /
    • 2015
  • In this article, we propose nonconcave penalties on a reduced-rank regression model to select variables and estimate coefficients simultaneously. We apply HARD (hard thresholding) and SCAD (smoothly clipped absolute deviation) symmetric penalty functions with singularities at the origin, and bounded by a constant to reduce bias. In our simulation study and real data analysis, the new method is compared with an existing variable selection method using $L_1$ penalty that exhibits competitive performance in prediction and variable selection. Instead of using only one type of penalty function, we use two or three penalty functions simultaneously and take advantages of various types of penalty functions together to select relevant predictors and estimation to improve the overall performance of model fitting.