DOI QR코드

DOI QR Code

다형질 Threshold 개체모형에서 Missing 기록을 포함한 이산형 자료에 대한 Bayesian 분석

Bayesian Analysis for Categorical Data with Missing Traits Under a Multivariate Threshold Animal Model

  • 이득환 (한경대학교 동물생명자원학과)
  • Lee, Deuk-Hwan (Dept. of Animal Life and Resources, Hankyong National University)
  • 발행 : 2002.04.30

초록

한우의 근내지방도 또는 임신 여부 등과 같이 이산형 분포의 성질을 갖는 다수의 형질들에 대한 유전모수 및 종축의 유전능력을 평가하기 위한 방법으로써 Threshold 모형하에서 Bayesian 추론방법의 일종인 Gibbs sampling방법을 모의실험을 통하여 알아보았으며 기록이 누락된 다수의 형질을 포함하는 다형질 Threshold 개체모형에서의 종축평가 방법론을 제시하였다. 이산형 형질의 관측치에 대응하는 임의의 잠재변수는 기록을 갖고 있는 형질들에 대한 사전정보를 고려한 사후조건확률분포에서 Gibbs sampling을 할 때 모수에 근접하는 확률분포를 얻을 수 있었으며 이러한 이산형 기록들에 대한 육종가 추정치는 선형모형에서 보다 Threshold 모형에서의 추정치가 실제 모수에 더욱 근접하는 것을 알 수 있었다. 따라서 기록이 누락된 개체들에 대한 이산형 분포를 갖는 형질들에 대하여 선형분포를 갖는 형질들과 함께 동시 유전분석할 때 Threshod 모형이 일반 선형모형 보다 적합함을 알 수 있었다.

Genetic variance and covariance components of the linear traits and the ordered categorical traits, that are usually observed as dichotomous or polychotomous outcomes, were simultaneously estimated in a multivariate threshold animal model with concepts of arbitrary underlying liability scales with Bayesian inference via Gibbs sampling algorithms. A multivariate threshold animal model in this study can be allowed in any combination of missing traits with assuming correlation among the traits considered. Gibbs sampling algorithms as a hierarchical Bayesian inference were used to get reliable point estimates to which marginal posterior means of parameters were assumed. Main point of this study is that the underlying values for the observations on the categorical traits sampled at previous round of iteration and the observations on the continuous traits can be considered to sample the underlying values for categorical data and continuous data with missing at current cycle (see appendix). This study also showed that the underlying variables for missing categorical data should be generated with taking into account for the correlated traits to satisfy the fully conditional posterior distributions of parameters although some of papers (Wang et al., 1997; VanTassell et al., 1998) presented that only the residual effects of missing traits were generated in same situation. In present study, Gibbs samplers for making the fully Bayesian inferences for unknown parameters of interests are played rolls with methodologies to enable the any combinations of the linear and categorical traits with missing observations. Moreover, two kinds of constraints to guarantee identifiability for the arbitrary underlying variables are shown with keeping the fully conditional posterior distributions of those parameters. Numerical example for a threshold animal model included the maternal and permanent environmental effects on a multiple ordered categorical trait as calving ease, a binary trait as non-return rate, and the other normally distributed trait, birth weight, is provided with simulation study.

키워드

참고문헌

  1. Blasco, A. 2001. The Bayesian controversy in animal breeding. J. Anim. Sci. 79:2023-2046.
  2. Box, G. E. P. and Tiao, G. C. 1992. Bayesian inference in statistical analysis. New York. Wiley.
  3. Berger, P. J., Lin, E. C., Van Arendonk, J. and Janss, L. 1995. Properties of genetic parameter estimates from selection experiments by Gibbs sampling. J. Dairy Sci. 78 (Suppl 1):246.
  4. Datta, G. S. and Maiti, T. 1998. Multivariate Bayesian small area estimation: An application to survey and satellite data. The Indian J. of Statistics. 60:344-362.
  5. Djemali, M., Berger, P. J. and Freeman, A. E. 1987. Ordered categorical sire evaluation for dystocia in Holsteins. J. Dairy Sci. 70:2374-2384. https://doi.org/10.3168/jds.S0022-0302(87)80298-9
  6. Foulley, J. L., Gianola, D. and Hoeschele, I. 1987. Empirical Bayes estimation of parameters for n polygenic binary traits. Genet. Sel. Evol. 15:407-424.
  7. Foulley J. L., Gianola, D. and Thompson, R. 1983. Prediction of genetic merit from data on binary and quantitative variates with an application to calving difficulty, birth weight and pelvic opening. Genet. Sel. Evol. 15:407-424.
  8. Gelman, A., Carlin, J. B., Stern, H. S. and Rubin, D. B. 1995. Bayesian data analysis. Chapman & Hall.
  9. Geman, S. and Geman, D. 1984. Stochastic relaxation, Gibbs distributions and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intel. 6:721-741.
  10. Gianola, D. 1982. Theory and analysis of threshold characters. J. Anim. Sci. 54:1079-1096.
  11. Gianola, D. and Fouley, J. L. 1983. Sire evaluation for ordered categorical data with a threshold model. Genet. Sel. Evol. 15:201-244. https://doi.org/10.1186/1297-9686-15-2-201
  12. Hobert, J. and Casella, G. 1996. Effect of improper prior on Gibbs sampling in hierarchical linear mixed model. J. Am. Stat. Assoc. 91:1461- 1473.
  13. Hoeschele, I. B. and Tier, B. 1995. Estimation of variance components of threshold characters by marginal posterior mode and means via Gibbs sampling. Genet. Sel. Evol. 27:519-540.
  14. Jensen, J., Wang, C. S., Sorensen, D. and Gianola, D. 1994. Bayesian inference on variance and covariance components for traits influenced by maternal and direct genetic effects using the Gibbs sampler. Acta Agric Scand, Sect A, Anim. Sci. 44:193-201.
  15. Korsgaard, I. G., Andersen, A. H. and Sorensen, D. 1999. A usefull reparameterisation to obtain samples from conditional inverse Wishart distributions. Genet. Sel. Evol. 31:177-181.
  16. Lee, D. H., Rekaya, R. and Misztal, I. 2002. Analysis of Binary Data: Effect of Different Parameterizations on the Bias of Genetic Parameters. Asian J. Anim. Sci. (Submitted).
  17. Lee, D. H., Misztal, I., Bertrand, J. K. and Rekaya, R. 2001. Bayesian analysis of multiple- linear and categorical traits with varying number of categories. J. Anim. Sci. 79(suppl. 1):342.
  18. Li, S. C. and Lee, D. H. 2002. Bayesian analysis of threshold animal models with Gibbs sampling. Korean J. Stat. Sci. (accepted).
  19. Raftery, A. E. and Lewis, S. M. 1992. How many iterations in the Gibbs sampler? In: Bayesian Statistics IV, Oxford University Press, UK, 763-773.
  20. Rekaya, R. 2001. Bayesian inference in mixed linear model using Dirichlet process prior. J. Anim. Sci. 79(suppl. 1):110.
  21. Sorensen, D. A., Andersen, S., Gianola, D. and Korsgaard, I. 1995. Bayesian inference in threshold models using Gibbs sampling. Genet. Sel. Evol. 27:229-249.
  22. Sorensen, D. 1996. Gibbs sampling in quantitative genetics. Internal reports no. 82. from the Danish Institute of Animal Science.
  23. Sorensen, D., Wang, C. S., Jensen, J. and Gianola, D. 1994. Bayesian analysis of genetic trend due to selection to beef cattle breeding. Genet. Sel. Evol. 25:3-30.
  24. VanTassell, C. P., VanVleck, L. D. and Gregory, K. E. 1998. Bayesian analysis of twinning and ovulation rates using a multiple-trait threshold model and Gibbs sampling. J. Anim. Aci. 76: 2048-2061.
  25. Wang, C. S., Rutledge, J. J. and Gianola, D. 1994. Bayesian analysis of mixed linear models via Gibbs sampling with an application to litter size in Iberian pigs. Genet. Sel. Evol. 26:91-115.
  26. Wang, C. S., Quaas, R. L. and Pollak, E. J. 1997. Bayesian analysis of calving ease scores and birth weights. Genet. Sel. Evol. 20:117-143.
  27. Wright, S. 1934. An analysis of variability in number of digits in an inbred strain of guinea pigs. Genetics 19:506-536.