• 제목/요약/키워드: Dirichlet process mixture model

검색결과 12건 처리시간 0.02초

Dirichlet Process Mixtures of Linear Mixed Regressions

  • Kyung, Minjung
    • Communications for Statistical Applications and Methods
    • /
    • 제22권6호
    • /
    • pp.625-637
    • /
    • 2015
  • We develop a Bayesian clustering procedure based on a Dirichlet process prior with cluster specific random effects. Gibbs sampling of a normal mixture of linear mixed regressions with a Dirichlet process was implemented to calculate posterior probabilities when the number of clusters was unknown. Our approach (unlike its counterparts) provides simultaneous partitioning and parameter estimation with the computation of the classification probabilities. A Monte Carlo study of curve estimation results showed that the model was useful for function estimation. We find that the proposed Dirichlet process mixture model with cluster specific random effects detects clusters sensitively by combining vague edges into different clusters. Examples are given to show how these models perform on real data.

비모수 베이지안 겉보기 무관 회귀모형 (A nonparametric Bayesian seemingly unrelated regression model)

  • 조성일;석인혜;최태련
    • 응용통계연구
    • /
    • 제29권4호
    • /
    • pp.627-641
    • /
    • 2016
  • 본 논문에서는 겉보기 무관 회귀모형을 고려하고 디리크레 프로세스 혼합모형을 오차항의 분포로 하는 비모수 베이지안 방법을 제안한다. 제안된 모형을 바탕으로 사후분포를 유도하고 디리크레 프로세스 혼합모형의 붕괴깁스표집 방법을 통해 마코프 체인 몬테 칼로 알고리듬을 구성하고 사후추론을 실시한다. 모형의 성능을 비교하기 위해 모의실험을 실시하고, 더 나아가 한국지역의 강수량 예측에 대한 실제 자료에 적용해 본다.

Bayesian analysis of random partition models with Laplace distribution

  • Kyung, Minjung
    • Communications for Statistical Applications and Methods
    • /
    • 제24권5호
    • /
    • pp.457-480
    • /
    • 2017
  • We develop a random partition procedure based on a Dirichlet process prior with Laplace distribution. Gibbs sampling of a Laplace mixture of linear mixed regressions with a Dirichlet process is implemented as a random partition model when the number of clusters is unknown. Our approach provides simultaneous partitioning and parameter estimation with the computation of classification probabilities, unlike its counterparts. A full Gibbs-sampling algorithm is developed for an efficient Markov chain Monte Carlo posterior computation. The proposed method is illustrated with simulated data and one real data of the energy efficiency of Tsanas and Xifara (Energy and Buildings, 49, 560-567, 2012).

디리슈레 혼합모형을 이용한 함정 전투체계 부품의 고장시간 분포 추정 (An Application of Dirichlet Mixture Model for Failure Time Density Estimation to Components of Naval Combat System)

  • 이진환;김정훈;정봉주;김경택
    • 산업경영시스템학회지
    • /
    • 제42권4호
    • /
    • pp.194-202
    • /
    • 2019
  • Reliability analysis of the components frequently starts with the data that manufacturer provides. If enough failure data are collected from the field operations, the reliability should be recomputed and updated on the basis of the field failure data. However, when the failure time record for a component contains only a few observations, all statistical methodologies are limited. In this case, where the failure records for multiple number of identical components are available, a valid alternative is combining all the data from each component into one data set with enough sample size and utilizing the useful information in the censored data. The ROK Navy has been operating multiple Patrol Killer Guided missiles (PKGs) for several years. The Korea Multi-Function Control Console (KMFCC) is one of key components in PKG combat system. The maintenance record for the KMFCC contains less than ten failure observations and a censored datum. This paper proposes a Bayesian approach with a Dirichlet mixture model to estimate failure time density for KMFCC. Trends test for each component record indicated that null hypothesis, that failure occurrence is renewal process, is not rejected. Since the KMFCCs have been functioning under different operating environment, the failure time distribution may be a composition of a number of unknown distributions, i.e. a mixture distribution, rather than a single distribution. The Dirichlet mixture model was coded as probabilistic programming in Python using PyMC3. Then Markov Chain Monte Carlo (MCMC) sampling technique employed in PyMC3 probabilistically estimated the parameters' posterior distribution through the Dirichlet mixture model. The simulation results revealed that the mixture models provide superior fits to the combined data set over single models.

Nonparametric Bayesian methods: a gentle introduction and overview

  • MacEachern, Steven N.
    • Communications for Statistical Applications and Methods
    • /
    • 제23권6호
    • /
    • pp.445-466
    • /
    • 2016
  • Nonparametric Bayesian methods have seen rapid and sustained growth over the past 25 years. We present a gentle introduction to the methods, motivating the methods through the twin perspectives of consistency and false consistency. We then step through the various constructions of the Dirichlet process, outline a number of the basic properties of this process and move on to the mixture of Dirichlet processes model, including a quick discussion of the computational methods used to fit the model. We touch on the main philosophies for nonparametric Bayesian data analysis and then reanalyze a famous data set. The reanalysis illustrates the concept of admissibility through a novel perturbation of the problem and data, showing the benefit of shrinkage estimation and the much greater benefit of nonparametric Bayesian modelling. We conclude with a too-brief survey of fancier nonparametric Bayesian methods.

Nonparametric Bayesian Multiple Change Point Problems

  • Kim, Chansoo;Younshik Chung
    • Journal of the Korean Statistical Society
    • /
    • 제31권1호
    • /
    • pp.1-16
    • /
    • 2002
  • Since changepoint identification is important in many data analysis problem, we wish to make inference about the locations of one or more changepoints of the sequence. We consider the Bayesian nonparameteric inference for multiple changepoint problem using a Bayesian segmentation procedure proposed by Yang and Kuo (2000). A mixture of products of Dirichlet process is used as a prior distribution. To decide whether there exists a single change or not, our approach depends on nonparametric Bayesian Schwartz information criterion at each step. We discuss how to choose the precision parameter (total mass parameter) in nonparametric setting and show that the discreteness of the Dirichlet process prior can ha17e a large effect on the nonparametric Bayesian Schwartz information criterion and leads to conclusions that are very different results from reasonable parametric model. One example is proposed to show this effect.

Multiple Comparisons for a Bivariate Exponential Populations Based On Dirichlet Process Priors

  • Cho, Jang-Sik
    • Journal of the Korean Data and Information Science Society
    • /
    • 제18권2호
    • /
    • pp.553-560
    • /
    • 2007
  • In this paper, we consider two components system which lifetimes have Freund's bivariate exponential model with equal failure rates. We propose Bayesian multiple comparisons procedure for the failure rates of I Freund's bivariate exponential populations based on Dirichlet process priors(DPP). The family of DPP is applied in the form of baseline prior and likelihood combination to provide the comparisons. Computation of the posterior probabilities of all possible hypotheses are carried out through Markov Chain Monte Carlo(MCMC) method, namely, Gibbs sampling, due to the intractability of analytic evaluation. The whole process of multiple comparisons problem for the failure rates of bivariate exponential populations is illustrated through a numerical example.

  • PDF

Online nonparametric Bayesian analysis of parsimonious Gaussian mixture models and scenes clustering

  • Zhou, Ri-Gui;Wang, Wei
    • ETRI Journal
    • /
    • 제43권1호
    • /
    • pp.74-81
    • /
    • 2021
  • The mixture model is a very powerful and flexible tool in clustering analysis. Based on the Dirichlet process and parsimonious Gaussian distribution, we propose a new nonparametric mixture framework for solving challenging clustering problems. Meanwhile, the inference of the model depends on the efficient online variational Bayesian approach, which enhances the information exchange between the whole and the part to a certain extent and applies to scalable datasets. The experiments on the scene database indicate that the novel clustering framework, when combined with a convolutional neural network for feature extraction, has meaningful advantages over other models.

준모수적 계층적 선택모형에 대한 베이지안 방법 (A Bayesian Method to Semiparametric Hierarchical Selection Models)

  • 정윤식;장정훈
    • 응용통계연구
    • /
    • 제14권1호
    • /
    • pp.161-175
    • /
    • 2001
  • 메타분석(Meta-analysis)은 서로 독립적으로 연구되어진 결과들을 전체적인 하나의 결과로 도출하기 위해 사용되어지는 통계적 방법이다. 이러한 통계적 방법을 설명할 모형으로는 선택모형(selection model)을 포함한 계층적 모형(hierarchical model)을 사용하며, 이러한 모형들은 베이지안 메타분석에 유용한 것으로 알려져 있다. 그러나, 메타분석의 자료들은 일반적으로 출판편의(publication bias)를 갖고 있으므로 이를 극복하고자 가중함수(weight function)를 이용하여 분포함수를 새롭게 정의하여 사용한다. 최근에 Silliman(1997)은 계층적 모형(hierarchical model)에 가중함수를 첨부한 계층적 선택모형(hierarchical selection model)을 정의하고 모수적 베이지안 방법을 제시하였다. 본 연구에서는 미관측된 연구효과에 디리슈레 과정 사전분포(Dirichlet process prior)를 적용한 준모수적 계층적 선택모형(semiparametric hierarchical selection models)을 소개한다. 여기서 제시된 준모수적 계층적 선택모형을 베이지안 방법으로 추정하기 위하여 마코프 연쇄 몬테칼로(Markov chain Monte Carlo)방법을 이용한다. 제시된 방법을 적용하기 위하여 실제 자료(Johnson, 1993)인 충치를 예방하기 위한 두 가지의 예방약의 효과에 대한 차이를 비교하기 위해 얻어진 12개의 연구를 이용하여 메타분석을 한다.

  • PDF

Estimating dose-response curves using splines: a nonparametric Bayesian knot selection method

  • Lee, Jiwon;Kim, Yongku;Kim, Young Min
    • Communications for Statistical Applications and Methods
    • /
    • 제29권3호
    • /
    • pp.287-299
    • /
    • 2022
  • In radiation epidemiology, the excess relative risk (ERR) model is used to determine the dose-response relationship. In general, the dose-response relationship for the ERR model is assumed to be linear, linear-quadratic, linear-threshold, quadratic, and so on. However, since none of these functions dominate other functions for expressing the dose-response relationship, a Bayesian semiparametric method using splines has recently been proposed. Thus, we improve the Bayesian semiparametric method for the selection of the tuning parameters for splines as the number and location of knots using a Bayesian knot selection method. Equally spaced knots cannot capture the characteristic of radiation exposed dose distribution which is highly skewed in general. Therefore, we propose a nonparametric Bayesian knot selection method based on a Dirichlet process mixture model. Inference of the spline coefficients after obtaining the number and location of knots is performed in the Bayesian framework. We apply this approach to the life span study cohort data from the radiation effects research foundation in Japan, and the results illustrate that the proposed method provides competitive curve estimates for the dose-response curve and relatively stable credible intervals for the curve.