• Title/Summary/Keyword: MCMC Method

Search Result 103, Processing Time 0.022 seconds

Korean Welfare Panel Data: A Computational Bayesian Method for Ordered Probit Random Effects Models

  • Lee, Hyejin;Kyung, Minjung
    • Communications for Statistical Applications and Methods
    • /
    • v.21 no.1
    • /
    • pp.45-60
    • /
    • 2014
  • We introduce a MCMC sampling for a generalized linear normal random effects model with the ordered probit link function based on latent variables from suitable truncated normal distribution. Such models have proven useful in practice and we have observed numerically reasonable results in the estimation of fixed effects when the random effect term is provided. Applications that utilize Korean Welfare Panel Study data can be difficult to model; subsequently, we find that an ordered probit model with the random effects leads to an improved analyses with more accurate and precise inferences.

A Change-Point Analysis of Oil Supply Disruption : Bayesian Approach (석유공급교란에 대한 변화점 분석 및 분포 추정 : 베이지안 접근)

  • Park, Chun-Gun;Lee, Sung-Su
    • Journal of Korean Society for Quality Management
    • /
    • v.35 no.4
    • /
    • pp.159-165
    • /
    • 2007
  • Using statistical methods a change-point analysis of oil supply disruption is conducted. The statistical distribution of oil supply disruption is a weibull distribution. The detection of the change-point is applied to Bayesian method and weibull parameters are estimated through Markov chain monte carlo and parameter approach. The statistical approaches to the estimation for the change-point and weibull parameters is implemented with the sets of simulated and real data with small sizes of samples.

Sparse Data Cleaning using Multiple Imputations

  • Jun, Sung-Hae;Lee, Seung-Joo;Oh, Kyung-Whan
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.4 no.1
    • /
    • pp.119-124
    • /
    • 2004
  • Real data as web log file tend to be incomplete. But we have to find useful knowledge from these for optimal decision. In web log data, many useful things which are hyperlink information and web usages of connected users may be found. The size of web data is too huge to use for effective knowledge discovery. To make matters worse, they are very sparse. We overcome this sparse problem using Markov Chain Monte Carlo method as multiple imputations. This missing value imputation changes spare web data to complete. Our study may be a useful tool for discovering knowledge from data set with sparseness. The more sparseness of data in increased, the better performance of MCMC imputation is good. We verified our work by experiments using UCI machine learning repository data.

Bayesian Methods for Wavelet Series in Single-Index Models

  • Park, Chun-Gun;Vannucci, Marina;Hart, Jeffrey D.
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2005.04a
    • /
    • pp.83-126
    • /
    • 2005
  • Single-index models have found applications in econometrics and biometrics, where multidimensional regression models are often encountered. Here we propose a nonparametric estimation approach that combines wavelet methods for non-equispaced designs with Bayesian models. We consider a wavelet series expansion of the unknown regression function and set prior distributions for the wavelet coefficients and the other model parameters. To ensure model identifiability, the direction parameter is represented via its polar coordinates. We employ ad hoc hierarchical mixture priors that perform shrinkage on wavelet coefficients and use Markov chain Monte Carlo methods for a posteriori inference. We investigate an independence-type Metropolis-Hastings algorithm to produce samples for the direction parameter. Our method leads to simultaneous estimates of the link function and of the index parameters. We present results on both simulated and real data, where we look at comparisons with other methods.

  • PDF

Bayesian Model for Cost Estimation of Construction Projects

  • Kim, Sang-Yon
    • Journal of the Korea Institute of Building Construction
    • /
    • v.11 no.1
    • /
    • pp.91-99
    • /
    • 2011
  • Bayesian network is a form of probabilistic graphical model. It incorporates human reasoning to deal with sparse data availability and to determine the probabilities of uncertain cases. In this research, bayesian network is adopted to model the problem of construction project cost. General information, time, cost, and material, the four main factors dominating the characteristic of construction costs, are incorporated into the model. This research presents verify a model that were conducted to illustrate the functionality and application of a decision support system for predicting the costs. The Markov Chain Monte Carlo (MCMC) method is applied to estimate parameter distributions. Furthermore, it is shown that not all the parameters are normally distributed. In addition, cost estimates based on the Gibbs output is performed. It can enhance the decision the decision-making process.

Bayesian Variable Selection in the Proportional Hazard Model with Application to DNA Microarray Data

  • Lee, Kyeon-Eun;Mallick, Bani K.
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.357-360
    • /
    • 2005
  • In this paper we consider the well-known semiparametric proportional hazards (PH) models for survival analysis. These models are usually used with few covariates and many observations (subjects). But, for a typical setting of gene expression data from DNA microarray, we need to consider the case where the number of covariates p exceeds the number of samples n. For a given vector of response values which are times to event (death or censored times) and p gene expressions (covariates), we address the issue of how to reduce the dimension by selecting the significant genes. This approach enable us to estimate the survival curve when n < < p. In our approach, rather than fixing the number of selected genes, we will assign a prior distribution to this number. The approach creates additional flexibility by allowing the imposition of constraints, such as bounding the dimension via a prior, which in effect works as a penalty. To implement our methodology, we use a Markov Chain Monte Carlo (MCMC) method. We demonstrate the use of the methodology to diffuse large B-cell lymphoma (DLBCL) complementary DNA(cDNA) data.

  • PDF

Bayesian Model Selection for Nonlinear Regression under Noninformative Prior

  • Na, Jonghwa;Kim, Jeongsuk
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.3
    • /
    • pp.719-729
    • /
    • 2003
  • We propose a Bayesian model selection procedure for nonlinear regression models under noninformative prior. For informative prior, Na and Kim (2002) suggested the Bayesian model selection procedure through MCMC techniques. We extend this method to the case of noninformative prior. The difficulty with the use of noninformative prior is that it is typically improper and hence is defined only up to arbitrary constant. The methods, such as Intrinsic Bayes Factor(IBF) and Fractional Bayes Factor(FBF), are used as a resolution to the problem. We showed the detailed model selection procedure through the specific real data set.

Multiple Comparisons for a Bivariate Exponential Populations Based On Dirichlet Process Priors

  • Cho, Jang-Sik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.18 no.2
    • /
    • pp.553-560
    • /
    • 2007
  • In this paper, we consider two components system which lifetimes have Freund's bivariate exponential model with equal failure rates. We propose Bayesian multiple comparisons procedure for the failure rates of I Freund's bivariate exponential populations based on Dirichlet process priors(DPP). The family of DPP is applied in the form of baseline prior and likelihood combination to provide the comparisons. Computation of the posterior probabilities of all possible hypotheses are carried out through Markov Chain Monte Carlo(MCMC) method, namely, Gibbs sampling, due to the intractability of analytic evaluation. The whole process of multiple comparisons problem for the failure rates of bivariate exponential populations is illustrated through a numerical example.

  • PDF

Bayesian Analysis of a New Skewed Multivariate Probit for Correlated Binary Response Data

  • Kim, Hea-Jung
    • Journal of the Korean Statistical Society
    • /
    • v.30 no.4
    • /
    • pp.613-635
    • /
    • 2001
  • This paper proposes a skewed multivariate probit model for analyzing a correlated binary response data with covariates. The proposed model is formulated by introducing an asymmetric link based upon a skewed multivariate normal distribution. The model connected to the asymmetric multivariate link, allows for flexible modeling of the correlation structure among binary responses and straightforward interpretation of the parameters. However, complex likelihood function of the model prevents us from fitting and analyzing the model analytically. Simulation-based Bayesian inference methodologies are provided to overcome the problem. We examine the suggested methods through two data sets in order to demonstrate their performances.

  • PDF

BAYESIAN HIERARCHICAL MODEL WITH SKEWED ELLIPTICAL DISTRIBUTION

  • Chung, Youn-Shik;Dipak K. Dey;Yang, Tae-Young;Jang, Jung-Hoon
    • Journal of the Korean Statistical Society
    • /
    • v.32 no.4
    • /
    • pp.425-448
    • /
    • 2003
  • Meta-analysis refers to quantitative methods for combining results from independent studies in order to draw overall conclusions. We consider hierarchical models including selection models under a skewed heavy tailed error distribution proposed originally by Chen et al. (1999) and Branco and Dey (2001). These rich classes of models combine the information of independent studies, allowing investigation of variability both between and within studies, and incorporate weight function. Here, the testing for the skewness parameter is discussed. The score test statistic for such a test can be shown to be expressed as the posterior expectations. Also, we consider the detail computational scheme under skewed normal and skewed Student-t distribution using MCMC method. Finally, we introduce one example from Johnson (1993)'s real data and apply our proposed methodology. We investigate sensitivity of our results under different skewed errors and under different prior distributions.