• Title/Summary/Keyword: Bayesian model class selection

Search Result 10, Processing Time 0.031 seconds

Bayesian Hierarchical Model with Skewed Elliptical Distribution

  • Chung Younshik
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2000.11a
    • /
    • pp.5-12
    • /
    • 2000
  • Meta-analysis refers to quantitative methods for combining results from independent studies in order to draw overall conclusions. We consider hierarchical models including selection models under a skewed heavy tailed error distribution and it is shown to be useful in such Bayesian meta-analysis. A general class of skewed elliptical distribution is reviewed and developed. These rich class of models combine the information of independent studies, allowing investigation of variability both between and within studies, and weight function. Here we investigate sensitivity of results to unobserved studies by considering a hierarchical selection model and use Markov chain Monte Carlo methods to develop inference for the parameters of interest.

  • PDF

A Bayesian Method for Narrowing the Scope fo Variable Selection in Binary Response t-Link Regression

  • Kim, Hea-Jung
    • Journal of the Korean Statistical Society
    • /
    • v.29 no.4
    • /
    • pp.407-422
    • /
    • 2000
  • This article is concerned with the selecting predictor variables to be included in building a class of binary response t-link regression models where both probit and logistic regression models can e approximately taken as members of the class. It is based on a modification of the stochastic search variable selection method(SSVS), intended to propose and develop a Bayesian procedure that used probabilistic considerations for selecting promising subsets of predictor variables. The procedure reformulates the binary response t-link regression setup in a hierarchical truncated normal mixture model by introducing a set of hyperparameters that will be used to identify subset choices. In this setup, the most promising subset of predictors can be identified as that with highest posterior probability in the marginal posterior distribution of the hyperparameters. To highlight the merit of the procedure, an illustrative numerical example is given.

  • PDF

Bayesian ballast damage detection utilizing a modified evolutionary algorithm

  • Hu, Qin;Lam, Heung Fai;Zhu, Hong Ping;Alabi, Stephen Adeyemi
    • Smart Structures and Systems
    • /
    • v.21 no.4
    • /
    • pp.435-448
    • /
    • 2018
  • This paper reports the development of a theoretically rigorous method for permanent way engineers to assess the condition of railway ballast under a concrete sleeper with the potential to be extended to a smart system for long-term health monitoring of railway ballast. Owing to the uncertainties induced by the problems of modeling error and measurement noise, the Bayesian approach was followed in the development. After the selection of the most plausible model class for describing the damage status of the rail-sleeper-ballast system, Bayesian model updating is adopted to calculate the posterior PDF of the ballast stiffness at various regions under the sleeper. An obvious drop in ballast stiffness at a region under the sleeper is an evidence of ballast damage. In model updating, the model that can minimize the discrepancy between the measured and model-predicted modal parameters can be considered as the most probable model for calculating the posterior PDF under the Bayesian framework. To address the problems of non-uniqueness and local minima in the model updating process, a two-stage hybrid optimization method was developed. The modified evolutionary algorithm was developed in the first stage to identify the important regions in the parameter space and resulting in a set of initial trials for deterministic optimization to locate all most probable models in the second stage. The proposed methodology was numerically and experimentally verified. Using the identified model, a series of comprehensive numerical case studies was carried out to investigate the effects of data quantity and quality on the results of ballast damage detection. Difficulties to be overcome before the proposed method can be extended to a long-term ballast monitoring system are discussed in the conclusion.

A Bayesian Method to Semiparametric Hierarchical Selection Models (준모수적 계층적 선택모형에 대한 베이지안 방법)

  • 정윤식;장정훈
    • The Korean Journal of Applied Statistics
    • /
    • v.14 no.1
    • /
    • pp.161-175
    • /
    • 2001
  • Meta-analysis refers to quantitative methods for combining results from independent studies in order to draw overall conclusions. Hierarchical models including selection models are introduced and shown to be useful in such Bayesian meta-analysis. Semiparametric hierarchical models are proposed using the Dirichlet process prior. These rich class of models combine the information of independent studies, allowing investigation of variability both between and within studies, and weight function. Here we investigate sensitivity of results to unobserved studies by considering a hierachical selection model with including unknown weight function and use Markov chain Monte Carlo methods to develop inference for the parameters of interest. Using Bayesian method, this model is used on a meta-analysis of twelve studies comparing the effectiveness of two different types of flouride, in preventing cavities. Clinical informative prior is assumed. Summaries and plots of model parameters are analyzed to address questions of interest.

  • PDF

Bayesian Clustering of Prostate Cancer Patients by Using a Latent Class Poisson Model (잠재그룹 포아송 모형을 이용한 전립선암 환자의 베이지안 그룹화)

  • Oh Man-Suk
    • The Korean Journal of Applied Statistics
    • /
    • v.18 no.1
    • /
    • pp.1-13
    • /
    • 2005
  • Latent Class model has been considered recently by many researchers and practitioners as a tool for identifying heterogeneous segments or groups in a population, and grouping objects into the segments. In this paper we consider data on prostate cancer patients from Korean National Cancer Institute and propose a method for grouping prostate cancer patients by using latent class Poisson model. A Bayesian approach equipped with a Markov chain Monte Carlo method is used to overcome the limit of classical likelihood approaches. Advantages of the proposed Bayesian method are easy estimation of parameters with their standard errors, segmentation of objects into groups, and provision of uncertainty measures for the segmentation. In addition, we provide a method to determine an appropriate number of segments for the given data so that the method automatically chooses the number of segments and partitions objects into heterogeneous segments.

BAYESIAN ROBUST ANALYSIS FOR NON-NORMAL DATA BASED ON A PERTURBED-t MODEL

  • Kim, Hea-Jung
    • Journal of the Korean Statistical Society
    • /
    • v.35 no.4
    • /
    • pp.419-439
    • /
    • 2006
  • The article develops a new class of distributions by introducing a nonnegative perturbing function to $t_\nu$ distribution having location and scale parameters. The class is obtained by using transformations and conditioning. The class strictly includes $t_\nu$ and $skew-t_\nu$ distributions. It provides yet other models useful for selection modeling and robustness analysis. Analytic forms of the densities are obtained and distributional properties are studied. These developments are followed by an easy method for estimating the distribution by using Markov chain Monte Carlo. It is shown that the method is straightforward to specify distribution ally and to implement computationally, with output readily adopted for constructing required criterion. The method is illustrated by using a simulation study.

Investigation of modal identification and modal identifiability of a cable-stayed bridge with Bayesian framework

  • Kuok, Sin-Chi;Yuen, Ka-Veng
    • Smart Structures and Systems
    • /
    • v.17 no.3
    • /
    • pp.445-470
    • /
    • 2016
  • In this study, the Bayesian probabilistic framework is investigated for modal identification and modal identifiability based on the field measurements provided in the structural health monitoring benchmark problem of an instrumented cable-stayed bridge named Ting Kau Bridge (TKB). The comprehensive structural health monitoring system on the cable-stayed TKB has been operated for more than ten years and it is recognized as one of the best test-beds with readily available field measurements. The benchmark problem of the cable-stayed bridge is established to stimulate investigations on modal identifiability and the present paper addresses this benchmark problem from the Bayesian prospective. In contrast to deterministic approaches, an appealing feature of the Bayesian approach is that not only the optimal values of the modal parameters can be obtained but also the associated estimation uncertainty can be quantified in the form of probability distribution. The uncertainty quantification provides necessary information to evaluate the reliability of parametric identification results as well as modal identifiability. Herein, the Bayesian spectral density approach is conducted for output-only modal identification and the Bayesian model class selection approach is used to evaluate the significance of different modes in modal identification. Detailed analysis on the modal identification and modal identifiability based on the measurements of the bridge will be presented. Moreover, the advantages and potentials of Bayesian probabilistic framework on structural health monitoring will be discussed.

Variable selection for latent class analysis using clustering efficiency (잠재변수 모형에서의 군집효율을 이용한 변수선택)

  • Kim, Seongkyung;Seo, Byungtae
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.6
    • /
    • pp.721-732
    • /
    • 2018
  • Latent class analysis (LCA) is an important tool to explore unseen latent groups in multivariate categorical data. In practice, it is important to select a suitable set of variables because the inclusion of too many variables in the model makes the model complicated and reduces the accuracy of the parameter estimates. Dean and Raftery (Annals of the Institute of Statistical Mathematics, 62, 11-35, 2010) proposed a headlong search algorithm based on Bayesian information criteria values to choose meaningful variables for LCA. In this paper, we propose a new variable selection procedure for LCA by utilizing posterior probabilities obtained from each fitted model. We propose a new statistic to measure the adequacy of LCA and develop a variable selection procedure. The effectiveness of the proposed method is also presented through some numerical studies.

Semi-Supervised Learning by Gaussian Mixtures (정규 혼합분포를 이용한 준지도 학습)

  • Choi, Byoung-Jeong;Chae, Youn-Seok;Choi, Woo-Young;Park, Chang-Yi;Koo, Ja-Yong
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.5
    • /
    • pp.825-833
    • /
    • 2008
  • Discriminant analysis based on Gaussian mixture models, an useful tool for multi-class classifications, can be extended to semi-supervised learning. We consider a model selection problem for a Gaussian mixture model in semi-supervised learning. More specifically, we adopt Bayesian information criterion to determine the number of subclasses in the mixture model. Through simulations, we illustrate the usefulness of the criterion.

Estimation of Optimal Mixture Number of GMM for Environmental Sounds Recognition (환경음 인식을 위한 GMM의 혼합모델 개수 추정)

  • Han, Da-Jeong;Park, Aa-Ron;Baek, Sung-June
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.13 no.2
    • /
    • pp.817-821
    • /
    • 2012
  • In this paper we applied the optimal mixture number estimation technique in GMM(Gaussian mixture model) using BIC(Bayesian information criterion) and MDL(minimum description length) as a model selection criterion for environmental sounds recognition. In the experiment, we extracted 12 MFCC(mel-frequency cepstral coefficients) features from 9 kinds of environmental sounds which amounts to 27747 data and classified them with GMM. As mentioned above, BIC and MDL is applied to estimate the optimal number of mixtures in each environmental sounds class. According to the experimental results, while the recognition performances are maintained, the computational complexity decreases by 17.8% with BIC and 31.7% with MDL. It shows that the computational complexity reduction by BIC and MDL is effective for environmental sounds recognition using GMM.