• 제목/요약/키워드: Bayesian statistical method

검색결과 306건 처리시간 0.024초

베이지안 통계 추론 (On the Bayesian Statistical Inference)

  • 이호석
    • 한국정보과학회:학술대회논문집
    • /
    • 한국정보과학회 2007년도 한국컴퓨터종합학술대회논문집 Vol.34 No.1 (C)
    • /
    • pp.263-266
    • /
    • 2007
  • 본 논문은 베이지안 통계 추론에 대하여 논의한다. 논문은 베이지안 추론, Markov Chain과 Monte Carlo 적분, MCMC(Markov Chain Monte Carlo) 기법, Metropolis-Hastings 알고리즘, Gibbs 샘플링, Maximum Likelihood Estimation, EM 알고리즘, 상실된 데이터 보완 기법, BMA(Bayesian Model Averaging) 순서로 논의를 진행한다. 이러한 통계적 기법들은 대용량의 데이터를 처리하는 생물학, 의학, 생명 공학, 과학과 공학, 그리고 일반 데이터 조사와 처리 등에 사용되고 있으며, 최적의 추론 결과를 이끌어 내는데 중요한 방법을 제공하고 있다. 그리고 마지막으로 PC(Principal Component) 분석 기법에 대하여 논의한다. PC 분석 기법도 데이터 분석과 연구에 많이 활용된다.

  • PDF

A Parametric Empirical Bayesian Method for Multiple Comparisons

  • Kim, Woo-Chul;Hwang, Hyung-Tae
    • Journal of the Korean Statistical Society
    • /
    • 제20권1호
    • /
    • pp.44-56
    • /
    • 1991
  • For all pairwise comparisons of treatments, Bayesian simultaneous confidence intervals are proposed and studied. First Bayesian solutions are obtained for a fixed prior, and then prior parameters are estimated by a parametric empirical Bayesian method. The nominal confidence level is shown to be controlled asymptotically. An extension to the unbalanced design is also considered.

  • PDF

ON BAYESIAN ESTIMATION AND PROPERTIES OF THE MARGINAL DISTRIBUTION OF A TRUNCATED BIVARIATE t-DISTRIBUTION

  • KIM HEA-JUNG;KIM Ju SUNG
    • Journal of the Korean Statistical Society
    • /
    • 제34권3호
    • /
    • pp.245-261
    • /
    • 2005
  • The marginal distribution of X is considered when (X, Y) has a truncated bivariate t-distribution. This paper mainly focuses on the marginal nontruncated distribution of X where Y is truncated below at its mean and its observations are not available. Several properties and applications of this distribution, including relationship with Azzalini's skew-normal distribution, are obtained. To circumvent inferential problem arises from adopting the frequentist's approach, a Bayesian method utilizing a data augmentation method is suggested. Illustrative examples demonstrate the performance of the method.

통계모델링 방법의 비교 연구 (A Comparison Study on Statistical Modeling Methods)

  • 노유정
    • 한국산학기술학회논문지
    • /
    • 제17권5호
    • /
    • pp.645-652
    • /
    • 2016
  • 입력 랜덤 변수(input random variable)의 통계 모델링은 기계시스템의 신뢰성 해석(reliability analysis), 신뢰성 기반 설계(reliability-based design optimization), 해석모델의 통계적 검정(validation) 및 보정(calibration)을 위해 반드시 필요하다. 대표적인 통계모델링 기법에는 Akaike Information Criterion (AIC), AIC correction (AICc), Bayesian Information Criterion, Maximum Likelihood Estimation (MLE), Bayesian 방법 등이 있다. 이러한 방법들은 기본적으로 주어진 데이터로부터 후보 모델의 우도함수값을 이용하여 후보 모델 중 가장 적합한 모델을 선택하는 방법이며, 방법에 따라 데이터 수 혹은 파라미터의 수를 고려하여 모델을 선정한다. 하지만 실제 현장에서 데이터의 통계모델링을 하는 엔지니어는 각 방법의 장단점에 대한 이해가 부족하여 어떤 방법이 정확한 방법인지 몰라 통계모델링 수행 시 어려움이 있다. 본 논문에서는 다양한 통계모델링 방법들을 비교하고 각 방법의 장단점 분석을 통해 가장 적합한 모델링 기법을 제안하고자 한다. 각 방법의 검증을 위해 다양한 모분포를 가정하고 다양한 사이즈의 샘플을 임의로 생성하여 시뮬레이션을 수행하였으며, 실제 공학 데이터를 사용하여 통계모델링 방법의 유효성을 검증하였다.

통계적 추론에 있어서 베이지안과 고전적 방법(신뢰성 분석과 관련하여)

  • 박태룡
    • 한국수학사학회지
    • /
    • 제11권1호
    • /
    • pp.68-77
    • /
    • 1998
  • There are two approach methods widely in statistical inferences. First is sampling theory methods and the other is Bayesian methods. In this paper, we will introduce the most basic differences of the two approach methods. Especially, we investigate and introduce the historical origin of Bayesian methods in Statistical inferences which is currently used. Also, we introduce the some characteristics of sampling theory method and Bayesian methods.

  • PDF

Statistical Method for Implementing the Experimenter Effect in the Analysis of Gene Expression Data

  • Kim, In-Young;Rha, Sun-Young;Kim, Byung-Soo
    • Communications for Statistical Applications and Methods
    • /
    • 제13권3호
    • /
    • pp.701-718
    • /
    • 2006
  • In cancer microarray experiments, the experimenter or patient which is nested in each experimenter often shows quite heterogeneous error variability, which should be estimated for identifying a source of variation. Our study describes a Bayesian method which utilizes clinical information for identifying a set of DE genes for the class of subtypes as well as assesses and examines the experimenter effect and patient effect which is nested in each experimenter as a source of variation. We propose a Bayesian multilevel mixed effect model based on analysis of covariance (ANACOVA). The Bayesian multilevel mixed effect model is a combination of the multilevel mixed effect model and the Bayesian hierarchical model, which provides a flexible way of defining a suitable correlation structure among genes.

Leave-one-out Bayesian model averaging for probabilistic ensemble forecasting

  • Kim, Yongdai;Kim, Woosung;Ohn, Ilsang;Kim, Young-Oh
    • Communications for Statistical Applications and Methods
    • /
    • 제24권1호
    • /
    • pp.67-80
    • /
    • 2017
  • Over the last few decades, ensemble forecasts based on global climate models have become an important part of climate forecast due to the ability to reduce uncertainty in prediction. Moreover in ensemble forecast, assessing the prediction uncertainty is as important as estimating the optimal weights, and this is achieved through a probabilistic forecast which is based on the predictive distribution of future climate. The Bayesian model averaging has received much attention as a tool of probabilistic forecasting due to its simplicity and superior prediction. In this paper, we propose a new Bayesian model averaging method for probabilistic ensemble forecasting. The proposed method combines a deterministic ensemble forecast based on a multivariate regression approach with Bayesian model averaging. We demonstrate that the proposed method is better in prediction than the standard Bayesian model averaging approach by analyzing monthly average precipitations and temperatures for ten cities in Korea.

Application of Bayesian Statistical Analysis to Multisource Data Integration

  • Hong, Sa-Hyun;Moon, Wooil-M.
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2002년도 Proceedings of International Symposium on Remote Sensing
    • /
    • pp.394-399
    • /
    • 2002
  • In this paper, Multisource data classification methods based on Bayesian formula are considered. For this decision fusion scheme, the individual data sources are handled separately by statistical classification algorithms and then Bayesian fusion method is applied to integrate from the available data sources. This method includes the combination of each expert decisions where the weights of the individual experts represent the reliability of the sources. The reliability measure used in the statistical approach is common to all pixels in previous work. In this experiment, the weight factors have been assigned to have different value for all pixels in order to improve the integrated classification accuracies. Although most implementations of Bayesian classification approaches assume fixed a priori probabilities, we have used adaptive a priori probabilities by iteratively calculating the local a priori probabilities so as to maximize the posteriori probabilities. The effectiveness of the proposed method is at first demonstrated on simulations with artificial and evaluated in terms of real-world data sets. As a result, we have shown that Bayesian statistical fusion scheme performs well on multispectral data classification.

  • PDF

Estimating dose-response curves using splines: a nonparametric Bayesian knot selection method

  • Lee, Jiwon;Kim, Yongku;Kim, Young Min
    • Communications for Statistical Applications and Methods
    • /
    • 제29권3호
    • /
    • pp.287-299
    • /
    • 2022
  • In radiation epidemiology, the excess relative risk (ERR) model is used to determine the dose-response relationship. In general, the dose-response relationship for the ERR model is assumed to be linear, linear-quadratic, linear-threshold, quadratic, and so on. However, since none of these functions dominate other functions for expressing the dose-response relationship, a Bayesian semiparametric method using splines has recently been proposed. Thus, we improve the Bayesian semiparametric method for the selection of the tuning parameters for splines as the number and location of knots using a Bayesian knot selection method. Equally spaced knots cannot capture the characteristic of radiation exposed dose distribution which is highly skewed in general. Therefore, we propose a nonparametric Bayesian knot selection method based on a Dirichlet process mixture model. Inference of the spline coefficients after obtaining the number and location of knots is performed in the Bayesian framework. We apply this approach to the life span study cohort data from the radiation effects research foundation in Japan, and the results illustrate that the proposed method provides competitive curve estimates for the dose-response curve and relatively stable credible intervals for the curve.

신뢰성 해석을 위한 결합분포함수의 통계모델링 (Statistical Modeling of Joint Distribution Functions for Reliability Analysis)

  • 노유정;이상진
    • 한국산학기술학회논문지
    • /
    • 제15권5호
    • /
    • pp.2603-2609
    • /
    • 2014
  • 기계시스템의 신뢰성 해석을 위해서는 기계시스템에 성능을 미치는 변수의 확률 분포와 파라미터를 결정하는 통계적 모델링은 반드시 필요하다. 하지만, 신뢰성 해석에서 상당수의 변수는 상관관계가 있음에도 불구하고 독립변수로 취급되거나 실험데이터 수가 부족하다는 이유로 통계 모델에 대한 잘못된 가정을 하는 경우가 많다. 본 연구에서는 베이지안 방법을 이용하여 상관관계를 갖는 데이터의 결합분포함수를 copula를 이용하여 모델링함으로써 적은 수의 데이터로부터 정확한 입력모델을 산정하는 방법을 제안하였으며, 방법의 검증을 위해 다양한 상관계수와 데이터 수에 대해 통계 시뮬레이션을 수행하였다. 그 결과 Bayesian방법은 상관계수가 낮아 후보함수가 유사하거나 샘플수가 적어 정확한 모델을 산정하기 어려운 경우에도 후보 copula 중 실제 copula와 가장 근사한 후보 copula를 선정하였다. 이러한 근사 후보 copula는 신뢰성 해석결과 역시 실제 copula 함수를 이용한 신뢰성 해석 결과와 유사한 결과를 가짐을 확인할 수 있으므로 베이지안 방법은 신뢰성 해석을 위해 정확한 통계모델링을 제공함을 알 수 있다.