• Title/Summary/Keyword: Bayesian linear model (베이지안 선형모형)

Search Result 38

Bayesian analysis of latent factor regression model (내재된 인자회귀모형의 베이지안 분석법)

  • Kyung, Minjung
    • The Korean Journal of Applied Statistics / v.33 no.4 / pp.365-377 / 2020
  • We discuss latent factor regression, which constructs a common structure inherent among the explanatory variables to address multicollinearity and uses the resulting factors as regressors in a linear model for the response variable. We use Bayesian estimation with a LASSO prior with a large penalty parameter to construct a significant factor loading matrix of intrinsic interest among infinitely many latent structures. The estimated factor loading matrix, together with the other estimated parameters, can be transformed back into linear coefficients for each explanatory variable and used as a prediction model for new observations. We apply the proposed method to the Product Service Management data of HBAT and observe that, for a fixed number of factors, it constructs the same factors as general common factor analysis. The MSE of the predicted values from the Bayesian latent factor regression model is also smaller than that of the common factor regression model.

A Hierarchical Bayesian Modeling of Temporal Trends in Return Levels for Extreme Precipitations (한국지역 집중호우에 대한 반환주기의 베이지안 모형 분석)

  • Kim, Yongku
    • The Korean Journal of Applied Statistics / v.28 no.2 / pp.137-149 / 2015
  • Flood planning needs to account for trends in extreme precipitation events. In particular, the r-year return level is a common measure of extreme events. In this paper, we present a nonstationary temporal model for precipitation return levels using hierarchical Bayesian modeling. For intensity, we model the annual maximum daily precipitation measured in Korea with a generalized extreme value (GEV) distribution. The temporal dependence among the return levels is incorporated through a model for the GEV parameters and a linear model with autoregressive error terms. We apply the proposed model to precipitation data collected from various stations in Korea from 1973 to 2011.
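
The r-year return level mentioned in this abstract can be illustrated with the standard stationary GEV formula. The sketch below is not the paper's hierarchical model: it only shows how a return level follows from one fixed set of GEV parameters, and the function names and all parameter values are hypothetical.

```python
import math

def gev_cdf(z, mu, sigma, xi):
    """GEV distribution function with location mu, scale sigma > 0, shape xi."""
    if xi == 0.0:
        # Gumbel limit of the GEV family
        return math.exp(-math.exp(-(z - mu) / sigma))
    t = 1.0 + xi * (z - mu) / sigma
    if t <= 0.0:
        # z lies outside the support: below it when xi > 0, above it when xi < 0
        return 0.0 if xi > 0 else 1.0
    return math.exp(-t ** (-1.0 / xi))

def return_level(r, mu, sigma, xi):
    """Level that the annual maximum exceeds with probability 1/r in a given year."""
    y = -math.log(1.0 - 1.0 / r)
    if xi == 0.0:
        return mu - sigma * math.log(y)
    return mu - (sigma / xi) * (1.0 - y ** (-xi))

# Hypothetical parameter values, chosen only for illustration (units: mm/day)
z50 = return_level(50, mu=100.0, sigma=30.0, xi=0.1)
print(round(z50, 1), round(gev_cdf(z50, 100.0, 30.0, 0.1), 4))
```

In a nonstationary setting such as the one the abstract describes, the parameters would vary over time, so the return level would be recomputed for each year from that year's parameter values.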

Study to the randomized response model (확률응답모형에 관한 연구)

  • 이영진
    • The Korean Journal of Applied Statistics / v.4 no.2 / pp.179-193 / 1991
  • In this paper, we introduce various randomized response (PR) techniques, initiated by S. Warner in the 1960s, and examine their maximum likelihood estimators. One main subject of this paper is to represent the Warner model, the unrelated question model, and the multi-proportion model as linear models. The other is to study inference for the PR model using the Bayesian approach.
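
As a small illustration of the Warner model named in this abstract, the sketch below computes the textbook estimator of the sensitive proportion. It assumes the usual design in which each respondent answers the sensitive question with probability p and its complement with probability 1 - p, so that P(yes) = p·π + (1 - p)(1 - π). The function name and the numbers are hypothetical, and this is not taken from the paper itself.

```python
def warner_estimate(n_yes, n, p):
    """Estimate the sensitive proportion pi under Warner's design.

    n_yes: number of "yes" answers, n: sample size,
    p: known probability that the randomizing device selects the sensitive question.
    Returns (pi_hat, var_hat); pi_hat is clipped to [0, 1], which gives the MLE.
    """
    if p == 0.5:
        raise ValueError("p = 0.5 makes pi unidentifiable")
    lam_hat = n_yes / n                       # observed proportion of "yes"
    raw = (lam_hat - (1.0 - p)) / (2.0 * p - 1.0)
    pi_hat = min(1.0, max(0.0, raw))
    # Plug-in variance of the unclipped estimator
    var_hat = lam_hat * (1.0 - lam_hat) / (n * (2.0 * p - 1.0) ** 2)
    return pi_hat, var_hat

# Hypothetical survey: 40 "yes" answers out of 100, with p = 0.7
print(warner_estimate(40, 100, 0.7))
```

The variance grows quickly as p approaches 0.5, which is the price paid for giving respondents more privacy.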

Bayesian logit models with auxiliary mixture sampling for analyzing diabetes diagnosis data (보조 혼합 샘플링을 이용한 베이지안 로지스틱 회귀모형 : 당뇨병 자료에 적용 및 분류에서의 성능 비교)

  • Rhee, Eun Hee;Hwang, Beom Seuk
    • The Korean Journal of Applied Statistics / v.35 no.1 / pp.131-146 / 2022
  • Logit models are commonly used to predict and classify categorical response variables. Most Bayesian approaches to logit models are implemented with the Metropolis-Hastings algorithm. However, the algorithm suffers from slow convergence and from the difficulty of ensuring that the proposal distribution is adequate. Therefore, we use the auxiliary mixture sampler proposed by Frühwirth-Schnatter and Frühwirth (2007) to estimate logit models. This method introduces two sequences of auxiliary latent variables so that the logit model satisfies normality and linearity. As a result, the logit model can be easily implemented by Gibbs sampling. We applied the proposed method to diabetes data from the Community Health Survey (2020) of the Korea Disease Control and Prevention Agency and compared its performance with that of the Metropolis-Hastings algorithm. In addition, we showed that the logit model using auxiliary mixture sampling achieves classification performance comparable to that of machine learning models.

Introduction to variational Bayes for high-dimensional linear and logistic regression models (고차원 선형 및 로지스틱 회귀모형에 대한 변분 베이즈 방법 소개)

  • Jang, Insong;Lee, Kyoungjae
    • The Korean Journal of Applied Statistics / v.35 no.3 / pp.445-455 / 2022
  • In this paper, we introduce existing Bayesian methods for high-dimensional sparse regression models and compare their performance in various simulation scenarios. In particular, we focus on the variational Bayes approach proposed by Ray and Szabó (2021), which enables scalable and accurate Bayesian inference. Based on simulated data sets from sparse high-dimensional linear regression models, we compare the variational Bayes approach with other Bayesian and frequentist methods. To check the practical performance of the variational Bayes approach in logistic regression models, a real data analysis is conducted using a leukemia data set.

A Bayesian Regression Model to Estimate the Deterioration Rate of Track Irregularities (궤도틀림 진전율 추정을 위한 베이지안 회귀분석 모형 연구)

  • Park, Bum Hwan
    • Journal of the Korean Society for Railway / v.19 no.4 / pp.547-554 / 2016
  • This study considered how to estimate the deterioration rate of the track quality index, which represents track geometric irregularity. Most existing studies have used simple linear regression and regarded the slope of the regression equation as the progress rate. In this paper, we present a Bayesian approach to estimating the progress of track irregularity. This Bayesian approach has many advantages, the biggest of which is that it can formally include a prior distribution for the parameters, derived from historic data or from expert experience; the rate can then be expressed as a probability distribution. We investigated the possibility of applying the Bayesian method to the estimation of the deterioration rate by comparing our Bayesian approach with the conventional linear regression approach.
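
The idea of expressing the rate as a probability distribution can be sketched with a conjugate normal model. The snippet below is a minimal illustration and not the paper's model: it assumes a straight line y = a + b·t with known noise variance and independent normal priors on the intercept a and the slope b. The function name `posterior_line` and all numbers are hypothetical.

```python
def posterior_line(t, y, prior_mean, prior_var, noise_var):
    """Posterior mean and covariance of (intercept, slope) for y = a + b*t + noise.

    prior_mean, prior_var: pairs giving independent normal priors on (a, b).
    noise_var: known variance of the normal observation noise (an assumption).
    """
    n = len(t)
    st, stt = sum(t), sum(x * x for x in t)
    sy, sty = sum(y), sum(x * v for x, v in zip(t, y))
    # Posterior precision = prior precision + X^T X / noise_var
    p00 = 1.0 / prior_var[0] + n / noise_var
    p01 = st / noise_var
    p11 = 1.0 / prior_var[1] + stt / noise_var
    # Right-hand side = prior precision * prior mean + X^T y / noise_var
    r0 = prior_mean[0] / prior_var[0] + sy / noise_var
    r1 = prior_mean[1] / prior_var[1] + sty / noise_var
    det = p00 * p11 - p01 * p01
    cov = ((p11 / det, -p01 / det), (-p01 / det, p00 / det))
    mean = (cov[0][0] * r0 + cov[0][1] * r1, cov[1][0] * r0 + cov[1][1] * r1)
    return mean, cov

# Hypothetical inspection times and track quality index readings
t = [0.0, 1.0, 2.0, 3.0]
y = [1.0, 3.0, 5.0, 7.0]
mean, cov = posterior_line(t, y, prior_mean=(0.0, 0.0), prior_var=(1e8, 1e8), noise_var=1.0)
print(mean[1], cov[1][1])  # posterior mean and variance of the slope (the rate)
```

With a very diffuse prior the posterior mean of the slope is close to the least-squares slope; a tighter prior, for example one built from historic data, pulls the estimate toward the prior mean and narrows the posterior.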

Bayesian Analysis of a Zero-inflated Poisson Regression Model: An Application to Korean Oral Hygienic Data (영과잉 포아송 회귀모형에 대한 베이지안 추론: 구강위생 자료에의 적용)

  • Lim, Ah-Kyoung;Oh, Man-Suk
    • The Korean Journal of Applied Statistics / v.19 no.3 / pp.505-519 / 2006
  • We consider zero-inflated count data, which is discrete count data with too many zeroes compared to the Poisson distribution. Zero-inflated data can be found in various areas. Despite its increasing importance in practice, appropriate statistical inference on zero-inflated data is limited. Classical inference based on large-sample theory does not fit unless the sample size is very large. Moreover, the regular Poisson model shows lack of fit due to the many zeroes. To handle these difficulties, a mixture of distributions is considered for the zero-inflated data. Specifically, a mixture of a point mass at zero and a Poisson distribution is employed for the data. In addition, when there exist meaningful covariates related to the response variable, a log-linear link is used between the mean of the response and the covariates in the Poisson part. We propose Bayesian inference for the zero-inflated Poisson regression model using a Markov chain Monte Carlo method. We applied the proposed method to Korean oral hygiene data and compared the inference results with those of other models. We found that the proposed method is superior in that it gives smaller parameter estimation error and more accurate predictions.
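
The mixture described in this abstract (a point mass at zero plus a Poisson part with a log-linear link) can be written down directly. The sketch below only evaluates the probability mass function and the log-likelihood; it does not implement the paper's MCMC sampler. It assumes a single constant mixing weight `pi`, and the function names and values are hypothetical.

```python
import math

def zip_pmf(k, pi, lam):
    """P(Y = k) for a zero-inflated Poisson: weight pi at zero, Poisson(lam) otherwise."""
    poisson = math.exp(-lam) * lam ** k / math.factorial(k)
    if k == 0:
        return pi + (1.0 - pi) * poisson
    return (1.0 - pi) * poisson

def zip_loglik(beta, pi, X, y):
    """Log-likelihood with the log-linear link lam_i = exp(x_i . beta)."""
    total = 0.0
    for x_row, k in zip(X, y):
        lam = math.exp(sum(b * x for b, x in zip(beta, x_row)))
        total += math.log(zip_pmf(k, pi, lam))
    return total

# Hypothetical data: intercept column plus one covariate
X = [[1.0, 0.0], [1.0, 1.0], [1.0, 2.0]]
y = [0, 0, 3]
print(zip_loglik([0.1, 0.4], 0.3, X, y))
```

A Bayesian analysis would combine this log-likelihood with priors on `beta` and `pi` and explore the posterior with an MCMC method.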

Bayesian control problem in multivariate mixture model (다변량 혼합모형에서 통계적 제어문제의 베이지안적 고찰)

  • 이석훈;박래현;최종석
    • The Korean Journal of Applied Statistics / v.3 no.2 / pp.27-37 / 1990
  • We consider the statistical control problem for the mixture model, in which one chooses the values of the independent variables so that the dependent variables come as close to the target values as possible. The theory suggested for the problem is reviewed, and an extended model with respect to the variance assumption and the number of dependent variables is suggested. A Bayesian treatment of the above problem is studied, with an example as an illustration.

A comparison study of Bayesian high-dimensional linear regression models (베이지안 고차원 선형 회귀분석에서의 비교연구)

  • Shin, Ju-Won;Lee, Kyoungjae
    • The Korean Journal of Applied Statistics / v.34 no.3 / pp.491-505 / 2021
  • We consider linear regression models in high-dimensional settings (p ≫ n) and compare various classes of priors. The spike and slab prior is one of the most widely used priors for Bayesian regression models, but its model space is vast, resulting in poor performance in finite samples. As an alternative, various continuous shrinkage priors, including the horseshoe prior and its variants, have been proposed. Although each of the above priors has been investigated separately, exhaustive comparative studies of their performance have rarely been conducted. In this study, we compare the spike and slab prior, the horseshoe prior, and its variants in various simulation settings. The performance of each method is assessed in terms of regression coefficient estimation and variable selection. Finally, some remarks and suggestions are given based on comprehensive simulation studies.
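
To make the contrast between the two prior classes concrete, the sketch below draws coefficient vectors from a simple spike and slab prior and from a horseshoe prior. It is only a prior simulation under assumed textbook forms (a point-mass spike at zero with a normal slab, and normal coefficients with half-Cauchy local scales); it is not the paper's simulation design, and all names and settings are hypothetical.

```python
import math
import random

def draw_spike_and_slab(p, inclusion_prob, slab_sd, rng):
    """Each coefficient is exactly 0 (spike) or drawn from N(0, slab_sd^2) (slab)."""
    return [rng.gauss(0.0, slab_sd) if rng.random() < inclusion_prob else 0.0
            for _ in range(p)]

def draw_horseshoe(p, tau, rng):
    """beta_j ~ N(0, (lam_j * tau)^2) with local scale lam_j ~ half-Cauchy(0, 1)."""
    draws = []
    for _ in range(p):
        # Inverse-CDF draw from a standard Cauchy, folded to the positive half-line
        lam = abs(math.tan(math.pi * (rng.random() - 0.5)))
        draws.append(rng.gauss(0.0, lam * tau))
    return draws

rng = random.Random(0)  # fixed seed so the illustration is reproducible
ss = draw_spike_and_slab(1000, inclusion_prob=0.1, slab_sd=1.0, rng=rng)
hs = draw_horseshoe(1000, tau=0.1, rng=rng)
print(sum(b == 0.0 for b in ss), sum(b == 0.0 for b in hs))
```

The spike and slab draw contains exact zeros, so each coefficient vector corresponds to one of 2^p models, whereas the horseshoe draw is continuous: most values are shrunk close to zero and a few are large because of the heavy Cauchy tails.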

Review of Mixed-Effect Models (혼합효과모형의 리뷰)

  • Lee, Youngjo
    • The Korean Journal of Applied Statistics / v.28 no.2 / pp.123-136 / 2015
  • Science has developed with great achievements since Galileo's discovery of laws describing relationships between observable variables. However, many natural phenomena are better explained by models that include unobservable random effects. The mixed effect model was the first statistical model to include unobservable random effects. The importance of mixed effect models is growing along with the advancement of computational technologies for inferring complicated phenomena; subsequently, mixed effect models have been extended to various statistical models such as hierarchical generalized linear models. Hierarchical likelihood has been suggested for estimating unobservable random effects. Our special issue on mixed effect models shows how they can be used in statistical problems and discusses important needs for future development. Frequentist and Bayesian approaches are also investigated.