• Title/Abstract/Keywords: Bayesian variable selection

46 results (processing time: 0.031 s)

Analysis of Client Propensity in Cyber Counseling Using Bayesian Variable Selection

  • Pi, Su-Young
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • Vol. 6, No. 4
    • /
    • pp.277-281
    • /
    • 2006
  • Cyber counseling, one of the most suitable types of consultation for the information society, enables people to reveal their mental distress and private problems anonymously, since it does not require a face-to-face interview between a counselor and a client. However, although the number of cyber counseling centers has increased sharply, few of them provide high-quality, trustworthy service. This paper therefore aims to enable appropriate consultation for each client by analyzing client propensity using Bayesian variable selection. Bayesian variable selection is superior to stepwise regression in finding a regression model. Stepwise regression, which has generally been used to analyze individual propensity in linear regression models, is inefficient because its inherent defects make it hard to select a proper model. In this paper, based on the case database of current web-based cyber counseling centers, we analyze clients' propensities using Bayesian variable selection to enable individually targeted counseling and to invigorate cyber counseling programs.

Forecasting the Baltic Dry Index (BDI) Using Bayesian Variable Selection

  • 한상우;김영민
    • 무역학회지
    • /
    • Vol. 47, No. 5
    • /
    • pp.21-37
    • /
    • 2022
  • The Baltic Dry Index (BDI) is difficult to forecast because of its high volatility and complexity. To improve BDI forecasting ability, this study applies a Bayesian variable selection method with a large number of predictors. Our estimation results, based on the BDI and all predictors from January 2000 to September 2021, indicate that the out-of-sample prediction ability of the ADL model with variable selection is superior to that of the AR model in terms of both point and density forecasting. We also find that the critical predictors for the BDI change across forecast horizons. The lagged BDI is selected as a key predictor at all forecast horizons, but commodity prices, the ClarkSea Index, and interest rates carry additional information for predicting the BDI at mid-term horizons. This implies that time variation in the predictors should be considered when predicting the BDI.

Bayesian Parameter Estimation and Variable Selection in Random Effects Generalised Linear Models for Count Data

  • Oh, Man-Suk;Park, Tae-Sung
    • Journal of the Korean Statistical Society
    • /
    • Vol. 31, No. 1
    • /
    • pp.93-107
    • /
    • 2002
  • Random effects generalised linear models are useful for analysing clustered count data in which responses are usually correlated. We propose a Bayesian approach to parameter estimation and variable selection in random effects generalised linear models for count data. A simple Gibbs sampling algorithm for parameter estimation is presented and a simple and efficient variable selection is done by using the Gibbs outputs. An illustrative example is provided.

Bayesian bi-level variable selection for genome-wide survival study

  • Eunjee Lee;Joseph G. Ibrahim;Hongtu Zhu
    • Genomics & Informatics
    • /
    • Vol. 21, No. 3
    • /
    • pp.28.1-28.13
    • /
    • 2023
  • Mild cognitive impairment (MCI) is a clinical syndrome characterized by the onset and evolution of cognitive impairments, often considered a transitional stage to Alzheimer's disease (AD). The genetic traits of MCI patients who experience a rapid progression to AD can enhance early diagnosis capabilities and facilitate drug discovery for AD. While a genome-wide association study (GWAS) is a standard tool for identifying single nucleotide polymorphisms (SNPs) related to a disease, it fails to detect SNPs with small effect sizes due to stringent control for multiple testing. Additionally, the method does not consider the group structures of SNPs, such as genes or linkage disequilibrium blocks, which can provide valuable insights into the genetic architecture. To address the limitations, we propose a Bayesian bi-level variable selection method that detects SNPs associated with time of conversion from MCI to AD. Our approach integrates group inclusion indicators into an accelerated failure time model to identify important SNP groups. Additionally, we employ data augmentation techniques to impute censored time values using a predictive posterior. We adapt Dirichlet-Laplace shrinkage priors to incorporate the group structure for SNP-level variable selection. In the simulation study, our method outperformed other competing methods regarding variable selection. The analysis of Alzheimer's Disease Neuroimaging Initiative (ADNI) data revealed several genes directly or indirectly related to AD, whereas a classical GWAS did not identify any significant SNPs.

On an Optimal Bayesian Variable Selection Method for Generalized Logit Model

  • Kim, Hea-Jung;Lee, Ae Kuoung
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 7, No. 2
    • /
    • pp.617-631
    • /
    • 2000
  • This paper suggests a Bayesian method for variable selection in the generalized logit model. It is based on the Laplace-Metropolis algorithm, which provides a simple method for estimating the marginal likelihood of the model. The algorithm then leads to a criterion for the selection of variables. The criterion is to find the subset of variables that maximizes the marginal likelihood of the model, and it is seen to be a Bayes rule in the sense that it minimizes the risk of the variable selection under a 0-1 loss function. The suggested method is illustrated with two examples and compared with existing frequentist methods.
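
The selection criterion in this abstract (choose the subset of variables with the largest marginal likelihood) can be illustrated with a minimal sketch. This is not the paper's method: instead of a generalized logit model with Laplace-Metropolis estimation, the sketch assumes a linear regression with Zellner's g-prior, for which the marginal likelihood relative to the intercept-only model has a closed form. The function names `log_bf_gprior` and `best_subset`, the choice `g = n`, and the simulated data are all assumptions made for illustration.

```python
import itertools
import numpy as np

def log_bf_gprior(y, X, subset, g):
    """Log Bayes factor of the model using columns `subset` against the
    intercept-only model, under Zellner's g-prior (linear regression)."""
    n, p = len(y), len(subset)
    if p == 0:
        return 0.0  # the null model compared with itself
    yc = y - y.mean()
    Xc = X[:, subset] - X[:, subset].mean(axis=0)
    beta, *_ = np.linalg.lstsq(Xc, yc, rcond=None)
    resid = yc - Xc @ beta
    r2 = 1.0 - (resid @ resid) / (yc @ yc)
    return 0.5 * (n - 1 - p) * np.log1p(g) - 0.5 * (n - 1) * np.log1p(g * (1.0 - r2))

def best_subset(y, X, g=None):
    """Enumerate all subsets and return the one with the largest marginal
    likelihood (equivalently, the largest Bayes factor against the null)."""
    n, k = X.shape
    g = float(n) if g is None else g  # assumed default: unit-information g
    subsets = [list(s) for r in range(k + 1)
               for s in itertools.combinations(range(k), r)]
    scores = [log_bf_gprior(y, X, s, g) for s in subsets]
    return subsets[int(np.argmax(scores))]

# Simulated data (hypothetical): only columns 0 and 2 affect the response.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))
y = 2.0 * X[:, 0] - 1.5 * X[:, 2] + rng.normal(scale=0.5, size=200)
selected = best_subset(y, X)
print("selected columns:", selected)
```

With a uniform prior over subsets, the subset with the largest marginal likelihood is also the posterior mode, which is the decision that minimizes expected 0-1 loss. Full enumeration is only practical for a small number of candidate variables.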

Laplace-Metropolis Algorithm for Variable Selection in Multinomial Logit Model

  • 김혜중;이애경
    • 품질경영학회지
    • /
    • Vol. 29, No. 1
    • /
    • pp.11-23
    • /
    • 2001
  • This paper suggests a Bayesian method for variable selection in the multinomial logit model. It is based on an optimal rule, derived from the Bayes rule, that minimizes the risk induced by selecting the multinomial logit model. The rule is to find the subset of variables that maximizes the marginal likelihood of the model. We also propose a Laplace-Metropolis algorithm as a simple method for estimating the marginal likelihood of the model. The Bayesian method is illustrated, and its efficiency examined, with two examples: one using artificial data and one using empirical data.

Bayesian Variable Selection in the Proportional Hazard Model

  • Lee, Kyeong-Eun
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 15, No. 3
    • /
    • pp.605-616
    • /
    • 2004
  • In this paper we consider proportional hazard models for survival analysis with microarray data. For a given vector of response values and gene expressions (covariates), we address the issue of how to reduce the dimension by selecting the significant genes. In our approach, rather than fixing the number of selected genes, we assign a prior distribution to this number. To implement our methodology, we use a Markov chain Monte Carlo (MCMC) method.

Simultaneous outlier detection and variable selection via difference-based regression model and stochastic search variable selection

  • Park, Jong Suk;Park, Chun Gun;Lee, Kyeong Eun
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 26, No. 2
    • /
    • pp.149-161
    • /
    • 2019
  • In this article, we suggest the following approach to simultaneous variable selection and outlier detection. First, we determine possible candidates for outliers using properties of an intercept estimator in a difference-based regression model, and the information on outliers is reflected in the multiple regression model by adding mean shift parameters. Second, we select the best model from the model that includes the outlier candidates as predictors, using stochastic search variable selection. Finally, we evaluate our method using simulations and real data analysis, which yield promising results. In addition, we need to develop our method further to make robust estimates. We will also extend it to the nonparametric regression model for simultaneous outlier detection and variable selection.

A Bayesian Method for Narrowing the Scope of Variable Selection in Binary Response t-Link Regression

  • Kim, Hea-Jung
    • Journal of the Korean Statistical Society
    • /
    • Vol. 29, No. 4
    • /
    • pp.407-422
    • /
    • 2000
  • This article is concerned with selecting the predictor variables to be included in building a class of binary response t-link regression models, where both probit and logistic regression models can be approximately taken as members of the class. It is based on a modification of the stochastic search variable selection (SSVS) method, intended to propose and develop a Bayesian procedure that uses probabilistic considerations for selecting promising subsets of predictor variables. The procedure reformulates the binary response t-link regression setup as a hierarchical truncated normal mixture model by introducing a set of hyperparameters that are used to identify subset choices. In this setup, the most promising subset of predictors can be identified as the one with the highest posterior probability in the marginal posterior distribution of the hyperparameters. To highlight the merit of the procedure, an illustrative numerical example is given.
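
The idea behind SSVS described in this abstract (indicator hyperparameters whose posterior identifies promising subsets) can be sketched as follows. This is a simplified illustration, not the paper's procedure: it assumes an ordinary linear regression rather than a binary response t-link model, uses a normal spike-and-slab mixture prior, and samples with a plain Gibbs sampler. The function name `ssvs_linear`, the hyperparameter values (`tau`, `c`, `w`, and the inverse-gamma(1, 1) prior on the error variance), and the simulated data are all assumptions.

```python
import numpy as np

def ssvs_linear(y, X, n_iter=2000, burn=500, tau=0.05, c=100.0, w=0.5, seed=0):
    """Gibbs sampler for a spike-and-slab (SSVS-style) linear regression.
    Returns the posterior inclusion frequency of each column of X."""
    rng = np.random.default_rng(seed)
    n, k = X.shape
    gamma = np.ones(k, dtype=int)      # 1 = slab (included), 0 = spike
    sigma2 = 1.0
    XtX, Xty = X.T @ X, X.T @ y
    incl = np.zeros(k)
    for it in range(n_iter):
        # beta | gamma, sigma2: multivariate normal
        prior_var = np.where(gamma == 1, (c * tau) ** 2, tau ** 2)
        cov = np.linalg.inv(XtX / sigma2 + np.diag(1.0 / prior_var))
        cov = (cov + cov.T) / 2.0      # guard against round-off asymmetry
        beta = rng.multivariate_normal(cov @ Xty / sigma2, cov)
        # sigma2 | beta: inverse gamma, with an assumed IG(1, 1) prior
        resid = y - X @ beta
        sigma2 = 1.0 / rng.gamma(1.0 + n / 2.0, 1.0 / (1.0 + resid @ resid / 2.0))
        # gamma_j | beta_j: Bernoulli, comparing slab and spike densities
        log_slab = np.log(w) - np.log(c * tau) - 0.5 * (beta / (c * tau)) ** 2
        log_spike = np.log(1.0 - w) - np.log(tau) - 0.5 * (beta / tau) ** 2
        prob = 1.0 / (1.0 + np.exp(log_spike - log_slab))
        gamma = (rng.random(k) < prob).astype(int)
        if it >= burn:
            incl += gamma
    return incl / (n_iter - burn)

# Simulated data (hypothetical): only columns 0 and 2 affect the response.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 4))
y = 2.0 * X[:, 0] - 1.5 * X[:, 2] + rng.normal(scale=0.5, size=200)
incl_prob = ssvs_linear(y, X)
print("posterior inclusion frequencies:", np.round(incl_prob, 3))
```

Columns with inclusion frequencies near 1 correspond to predictors the sampler keeps in the slab almost always. Recording the full indicator vector at each iteration, rather than only its running sum, would give the posterior probability of each subset.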

Estimation of the Korean Yield Curve via Bayesian Variable Selection

  • 구병수
    • 경제분석
    • /
    • Vol. 26, No. 1
    • /
    • pp.84-132
    • /
    • 2020
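  • Central banks infer market expectations of future yields from the yield curve, because improving the effectiveness of monetary policy requires an accurate understanding of the future yield movements that the market anticipates. Accordingly, a variety of models have been used to estimate the yield curve and market expectations accurately. At the same time, as the development of the bond market has strengthened the linkage between the bond market and the macroeconomy, identifying which macro variables affect the yield curve has become very important. However, because there are many theories of yield determination, previous studies have included different macro variables in their yield curve models. This means that there is a question of which variables a desirable model should include when estimating the yield curve, that is, model uncertainty. Against this background, this study applies Bayesian variable selection to the dynamic Nelson-Siegel model in order to estimate the yield curve and market expectations of future yields more precisely. Bayesian variable selection can be a desirable estimation method in that it mitigates the problems of deciding a priori which important variables to include in the model, and it is a comprehensive approach that efficiently incorporates model uncertainty into estimation. A comparison of the Bayesian variable selection model with the models of previous studies shows that the derived market expectations of future yields differ considerably depending on which macro variables are included in the model. This means that model uncertainty has a large effect on the estimation results and that it is appropriate to reflect it in estimation. The finding that the predictive power of the Bayesian variable selection model is superior to that of the other models from previous studies supports this. Therefore, when model uncertainty affects estimation, using the Bayesian variable selection model appears desirable for improving the accuracy of yield curve and market expectation estimates.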