• 제목/요약/키워드: Bayesian posterior

검색결과 345건 처리시간 0.026초

The inference and estimation for latent discrete outcomes with a small sample

  • Choi, Hyung;Chung, Hwan
    • Communications for Statistical Applications and Methods
    • /
    • 제23권2호
    • /
    • pp.131-146
    • /
    • 2016
  • In research on behavioral studies, significant attention has been paid to the stage-sequential process for longitudinal data. Latent class profile analysis (LCPA) is an useful method to study sequential patterns of the behavioral development by the two-step identification process: identifying a small number of latent classes at each measurement occasion and two or more homogeneous subgroups in which individuals exhibit a similar sequence of latent class membership over time. Maximum likelihood (ML) estimates for LCPA are easily obtained by expectation-maximization (EM) algorithm, and Bayesian inference can be implemented via Markov chain Monte Carlo (MCMC). However, unusual properties in the likelihood of LCPA can cause difficulties in ML and Bayesian inference as well as estimation in small samples. This article describes and addresses erratic problems that involve conventional ML and Bayesian estimates for LCPA with small samples. We argue that these problems can be alleviated with a small amount of prior input. This study evaluates the performance of likelihood and MCMC-based estimates with the proposed prior in drawing inference over repeated sampling. Our simulation shows that estimates from the proposed methods perform better than those from the conventional ML and Bayesian method.

Parameter Optimization and Uncertainty Analysis of the Rainfall-Runoff Model Coupled with Hierarchical Bayesian Inference Scheme (Hierarchical Bayesian 기법을 통한 강우-유출모형 매개변수의 최적화 및 불확실성 분석)

  • Mun, Yeong-Il;Gwon, Hyeon-Han
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 한국수자원학회 2007년도 학술발표회 논문집
    • /
    • pp.1752-1756
    • /
    • 2007
  • 정교한 강우-유출 모의를 위해서는 적절한 매개변수의 추정이 필수적이며, 매개변수 추정 방법은 시행착오(trial and error)에 의한 수동보정법과 최적화방법을 사용한 자동보정법으로 구분할 수 있다. 모형의 매개변수의 수가 많은 경우 수동보정법에 의한 매개변수 추정은 매우 어렵다. 자동 보정법에 사용되는 최적화방법은 Rosenbrock 알고리즘, patten search, 컴플렉스(complex) 방법, Powell 방법 등과 같은 지역최적화 방법과 전역최적화 방법으로 나눌 수 있다. 그러나 기존 방법론들은 매개변수의 최적화를 추적하기 위한 알고리즘이 대부분이며 이들 매개변수에 관련된 불확실성을 평가하는데는 미흡한 단접이 있다. 이러한 점에서 본 연구에서는 강우-유출모형의 매개변수 추정에 있어서 불확실성을 평가할 수 있는 새로운 방법론을 검토하고자 한다. 매개변수와 관련된 불확실성을 평가하기 위한 방법은 여러 가지가 있으나 통계적으로 매우 우수한 능력을 보이는 Hierarchical Bayesian 알고리즘을 Probability-Distributed 강우-유출 모형에 적용하였다. 본 방법론은 최적화와 동시에 각 매개변수에 관련된 사후분포(posterior distribution)의 추정이 가능하므로 모형이 갖는 불확실성을 효과적으로 평가할 수 있다. 따라서, 수자원 관리에 있어서 불확실성을 고려할 수 있으므로 보다 수리수문학적 위험도를 저감할 수 있을 것으로 판단된다.

  • PDF

A Missing Value Replacement Method for Agricultural Meteorological Data Using Bayesian Spatio-Temporal Model (농업기상 결측치 보정을 위한 통계적 시공간모형)

  • Park, Dain;Yoon, Sanghoo
    • Journal of Environmental Science International
    • /
    • 제27권7호
    • /
    • pp.499-507
    • /
    • 2018
  • Agricultural meteorological information is an important resource that affects farmers' income, food security, and agricultural conditions. Thus, such data are used in various fields that are responsible for planning, enforcing, and evaluating agricultural policies. The meteorological information obtained from automatic weather observation systems operated by rural development agencies contains missing values owing to temporary mechanical or communication deficiencies. It is known that missing values lead to reduction in the reliability and validity of the model. In this study, the hierarchical Bayesian spatio-temporal model suggests replacements for missing values because the meteorological information includes spatio-temporal correlation. The prior distribution is very important in the Bayesian approach. However, we found a problem where the spatial decay parameter was not converged through the trace plot. A suitable spatial decay parameter, estimated on the bias of root-mean-square error (RMSE), which was determined to be the difference between the predicted and observed values. The latitude, longitude, and altitude were considered as covariates. The estimated spatial decay parameters were 0.041 and 0.039, for the spatio-temporal model with latitude and longitude and for latitude, longitude, and altitude, respectively. The posterior distributions were stable after the spatial decay parameter was fixed. root mean square error (RMSE), mean absolute error (MAE), mean absolute percentage error (MAPE), and bias were calculated for model validation. Finally, the missing values were generated using the independent Gaussian process model.

Parameter Optimization and Uncertainty Analysis of the Rainfall-Runoff Model (강우-유출모형 매개변수의 최적화 및 불확실성 분석)

  • Moon, Young-Il;Kwon, Hyun-Han
    • 한국방재학회:학술대회논문집
    • /
    • 한국방재학회 2008년도 정기총회 및 학술발표대회
    • /
    • pp.723-726
    • /
    • 2008
  • It is not always easy to estimate the parameters in hydrologic models due to insufficient hydrologic data when hydraulic structures are designed or water resources plan are established, uncertainty analysis, therefore, are inevitably needed to examine reliability for the estimated results. With regard to this point, this study applies a Bayesian Markov Chain Monte Carlo scheme to the NWS-PC rainfall-runoff model that has been widely used, and a case study is performed in Soyang Dam watershed in Korea. The NWS-PC model is calibrated against observed daily runoff, and thirteen parameters in the model are optimized as well as posterior distributions associated with each parameter are derived. The Bayesian Markov Chain Monte Carlo shows a improved result in terms of statistical performance measures and graphical examination. The patterns of runoff can be influenced by various factors and the Bayesian approaches are capable of translating the uncertainties into parameter uncertainties. One could provide against an expected runoff event by utilizing information driven by Bayesian methods. Therefore, the rainfall-runoff analysis coupled with the uncertainty analysis can give us an insight in evaluating flood risk and dam size in a reasonable way.

  • PDF

Bayesian inference for an ordered multiple linear regression with skew normal errors

  • Jeong, Jeongmun;Chung, Younshik
    • Communications for Statistical Applications and Methods
    • /
    • 제27권2호
    • /
    • pp.189-199
    • /
    • 2020
  • This paper studies a Bayesian ordered multiple linear regression model with skew normal error. It is reasonable that the kind of inherent information available in an applied regression requires some constraints on the coefficients to be estimated. In addition, the assumption of normality of the errors is sometimes not appropriate in the real data. Therefore, to explain such situations more flexibly, we use the skew-normal distribution given by Sahu et al. (The Canadian Journal of Statistics, 31, 129-150, 2003) for error-terms including normal distribution. For Bayesian methodology, the Markov chain Monte Carlo method is employed to resolve complicated integration problems. Also, under the improper priors, the propriety of the associated posterior density is shown. Our Bayesian proposed model is applied to NZAPB's apple data. For model comparison between the skew normal error model and the normal error model, we use the Bayes factor and deviance information criterion given by Spiegelhalter et al. (Journal of the Royal Statistical Society Series B (Statistical Methodology), 64, 583-639, 2002). We also consider the problem of detecting an influential point concerning skewness using Bayes factors. Finally, concluding remarks are discussed.

Uncertainty Estimation of AR Model Parameters Using a Bayesian technique (Bayesian 기법을 활용한 AR Model 매개변수의 불확실성 추정)

  • Park, Chan-Young;Park, Jong-Hyeon;Park, Min-Woo;Kwon, Hyun-Han
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 한국수자원학회 2016년도 학술발표회
    • /
    • pp.280-280
    • /
    • 2016
  • 특정 자료의 시간의 흐름에 따른 예측치를 추정하는 방법으로 AR Model 즉, 자기회귀모형이 많이 사용되고 있다. AR Model은 변수의 현재 값을 과거 값의 함수로 나타내게 되는데, 이런 시계열 분석 모델을 사용할 때 매개변수의 추정 과정이 필수적으로 요구된다. 일반적으로 매개변수를 추정하는 방법에는 확률적근사법(stochastic approximation), 최소제곱법(method of least square), 자기상관법(method of autocorrelation method), 최우도법(method of maximum likelihood) 등이 있다. AR Model에서 가장 많이 사용되는 최우도법은 표본크기가 충분히 클 때 가장 효율적인 방법으로 평가되지만 수치적으로 해를 구하는 과정이 복잡한 경우가 많으며, 해를 구하지 못하는 어려움이 따르기도 한다. 또한 표본 크기가 작을 때 일반적으로 잘 일치하지 않은 결과를 얻게 된다. 우리나라의 강우, 유량 등의 자료는 자료의 수가 적은 경우가 많기 때문에 최우도법을 통한 매개변수 추정 시 불확실성이 내재되어있지만 그것을 정량적으로 제시하는데 한계가 있다. 본 연구에서는 AR Model의 매개변수 추정 시 Bayesian 기법으로 매개변수의 사후분포(posterior distribution)를 제공하여 매개변수의 불확실성 구간을 정량적으로 표현하게 됨으로써, 시계열 분석을 통해 보다 신뢰성 있는 예측치를 얻을 수 있으리라 판단된다.

  • PDF

Bayesian Rules Based Optimal Defense Strategies for Clustered WSNs

  • Zhou, Weiwei;Yu, Bin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제12권12호
    • /
    • pp.5819-5840
    • /
    • 2018
  • Considering the topology of hierarchical tree structure, each cluster in WSNs is faced with various attacks launched by malicious nodes, which include network eavesdropping, channel interference and data tampering. The existing intrusion detection algorithm does not take into consideration the resource constraints of cluster heads and sensor nodes. Due to application requirements, sensor nodes in WSNs are deployed with approximately uncorrelated security weights. In our study, a novel and versatile intrusion detection system (IDS) for the optimal defense strategy is primarily introduced. Given the flexibility that wireless communication provides, it is unreasonable to expect malicious nodes will demonstrate a fixed behavior over time. Instead, malicious nodes can dynamically update the attack strategy in response to the IDS in each game stage. Thus, a multi-stage intrusion detection game (MIDG) based on Bayesian rules is proposed. In order to formulate the solution of MIDG, an in-depth analysis on the Bayesian equilibrium is performed iteratively. Depending on the MIDG theoretical analysis, the optimal behaviors of rational attackers and defenders are derived and calculated accurately. The numerical experimental results validate the effectiveness and robustness of the proposed scheme.

Bayesian Conway-Maxwell-Poisson (CMP) regression for longitudinal count data

  • Morshed Alam ;Yeongjin Gwon ;Jane Meza
    • Communications for Statistical Applications and Methods
    • /
    • 제30권3호
    • /
    • pp.291-309
    • /
    • 2023
  • Longitudinal count data has been widely collected in biomedical research, public health, and clinical trials. These repeated measurements over time on the same subjects need to account for an appropriate dependency. The Poisson regression model is the first choice to model the expected count of interest, however, this may not be an appropriate when data exhibit over-dispersion or under-dispersion. Recently, Conway-Maxwell-Poisson (CMP) distribution is popularly used as the distribution offers a flexibility to capture a wide range of dispersion in the data. In this article, we propose a Bayesian CMP regression model to accommodate over and under-dispersion in modeling longitudinal count data. Specifically, we develop a regression model with random intercept and slope to capture subject heterogeneity and estimate covariate effects to be different across subjects. We implement a Bayesian computation via Hamiltonian MCMC (HMCMC) algorithm for posterior sampling. We then compute Bayesian model assessment measures for model comparison. Simulation studies are conducted to assess the accuracy and effectiveness of our methodology. The usefulness of the proposed methodology is demonstrated by a well-known example of epilepsy data.

Analyze the parameter uncertainty of SURR model using Bayesian Markov Chain Monte Carlo method with informal likelihood functions

  • Duyen, Nguyen Thi;Nguyen, Duc Hai;Bae, Deg-Hyo
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 한국수자원학회 2021년도 학술발표회
    • /
    • pp.127-127
    • /
    • 2021
  • In order to estimate parameter uncertainty of hydrological models, the consideration of the likelihood functions which provide reliable parameters of model is necessary. In this study, the Bayesian Markov Chain Monte Carlo (MCMC) method with informal likelihood functions is used to analyze the uncertainty of parameters of the SURR model for estimating the hourly streamflow of Gunnam station of Imjin basin, Korea. Three events were used to calibrate and one event was used to validate the posterior distributions of parameters. Moreover, the performance of four informal likelihood functions (Nash-Sutcliffe efficiency, Normalized absolute error, Index of agreement, and Chiew-McMahon efficiency) on uncertainty of parameter is assessed. The indicators used to assess the uncertainty of the streamflow simulation were P-factor (percentage of observed streamflow included in the uncertainty interval) and R-factor (the average width of the uncertainty interval). The results showed that the sensitivities of parameters strongly depend on the likelihood functions and vary for different likelihood functions. The uncertainty bounds illustrated the slight differences from various likelihood functions. This study confirms the importance of the likelihood function selection in the application of Bayesian MCMC to the uncertainty assessment of the SURR model.

  • PDF

Bayesian Survival Estimation of Pareto Distribution of the Second Kind Based on Type II Censored Data

  • Kim, Dal-Ho;Lee, Woo-Dong;Kang, Sang-Gil
    • Communications for Statistical Applications and Methods
    • /
    • 제12권3호
    • /
    • pp.729-742
    • /
    • 2005
  • In this paper, we discuss the propriety of the various noninformative priors for the Pareto distribution. The reference prior, Jeffreys prior and ad hoc noninformative prior which is used in several literatures will be introduced and showed that which prior gives the proper posterior distribution. The reference prior and Jeffreys prior give a proper posterior distribution, but ad hoc noninformative prior which is proportional to reciprocal of the parameters does not give a proper posterior. To compute survival function, we use the well-known approximation method proposed by Lindley (1980) and Tireney and Kadane (1986). And two methods are compared by simulation. A real data example is given to illustrate our methodology.