• Title/Summary/Keyword: Posterior inference

Search Result 91, Processing Time 0.022 seconds

A study on MERS-CoV outbreak in Korea using Bayesian negative binomial branching processes (베이지안 음이항 분기과정을 이용한 한국 메르스 발생 연구)

  • Park, Yuha;Choi, Ilsu
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.1
    • /
    • pp.153-161
    • /
    • 2017
  • Branching processes which is used for epidemic dispersion as stochastic process model have advantages to estimate parameters by real data. We have to estimate both mean and dispersion parameter in order to use the negative binomial distribution as an offspring distribution on branching processes. In existing studies on biology and epidemiology, it is estimated using maximum-likelihood methods. However, for most of epidemic data, it is hard to get the best precision of maximum-likelihood estimator. We suggest a Bayesian inference that have good properties of statistics for small-sample. After estimating dispersion parameter we modelled the posterior distribution for 2015 Korea MERS cases. As the result, we found that the estimated dispersion parameter is relatively stable no matter how we assume prior distribution. We also computed extinction probabilities on branching processes using estimated dispersion parameters.

Genetic parameters for daily milk somatic cell score and relationships with yield traits of primiparous Holstein cattle in Iran

  • Kheirabadi, Khabat;Razmkabir, Mohammad
    • Journal of Animal Science and Technology
    • /
    • v.58 no.10
    • /
    • pp.38.1-38.6
    • /
    • 2016
  • Background: Despite the importance of relationships between somatic cell score (SCS) and currently selected traits (milk, fat and protein yield) of Holstein cows, there was a lack of comprehensive literature for it in Iran. Therefore we tried to examine heritabilities and relationships between these traits using a fixed-regression animal model and Bayesian inference. The data set consisted of 1,078,966 test-day observations from 146,765 primiparous daughters of 1930 sires, with calvings from 2002 to 2013. Results: Marginal posterior means of heritability estimates for SCS ($0.03{\pm}0.002$) were distinctly lower than those for milk ($0.204{\pm}0.006$), fat ($0.096{\pm}0.004$) and protein ($0.147{\pm}0.005$) yields. In the case of phenotypic correlations, the relationships between production and SCS were near zero at the beginning of lactation but become increasingly negative as days in milk increased. Although all environmental correlations between production and SCS were negative ($-0.177{\pm}0.007$, $-0.165{\pm}0.008$ and $-0.152{\pm}0.007$ between SCS and milk, fat, and protein yield, respectively), slightly antagonistic genetic correlations were found; with posterior mean of relationships ranging from $0.01{\pm}0.039$ to $0.11{\pm}0.036$. This genetic opposition was distinctly higher for protein than for fat. Conclusion: Although small, the positive genetic correlations suggest some genetic antagonism between desired increased milk production and reduced SCS (i.e., single-trait selection for increased milk production will also increase SCS).

A Study on Bayesian Approach of Software Stochastic Reliability Superposition Model using General Order Statistics (일반 순서 통계량을 이용한 소프트웨어 신뢰확률 중첩모형에 관한 베이지안 접근에 관한 연구)

  • Lee, Byeong-Su;Kim, Hui-Cheol;Baek, Su-Gi;Jeong, Gwan-Hui;Yun, Ju-Yong
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.8
    • /
    • pp.2060-2071
    • /
    • 1999
  • The complicate software failure system is defined to the superposition of the points of failure from several component point process. Because the likelihood function is difficulty in computing, we consider Gibbs sampler using iteration sampling based method. For each observed failure epoch, we applied to latent variables that indicates with component of the superposition mode. For model selection, we explored the posterior Bayesian criterion and the sum of relative errors for the comparison simple pattern with superposition model. A numerical example with NHPP simulated data set applies the thinning method proposed by Lewis and Shedler[25] is given, we consider Goel-Okumoto model and Weibull model with GOS, inference of parameter is studied. Using the posterior Bayesian criterion and the sum of relative errors, as we would expect, the superposition model is best on model under diffuse priors.

  • PDF

Operational modal analysis of Canton Tower by a fast frequency domain Bayesian method

  • Zhang, Feng-Liang;Ni, Yi-Qing;Ni, Yan-Chun;Wang, You-Wu
    • Smart Structures and Systems
    • /
    • v.17 no.2
    • /
    • pp.209-230
    • /
    • 2016
  • The Canton Tower is a high-rise slender structure with a height of 610 m. A structural health monitoring system has been instrumented on the structure, by which data is continuously monitored. This paper presents an investigation on the identified modal properties of the Canton Tower using ambient vibration data collected during a whole day (24 hours). A recently developed Fast Bayesian FFT method is utilized for operational modal analysis on the basis of the measured acceleration data. The approach views modal identification as an inference problem where probability is used as a measure for the relative plausibility of outcomes given a model of the structure and measured data. Focusing on the first several modes, the modal properties of this supertall slender structure are identified on non-overlapping time windows during the whole day under normal wind speed. With the identified modal parameters and the associated posterior uncertainty, the distribution of the modal parameters in the future is predicted and assessed. By defining the modal root-mean-square value in terms of the power spectral density of modal force identified, the identified natural frequencies and damping ratios versus the vibration amplitude are investigated with the associated posterior uncertainty considered. Meanwhile, the correlations between modal parameters and temperature, modal parameters and wind speed are studied. For comparison purpose, the frequency domain decomposition (FDD) method is also utilized to identify the modal parameters. The identified results obtained by the Bayesian method, the FDD method and a finite element model are compared and discussed.

Concept of Trend Analysis of Hydrologic Extreme Variables and Nonstationary Frequency Analysis (극치수문자료의 경향성 분석 개념 및 비정상성 빈도해석)

  • Lee, Jeong-Ju;Kwon, Hyun-Han;Kim, Tae-Woong
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.30 no.4B
    • /
    • pp.389-397
    • /
    • 2010
  • This study introduced a Bayesian based frequency analysis in which the statistical trend analysis for hydrologic extreme series is incorporated. The proposed model employed Gumbel extreme distribution to characterize extreme events and a fully coupled bayesian frequency model was finally utilized to estimate design rainfalls in Seoul. Posterior distributions of the model parameters in both Gumbel distribution and trend analysis were updated through Markov Chain Monte Carlo Simulation mainly utilizing Gibbs sampler. This study proposed a way to make use of nonstationary frequency model for dynamic risk analysis, and showed an increase of hydrologic risk with time varying probability density functions. The proposed study showed advantage in assessing statistical significance of parameters associated with trend analysis through statistical inference utilizing derived posterior distributions.

Survival Analysis for White Non-Hispanic Female Breast Cancer Patients

  • Khan, Hafiz Mohammad Rafiqullah;Saxena, Anshul;Gabbidon, Kemesha;Stewart, Tiffanie Shauna-Jeanne;Bhatt, Chintan
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.9
    • /
    • pp.4049-4054
    • /
    • 2014
  • Background: Race and ethnicity are significant factors in predicting survival time of breast cancer patients. In this study, we applied advanced statistical methods to predict the survival of White non-Hispanic female breast cancer patients, who were diagnosed between the years 1973 and 2009 in the United States (U.S.). Materials and Methods: Demographic data from the Surveillance Epidemiology and End Results (SEER) database were used for the purpose of this study. Nine states were randomly selected from 12 U.S. cancer registries. A stratified random sampling method was used to select 2,000 female breast cancer patients from these nine states. We compared four types of advanced statistical probability models to identify the best-fit model for the White non-Hispanic female breast cancer survival data. Three model building criterion were used to measure and compare goodness of fit of the models. These include Akaike Information Criteria (AIC), Bayesian Information Criteria (BIC), and Deviance Information Criteria (DIC). In addition, we used a novel Bayesian method and the Markov Chain Monte Carlo technique to determine the posterior density function of the parameters. After evaluating the model parameters, we selected the model having the lowest DIC value. Using this Bayesian method, we derived the predictive survival density for future survival time and its related inferences. Results: The analytical sample of White non-Hispanic women included 2,000 breast cancer cases from the SEER database (1973-2009). The majority of cases were married (55.2%), the mean age of diagnosis was 63.61 years (SD = 14.24) and the mean survival time was 84 months (SD = 35.01). After comparing the four statistical models, results suggested that the exponentiated Weibull model (DIC= 19818.220) was a better fit for White non-Hispanic females' breast cancer survival data. This model predicted the survival times (in months) for White non-Hispanic women after implementation of precise estimates of the model parameters. Conclusions: By using modern model building criteria, we determined that the data best fit the exponentiated Weibull model. We incorporated precise estimates of the parameter into the predictive model and evaluated the survival inference for the White non-Hispanic female population. This method of analysis will assist researchers in making scientific and clinical conclusions when assessing survival time of breast cancer patients.

Recurrent Neural Network Modeling of Etch Tool Data: a Preliminary for Fault Inference via Bayesian Networks

  • Nawaz, Javeria;Arshad, Muhammad Zeeshan;Park, Jin-Su;Shin, Sung-Won;Hong, Sang-Jeen
    • Proceedings of the Korean Vacuum Society Conference
    • /
    • 2012.02a
    • /
    • pp.239-240
    • /
    • 2012
  • With advancements in semiconductor device technologies, manufacturing processes are getting more complex and it became more difficult to maintain tighter process control. As the number of processing step increased for fabricating complex chip structure, potential fault inducing factors are prevail and their allowable margins are continuously reduced. Therefore, one of the key to success in semiconductor manufacturing is highly accurate and fast fault detection and classification at each stage to reduce any undesired variation and identify the cause of the fault. Sensors in the equipment are used to monitor the state of the process. The idea is that whenever there is a fault in the process, it appears as some variation in the output from any of the sensors monitoring the process. These sensors may refer to information about pressure, RF power or gas flow and etc. in the equipment. By relating the data from these sensors to the process condition, any abnormality in the process can be identified, but it still holds some degree of certainty. Our hypothesis in this research is to capture the features of equipment condition data from healthy process library. We can use the health data as a reference for upcoming processes and this is made possible by mathematically modeling of the acquired data. In this work we demonstrate the use of recurrent neural network (RNN) has been used. RNN is a dynamic neural network that makes the output as a function of previous inputs. In our case we have etch equipment tool set data, consisting of 22 parameters and 9 runs. This data was first synchronized using the Dynamic Time Warping (DTW) algorithm. The synchronized data from the sensors in the form of time series is then provided to RNN which trains and restructures itself according to the input and then predicts a value, one step ahead in time, which depends on the past values of data. Eight runs of process data were used to train the network, while in order to check the performance of the network, one run was used as a test input. Next, a mean squared error based probability generating function was used to assign probability of fault in each parameter by comparing the predicted and actual values of the data. In the future we will make use of the Bayesian Networks to classify the detected faults. Bayesian Networks use directed acyclic graphs that relate different parameters through their conditional dependencies in order to find inference among them. The relationships between parameters from the data will be used to generate the structure of Bayesian Network and then posterior probability of different faults will be calculated using inference algorithms.

  • PDF

Parameter Optimization and Uncertainty Analysis of the Rainfall-Runoff Model Coupled with Hierarchical Bayesian Inference Scheme (Hierarchical Bayesian 기법을 통한 강우-유출모형 매개변수의 최적화 및 불확실성 분석)

  • Mun, Yeong-Il;Gwon, Hyeon-Han
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2007.05a
    • /
    • pp.1752-1756
    • /
    • 2007
  • 정교한 강우-유출 모의를 위해서는 적절한 매개변수의 추정이 필수적이며, 매개변수 추정 방법은 시행착오(trial and error)에 의한 수동보정법과 최적화방법을 사용한 자동보정법으로 구분할 수 있다. 모형의 매개변수의 수가 많은 경우 수동보정법에 의한 매개변수 추정은 매우 어렵다. 자동 보정법에 사용되는 최적화방법은 Rosenbrock 알고리즘, patten search, 컴플렉스(complex) 방법, Powell 방법 등과 같은 지역최적화 방법과 전역최적화 방법으로 나눌 수 있다. 그러나 기존 방법론들은 매개변수의 최적화를 추적하기 위한 알고리즘이 대부분이며 이들 매개변수에 관련된 불확실성을 평가하는데는 미흡한 단접이 있다. 이러한 점에서 본 연구에서는 강우-유출모형의 매개변수 추정에 있어서 불확실성을 평가할 수 있는 새로운 방법론을 검토하고자 한다. 매개변수와 관련된 불확실성을 평가하기 위한 방법은 여러 가지가 있으나 통계적으로 매우 우수한 능력을 보이는 Hierarchical Bayesian 알고리즘을 Probability-Distributed 강우-유출 모형에 적용하였다. 본 방법론은 최적화와 동시에 각 매개변수에 관련된 사후분포(posterior distribution)의 추정이 가능하므로 모형이 갖는 불확실성을 효과적으로 평가할 수 있다. 따라서, 수자원 관리에 있어서 불확실성을 고려할 수 있으므로 보다 수리수문학적 위험도를 저감할 수 있을 것으로 판단된다.

  • PDF

Bayesian parameter estimation and prediction in NHPP software reliability growth model (NHPP소프트웨어 신뢰도 성장모형에서 베이지안 모수추정과 예측)

  • Chang, Inhong;Jung, Deokhwan;Lee, Seungwoo;Song, Kwangyoon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.4
    • /
    • pp.755-762
    • /
    • 2013
  • In this paper we consider the NHPP software reliability model. And we deal with the maximum likelihood estimation and the Bayesian estimation with conjugate prior for parameter inference in the mean value function of Goel-Okumoto model (1979). The parameter estimates for the proposed model is presented by MLE and Bayes estimator in data set. We compare the predicted number of faults with the actual data set using the proposed mean value function.

Bayesian inference for an ordered multiple linear regression with skew normal errors

  • Jeong, Jeongmun;Chung, Younshik
    • Communications for Statistical Applications and Methods
    • /
    • v.27 no.2
    • /
    • pp.189-199
    • /
    • 2020
  • This paper studies a Bayesian ordered multiple linear regression model with skew normal error. It is reasonable that the kind of inherent information available in an applied regression requires some constraints on the coefficients to be estimated. In addition, the assumption of normality of the errors is sometimes not appropriate in the real data. Therefore, to explain such situations more flexibly, we use the skew-normal distribution given by Sahu et al. (The Canadian Journal of Statistics, 31, 129-150, 2003) for error-terms including normal distribution. For Bayesian methodology, the Markov chain Monte Carlo method is employed to resolve complicated integration problems. Also, under the improper priors, the propriety of the associated posterior density is shown. Our Bayesian proposed model is applied to NZAPB's apple data. For model comparison between the skew normal error model and the normal error model, we use the Bayes factor and deviance information criterion given by Spiegelhalter et al. (Journal of the Royal Statistical Society Series B (Statistical Methodology), 64, 583-639, 2002). We also consider the problem of detecting an influential point concerning skewness using Bayes factors. Finally, concluding remarks are discussed.