• Title/Summary/Keyword: Bayesian p-value

Search Result 23, Processing Time 0.02 seconds

Bayesian Inference for Autoregressive Models with Skewed Exponential Power Errors (비대칭 지수멱 오차를 가지는 자기회귀모형에서의 베이지안 추론)

  • Ryu, Hyunnam;Kim, Dal Ho
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.6
    • /
    • pp.1039-1047
    • /
    • 2014
  • An autoregressive model with normal errors is a natural model that attempts to fit time series data. More flexible models that include normal distribution as a special case are necessary because they can cover normality to non-normality models. The skewed exponential power distribution is a possible candidate for autoregressive models errors that may have tails lighter(platykurtic) or heavier(leptokurtic) than normal and skewness; in addition, the use of skewed exponential power distribution can reduce the influence of outliers and consequently increases the robustness of the analysis. We use SIR algorithm and grid method for an efficient Bayesian estimation.

Application of Pharmacovigilance Methods in Occupational Health Surveillance: Comparison of Seven Disproportionality Metrics

  • Bonneterre, Vincent;Bicout, Dominique Joseph;De Gaudemaris, Regis
    • Safety and Health at Work
    • /
    • v.3 no.2
    • /
    • pp.92-100
    • /
    • 2012
  • Objectives: The French National Occupational Diseases Surveillance and Prevention Network (RNV3P) is a French network of occupational disease specialists, which collects, in standardised coded reports, all cases where a physician of any specialty, referred a patient to a university occupational disease centre, to establish the relation between the disease observed and occupational exposures, independently of statutory considerations related to compensation. The objective is to compare the relevance of disproportionality measures, widely used in pharmacovigilance, for the detection of potentially new disease ${\times}$ exposure associations in RNV3P database (by analogy with the detection of potentially new health event ${\times}$ drug associations in the spontaneous reporting databases from pharmacovigilance). Methods: 2001-2009 data from RNV3P are used (81,132 observations leading to 11,627 disease ${\times}$ exposure associations). The structure of RNV3P database is compared with the ones of pharmacovigilance databases. Seven disproportionality metrics are tested and their results, notably in terms of ranking the disease ${\times}$ exposure associations, are compared. Results: RNV3P and pharmacovigilance databases showed similar structure. Frequentist methods (proportional reporting ratio [PRR], reporting odds ratio [ROR]) and a Bayesian one (known as BCPNN for "Bayesian Confidence Propagation Neural Network") show a rather similar behaviour on our data, conversely to other methods (as Poisson). Finally the PRR method was chosen, because more complex methods did not show a greater value with the RNV3P data. Accordingly, a procedure for detecting signals with PRR method, automatic triage for exclusion of associations already known, and then investigating these signals is suggested. Conclusion: This procedure may be seen as a first step of hypothesis generation before launching epidemiological and/or experimental studies.

Updated confidence intervals for the COVID-19 antibody retention rate in the Korean population

  • Kamruzzaman, Md.;Apio, Catherine;Park, Taesung
    • Genomics & Informatics
    • /
    • v.18 no.4
    • /
    • pp.45.1-45.5
    • /
    • 2020
  • With the ongoing rise of coronavirus disease 2019 (COVID-19) pandemic across the globe, interests in COVID-19 antibody testing, also known as a serology test has grown, as a way to measure how far the infection has spread in the population and to identify individuals who may be immune. Recently, many countries reported their population based antibody titer study results. South Korea recently reported their third antibody formation rate, where it divided the study between the general population and the young male youths in their early twenties. As previously stated, these simple point estimates may be misinterpreted without proper estimation of standard error and confidence intervals. In this article, we provide an updated 95% confidence intervals for COVID-19 antibody formation rate for the Korean population using asymptotic, exact and Bayesian statistical estimation methods. As before, we found that the Wald method gives the narrowest interval among all asymptotic methods whereas mid p-value gives the narrowest among all exact methods and Jeffrey's method gives the narrowest from Bayesian method. The most conservative 95% confidence interval estimation shows that as of 00:00 November 23, 2020, at least 69,524 people were infected but not confirmed. It also shows that more positive cases were found among the young male in their twenties (0.22%), three times that of the general public (0.051%). This thereby calls for the quarantine authorities' need to strengthen quarantine managements for the early twenties in order to find the hidden infected people in the population.

Confidence intervals for the COVID-19 neutralizing antibody retention rate in the Korean population

  • Apio, Catherine;Kamruzzaman, Md.;Park, Taesung
    • Genomics & Informatics
    • /
    • v.18 no.3
    • /
    • pp.31.1-31.8
    • /
    • 2020
  • The coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has become a global pandemic. No specific therapeutic agents or vaccines for COVID-19 are available, though several antiviral drugs, are under investigation as treatment agents for COVID-19. The use of convalescent plasma transfusion that contain neutralizing antibodies for COVID-19 has become the major focus. This requires mass screening of populations for these antibodies. While several countries started reporting population based antibody rate, its simple point estimate may be misinterpreted without proper estimation of standard error and confidence intervals. In this paper, we review the importance of antibody studies and present the 95% confidence intervals COVID-19 antibody rate for the Korean population using two recently performed antibody tests in Korea. Due to the sparsity of data, the estimation of confidence interval is a big challenge. Thus, we consider several confidence intervals using Asymptotic, Exact and Bayesian estimation methods. In this article, we found that the Wald method gives the narrowest interval among all Asymptotic methods whereas mid p-value gives the narrowest among all Exact methods and Jeffrey's method gives the narrowest from Bayesian method. The most conservative 95% confidence interval estimation shows that as of 00:00 on September 15, 2020, at least 32,602 people were infected but not confirmed in Korea.

A genome-wide association study on growth traits of Korean commercial pig breeds using Bayesian methods

  • Jong Hyun Jung;Sang Min Lee;Sang-Hyon Oh
    • Animal Bioscience
    • /
    • v.37 no.5
    • /
    • pp.807-816
    • /
    • 2024
  • Objective: This study aims to identify the significant regions and candidate genes of growth-related traits (adjusted backfat thickness [ABF], average daily gain [ADG], and days to 90 kg [DAYS90]) in Korean commercial GGP pig (Duroc, Landrace, and Yorkshire) populations. Methods: A genome-wide association study (GWAS) was performed using single-nucleotide polymorphism (SNP) markers for imputation to Illumina PorcineSNP60. The BayesB method was applied to calculate thresholds for the significance of SNP markers. The identified windows were considered significant if they explained ≥1% genetic variance. Results: A total of 28 window regions were related to genetic growth effects. Bayesian GWAS revealed 28 significant genetic regions including 52 informative SNPs associated with growth traits (ABF, ADG, DAYS90) in Duroc, Landrace, and Yorkshire pigs, with genetic variance ranging from 1.00% to 5.46%. Additionally, 14 candidate genes with previous functional validation were identified for these traits. Conclusion: The identified SNPs within these regions hold potential value for future marker-assisted or genomic selection in pig breeding programs. Consequently, they contribute to an improved understanding of genetic architecture and our ability to genetically enhance pigs. SNPs within the identified regions could prove valuable for future marker-assisted or genomic selection in pig breeding programs.

Comparison of genome-wide association and genomic prediction methods for milk production traits in Korean Holstein cattle

  • Lee, SeokHyun;Dang, ChangGwon;Choy, YunHo;Do, ChangHee;Cho, Kwanghyun;Kim, Jongjoo;Kim, Yousam;Lee, Jungjae
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.32 no.7
    • /
    • pp.913-921
    • /
    • 2019
  • Objective: The objectives of this study were to compare identified informative regions through two genome-wide association study (GWAS) approaches and determine the accuracy and bias of the direct genomic value (DGV) for milk production traits in Korean Holstein cattle, using two genomic prediction approaches: single-step genomic best linear unbiased prediction (ss-GBLUP) and Bayesian Bayes-B. Methods: Records on production traits such as adjusted 305-day milk (MY305), fat (FY305), and protein (PY305) yields were collected from 265,271 first parity cows. After quality control, 50,765 single-nucleotide polymorphic genotypes were available for analysis. In GWAS for ss-GBLUP (ssGWAS) and Bayes-B (BayesGWAS), the proportion of genetic variance for each 1-Mb genomic window was calculated and used to identify informative genomic regions. Accuracy of the DGV was estimated by a five-fold cross-validation with random clustering. As a measure of accuracy for DGV, we also assessed the correlation between DGV and deregressed-estimated breeding value (DEBV). The bias of DGV for each method was obtained by determining regression coefficients. Results: A total of nine and five significant windows (1 Mb) were identified for MY305 using ssGWAS and BayesGWAS, respectively. Using ssGWAS and BayesGWAS, we also detected multiple significant regions for FY305 (12 and 7) and PY305 (14 and 2), respectively. Both single-step DGV and Bayes DGV also showed somewhat moderate accuracy ranges for MY305 (0.32 to 0.34), FY305 (0.37 to 0.39), and PY305 (0.35 to 0.36) traits, respectively. The mean biases of DGVs determined using the single-step and Bayesian methods were $1.50{\pm}0.21$ and $1.18{\pm}0.26$ for MY305, $1.75{\pm}0.33$ and $1.14{\pm}0.20$ for FY305, and $1.59{\pm}0.20$ and $1.14{\pm}0.15$ for PY305, respectively. Conclusion: From the bias perspective, we believe that genomic selection based on the application of Bayesian approaches would be more suitable than application of ss-GBLUP in Korean Holstein populations.

A Study on Characteristics and Predictions of Seasonal Chlorophyll-a using Bayseian Regression in Paldang Watershed (베이지안 추정을 이용한 팔당호 유역의 계절별 클로로필a 예측 및 오염특성 연구)

  • Kim, Mi-Ah;Shin, Yuna;Kim, Kyunghyun;Heo, Tae-Young;Yoo, Moonkyu;Lee, Su-Woong
    • Journal of Korean Society on Water Environment
    • /
    • v.29 no.6
    • /
    • pp.832-841
    • /
    • 2013
  • In recent years, eutrophication in the Paldang Lake has become one of the major environmental problems in Korea as it may threaten drinking water safety and human health. Thus it is important to understand the phenomena and predict the time and magnitude of algal blooms for applying adequate algal reduction measures. This study performed seasonal water quality assessment and chlorophyll-a prediction using Bayseian simple/multiple linear regression analysis. Bayseian regression analysis could be a useful tool to overcome limitations of conventional regression analysis. Also it can consider uncertainty in prediction by using posterior distribution. Generally, chlorophyll-a of a P2(Paldang Dam 2) site showed high concentration in spring and it was similar to that of P4(Paldang Dam 4) site. For the development of Bayseian model, we performed seasonal correlation. As a result, chlorophyll-a of a P2 site had a high correlation with P5(Paldang Dam 5) site in spring (r = 0.786, p<0.05) and with P4 in winter (r = 0.843, p<0.05). Based on the DIC (Deviance Information Criterion) value, critical explanatory variables of the best fitting Bayesian linear regression model were selected as a $PO_4-P$ (P2), Chlorophyll-a (P5) in spring, $NH_3-N$ (P2), Chlorophyll-a (P4), $NH_3-N$ (P4) in summer, DTP (P2), outflow (P2), TP (P3), TP (P4) fall, COD (P2), Chl-a (P4) and COD (P4) in winter. The results of chlorophyll-a prediction showed relatively high $R^2$ and low RMSE values in summer and winter.

Analysis of Molecular Variance and Population Structure of Sesame (Sesamum indicum L.) Genotypes Using Simple Sequence Repeat Markers

  • Asekova, Sovetgul;Kulkarni, Krishnanand P.;Oh, Ki Won;Lee, Myung-Hee;Oh, Eunyoung;Kim, Jung-In;Yeo, Un-Sang;Pae, Suk-Bok;Ha, Tae Joung;Kim, Sung Up
    • Plant Breeding and Biotechnology
    • /
    • v.6 no.4
    • /
    • pp.321-336
    • /
    • 2018
  • Sesame (Sesamum indicum L.) is an important oilseed crop grown in tropical and subtropical areas. The objective of this study was to investigate the genetic relationships among 129 sesame landraces and cultivars using simple sequence repeat (SSR) markers. Out of 70 SSRs, 23 were found to be informative and produced 157 alleles. The number of alleles per locus ranged from 3 - 14, whereas polymorphic information content ranged from 0.33 - 0.86. A distance-based phylogenetic analysis revealed two major and six minor clusters. The population structure analysis using a Bayesian model-based program in STRUCTURE 2.3.4 divided 129 sesame accessions into three major populations (K = 3). Based on pairwise comparison estimates, Pop1 was observed to be genetically close to Pop2 with $F_{ST}$ value of 0.15, while Pop2 and Pop3 were genetically closest with $F_{ST}$ value of 0.08. Analysis of molecular variance revealed a high percentage of variability among individuals within populations (85.84%) than among the populations (14.16%). Similarly, a high variance was observed among the individuals within the country of origins (90.45%) than between the countries of origins. The grouping of genotypes in clusters was not related to their geographic origin indicating considerable gene flow among sesame genotypes across the selected geographic regions. The SSR markers used in the present study were able to distinguish closely linked sesame genotypes, thereby showing their usefulness in assessing the potentially important source of genetic variation. These markers can be used for future sesame varietal classification, conservation, and other breeding purposes.

Genetic Diversity and Structure of the Korean Endemic Species, Coreanomecon hylomeconoides Nakai, as Revealed by ISSR markers (한국 특산식물 매미꽃(Coreanomecon hylomeconoides Nakai) 집단의 유전다양성 및 구조)

  • Son, Sung-Won;Chung, Jae-Min;Kim, Eun-Hye;Choi, Kyoung-Su;Park, SeonJoo
    • Korean Journal of Plant Resources
    • /
    • v.26 no.2
    • /
    • pp.310-319
    • /
    • 2013
  • The genetic diversity and structure of eight populations of Coreanomecon hylomeconoides Nakai, an endemic Korean plant, were investigated using 50 ISSR loci from eight primers. The average percentage of polymorphic loci was 47.3%. The Shannon's index (SI=0.218) and gene diversity (h=0.142) were relatively lower than those of other long-lived perennials. The Sancheong (SI=0.233, h=0153), Gwangyang (SI=0.263, h=0.171), and Suncheon (SI=0.241, h=0.159) populations showed greater genetic diversity than the Namhae and Gwangju populations, which are on the edge of the distribution. Analysis of molecular variance (AMOVA) showed that 18% of the total variation could be attributed to differences among populations, and 82% to differences within populations, indicating moderate gene flow among adjacent populations. These results were supported by value of Nm (2.184). The UPGMA conducted using the genetic distance and Bayesian cluster analysis showed a remarkable geographic trend structured into east and west regions. Overall, the results indicate that the Sancheong and Gwangyang populations, which had a large population size and higher degree of genetic diversity, should be the focus of in situ conservation.

Total bilirubin level as a biomarker for dampness-heat differentiation in traditional Korean treatment for jaundice

  • Sohn, Ki Cheul;Jung, Hyun-Jung;Lee, A-Jin;Kim, Sang-Gyung;Shin, ImHee;Kwak, Sang Gyu
    • The Journal of Korean Medicine
    • /
    • v.34 no.4
    • /
    • pp.46-55
    • /
    • 2013
  • Objectives: Classifying the pattern of jaundice during diagnosis will significantly improve the outcome of common KM interventions. This study aimed at determining an objective index for accurately diagnosing heat and dampness KM patterns in patients with jaundice. Methods: We systematically reviewed laboratory findings from case reports published in the scientific literature of Korean medicine. Cases were classified as following either the heat or dampness pattern. Biochemical indices were compared using a Bayesian factor (BF) analysis and standard t-tests. Results: The laboratory findings of 32 patients were evaluated. The heat pattern was observed in 17 patients and the dampness pattern in 15. No significant differences were observed between the 2 groups in terms of white blood cell count (BF=1.659); hemoglobin concentration (BF=2.627); platelet count (BF=1.019); or levels of direct bilirubin (BF=1.453), aspartate aminotransferase (BF=1.226), alanine aminotransferase (BF=1.340), alkaline phosphatase (BF=2.344), or gamma-glutamyl transpeptidase (BF=2.782). However, total bilirubin levels were significantly higher in the dampness pattern group (BF=0.854, P-value=0.070). Conclusions: Patients with high total bilirubin levels may predominantly follow the dampness pattern, while those with low levels may predominantly follow the heat pattern. These results are expected to be useful for the development of timely and efficient KM treatments as well as new integrative therapeutic approaches for jaundice. However, further studies are essential to fully validate the utility of total bilirubin as a biomarker for differentiating between heat and dampness patterns.