• Title/Summary/Keyword: 이항자료 (binomial data)

Fitting Bivariate Generalized Binomial Models of the Sarmanov Type (Sarmanov형 이변량 일반화이항모형의 적합)

  • Lee, Joo-Yong;Kim, Kee-Young
    • The Korean Journal of Applied Statistics / v.22 no.2 / pp.271-280 / 2009
  • For bivariate binomial data with both intra- and inter-class correlation, Danaher and Hardie (2005) proposed a bivariate beta-binomial model. However, the model is limited to situations where the intra-class correlation is strictly positive, so it may be seriously inadequate for data with a negative intra-class correlation. Several authors have considered generalized binomial distributions that cover a wider range of intra-class correlation and so relax this restriction. These include the additive/multiplicative binomial models and the beta/extended beta-binomial models. In this study, bivariate models of the Sarmanov (1966) type are formed by combining each of those univariate models to account for the inter-class correlation, and are evaluated in terms of goodness-of-fit. As a result, B-mB and B-ebB are fitted successfully to real data, and B-mB, which has a wider permissible range for the intra-class correlation than B-ebB, is relatively preferred.
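
The positivity restriction mentioned in this abstract can be seen in the standard univariate beta-binomial, whose intra-class correlation is 1/(a + b + 1) and therefore always positive. The sketch below is only an illustration of that univariate building block, not the Sarmanov-type bivariate model of the paper; the parameter values `n`, `a`, and `b` are assumptions chosen for the example.

```python
from math import lgamma, exp

def log_beta(a, b):
    """Natural log of the Beta function B(a, b)."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)

def beta_binomial_pmf(k, n, a, b):
    """P(X = k) for a beta-binomial with n trials and Beta(a, b) mixing."""
    log_choose = lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)
    return exp(log_choose + log_beta(k + a, n - k + b) - log_beta(a, b))

# Illustrative (assumed) parameter values
n, a, b = 10, 2.0, 3.0
pmf = [beta_binomial_pmf(k, n, a, b) for k in range(n + 1)]

p = a / (a + b)             # marginal success probability
rho = 1.0 / (a + b + 1.0)   # intra-class correlation, positive for any a, b > 0

# Mean and variance computed directly from the pmf
mean_x = sum(k * q for k, q in enumerate(pmf))
var_x = sum((k - mean_x) ** 2 * q for k, q in enumerate(pmf))

# The binomial variance n*p*(1-p) is inflated by the factor 1 + (n-1)*rho
var_formula = n * p * (1 - p) * (1 + (n - 1) * rho)
print(f"rho={rho:.4f} var_from_pmf={var_x:.4f} var_formula={var_formula:.4f}")
```

Because `rho` cannot be negative here, underdispersed data (variance below the binomial variance) fall outside this model, which is the motivation for the generalized alternatives the abstract lists.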

Fitting Distribution of Accident Frequency of Freeway Horizontal Curve Sections & Development of Negative Binomial Regression Models (고속도로 평면선형상 사고빈도분포 추정을 통한 음이항회귀모형 개발 (기하구조요인을 중심으로))

  • 강민욱;도철웅;손봉수
    • Journal of Korean Society of Transportation / v.20 no.7 / pp.197-204 / 2002
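  • To predict and prevent traffic accidents, it is reasonable to identify how accidents relate to the road geometric elements that can actually be controlled during road design. That is, before construction, the road designer should use field data to establish the relationship between geometric elements and accidents accurately and reflect it in the design. To this end, identifying the frequency distribution of traffic accidents is the most basic task and must precede the development of accident prediction models. Traffic accident counts are generally known to follow a negative binomial distribution because they exhibit overdispersion, with the variance exceeding the mean. Accordingly, before developing an accident model, this paper uses goodness-of-fit tests to examine the distribution of traffic accidents on horizontal curve sections, focusing on the Honam Expressway, where the road design elements at accident sites and other potential accident-related factors are comparatively well documented. The accident data are five years (1996-2000) of Honam Expressway records from the Korea Highway Corporation (한국도로공사), arranged for the analysis, and accidents on reverse-curve and single-curve sections were analyzed using the section segmentation method for horizontal alignment proposed by Kang and Son (2002). As expected, the goodness-of-fit analysis indicated that the negative binomial distribution is the most suitable probability distribution for describing accident counts, and a negative binomial regression model was then developed using maximum likelihood estimation. The negative binomial regression model with the segmentation method had a higher coefficient of determination than existing probabilistic regression models, and the geometric elements used in the model are the vehicle exposure factor, the curve radius, and the change in superelevation per unit distance.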
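
The overdispersion property this abstract relies on (variance larger than the mean) can be checked with a few lines of code. The sketch below is only an illustration: the counts are hypothetical, not the Honam Expressway data, and the moment estimate of the dispersion parameter assumes the common parameterization Var = mean + alpha * mean^2 rather than the maximum likelihood fit used in the paper.

```python
from statistics import mean, variance

def dispersion_summary(counts):
    """Return the sample mean, sample variance, and a moment estimate of alpha.

    Assumes the negative binomial variance function Var = m + alpha * m**2,
    so alpha_hat = (v - m) / m**2. A Poisson model corresponds to alpha = 0.
    """
    m = mean(counts)
    v = variance(counts)  # unbiased sample variance
    alpha = (v - m) / (m * m) if m > 0 else float("nan")
    return m, v, alpha

# Hypothetical accident counts per curve section (illustrative only)
counts = [0, 0, 1, 0, 3, 0, 7, 1, 0, 2, 0, 5]
m, v, alpha = dispersion_summary(counts)
print(f"mean={m:.3f} variance={v:.3f} alpha_hat={alpha:.3f}")
```

A clearly positive `alpha_hat` suggests that a Poisson model would understate the variability and that a negative binomial model is worth testing formally.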

A new sample selection model for overdispersed count data (과대산포 가산자료의 새로운 표본선택모형)

  • Jo, Sung Eun;Zhao, Jun;Kim, Hyoung-Moon
    • The Korean Journal of Applied Statistics / v.31 no.6 / pp.733-749 / 2018
  • Sample selection arises as a result of the partial observability of the outcome of interest in a study. Heckman introduced a sample selection model to analyze such data and proposed a full maximum likelihood estimation method under the assumption of normality. Recently, sample selection models for binomial and Poisson response variables have been proposed. Based on the theory of symmetry-modulated distributions, we extend these to a model for overdispersed count data. Without sample selection, this type of data is often modeled using the negative binomial distribution. Hence we propose a sample selection model for overdispersed count data using the negative binomial distribution. A real data application is presented. Simulation studies reveal that our estimation method based on the profile log-likelihood is stable.

A mixed-effects model for overdispersed binomial data (초과변동의 이항자료에 대한 혼합효과 모형)

  • Choi, Jae-Sung
    • Journal of the Korean Data and Information Science Society / v.10 no.1 / pp.199-205 / 1999
  • This paper discusses the generalized mixed-effects model for the analysis of overdispersed binomial data. Certain types of sampling designs or genetic characteristics of experimental units can sometimes be regarded as sources of extra-binomial variation. For such cases, this paper suggests models with one or two random effects to explain the overdispersion caused by those factors and shows how to test for model adequacy based on the deviance.

A binomial CUSUM chart for monitoring type I right-censored Weibull lifetimes (제1형의 우측중도절단된 와이블 수명자료를 관리하는 이항 누적합 관리도)

  • Choi, Min-jae;Lee, Jaeheon
    • The Korean Journal of Applied Statistics / v.29 no.5 / pp.823-833 / 2016
  • The lifetime is a key characteristic of product quality. It is best to obtain the lifetime data of all samples, but they are often censored due to time or expense limitations. In this paper, we propose a binomial cumulative sum (CUSUM) chart to monitor the mean of type I right-censored Weibull lifetime data, for a fixed value of the Weibull shape parameter. We compare the performance of the proposed binomial CUSUM chart with CUSUM charts studied previously using the steady-state average run length (ARL). The results show that the performance of the binomial CUSUM chart is better when the censoring rate is high and/or the sample size is small.
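
For readers unfamiliar with CUSUM charts, the recursion behind a one-sided binomial CUSUM can be sketched as follows. This is a generic textbook form, not necessarily the design used by the authors: it assumes each sample is summarized by the number of units that failed before the censoring time, and the reference value `k`, the decision interval `h`, and the example counts are all hypothetical.

```python
def binomial_cusum(counts, k, h):
    """Upper one-sided CUSUM on per-sample counts.

    counts: per-sample number of units that failed before the censoring time
    k: reference value subtracted at each step (assumed, user-chosen)
    h: decision interval; the chart signals when the statistic reaches h
    Returns the CUSUM path and the index of the first signal (None if no signal).
    """
    c = 0.0
    path = []
    signal = None
    for t, x in enumerate(counts):
        # Accumulate evidence above k, resetting to zero when it turns negative
        c = max(0.0, c + x - k)
        path.append(c)
        if signal is None and c >= h:
            signal = t
    return path, signal

# Hypothetical counts: early failures become more frequent from the sixth sample on
counts = [1, 2, 1, 0, 2, 4, 5, 4, 6, 5]
path, signal = binomial_cusum(counts, k=2.5, h=4.0)
print(path, signal)
```

An increase in early failures corresponds to a decrease in mean lifetime, which is why the upper-sided statistic is the one of interest in this setting.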

Comparison of Estimators of Dependence Related Parameter in Generalized Binomial Distribution (일반화 이항분포모형에서 시행간 종속성 규정모수의 추정량 비교 연구)

  • Moon, Myung-Sang
    • Journal of the Korean Data and Information Science Society / v.10 no.2 / pp.279-288 / 1999
  • When the conventional binomial distribution fails to fit real-world data, the main cause is often a lack of independence among the Bernoulli trials. Several authors have proposed models that are useful when the independence assumption is not satisfied. In this paper, one such model is adopted, and estimators of the dependence-related parameter that is crucial in defining the model are considered. A simulation is performed to compare two estimators of this parameter (the method-of-moments estimator and the maximum likelihood estimator), and conclusions are drawn.

A Bayesian zero-inflated negative binomial regression model based on Pólya-Gamma latent variables with an application to pharmaceutical data (폴랴-감마 잠재변수에 기반한 베이지안 영과잉 음이항 회귀모형: 약학 자료에의 응용)

  • Seo, Gi Tae;Hwang, Beom Seuk
    • The Korean Journal of Applied Statistics / v.35 no.2 / pp.311-325 / 2022
  • For count responses, excess zeros often occur in various research fields. The zero-inflated model is a common choice for modeling such count data. Bayesian inference for the zero-inflated model has long been recognized as a hard problem because the conditional posterior distribution is not available in closed form. Recently, however, Pillow and Scott (2012) and Polson et al. (2013) proposed a Pólya-Gamma data-augmentation strategy for logistic and negative binomial models, facilitating Bayesian inference for the zero-inflated model. We apply a Bayesian zero-inflated negative binomial regression model to longitudinal pharmaceutical data previously analyzed by Min and Agresti (2005). To facilitate posterior sampling for the longitudinal zero-inflated model, we use the Pólya-Gamma data-augmentation strategy.

Comparative Simulation Studies on Generalized Binomial Models (일반화 이항모형의 적합도 평가)

  • Baik, E.J.;Kim, K.Y.
    • Communications for Statistical Applications and Methods / v.18 no.4 / pp.507-516 / 2011
  • Comparative studies on generalized binomial models (Moon, 2003; Ng, 1989; Paul, 1985; Kupper and Haseman, 1978; Griffiths, 1973) are restrictive in that the set of models compared is rather limited and the MSE of the estimates is the only measure of model adequacy considered. This paper reports simulation results that provide possible guidelines for selecting a proper model. We examine a Pearson-type goodness-of-fit statistic relative to its degrees of freedom, together with AIC, for overall model quality. The MSE and bias of the individual estimates are also considered as component fit measures. The performance of some models varies widely over certain ranges of the parameter space, while most of the models are quite competent. Our evaluation shows that the Extended Beta-Binomial model (Prentice, 1986) is particularly favorable in that it provides a consistently excellent fit over almost all values of the intra-class correlation coefficient and the probability of success.

Statistical Modeling of Learning Curves with Binary Response Data (이항 반응 자료에 대한 학습곡선의 모형화)

  • Lee, Seul-Ji;Park, Man-Sik
    • Communications for Statistical Applications and Methods / v.19 no.3 / pp.433-450 / 2012
  • As a worker performs a certain operation repeatedly, he tends to become familiar with the job and complete it in a very short time. That is, efficiency improves with his accumulated knowledge, experience, and skill in the operation. The time invested per unit of output is reduced by repeating the operation. This phenomenon is referred to as the learning curve effect. A learning curve is a graphical representation of the changing rate of learning. According to previous literature, learning curve effects are determined by subjective pre-assigned factors. In this study, we propose a new statistical model to clarify the learning curve effect by means of a basic cumulative distribution function. This work mainly focuses on the statistical modeling of binary data. We employ the Newton-Raphson method for estimation and the delta method for the construction of confidence intervals. We also perform a real data analysis.

On Prediction Intervals for Binomial Data (이항자료에 대한 예측구간)

  • Ryu, Jea-Bok
    • The Korean Journal of Applied Statistics / v.26 no.6 / pp.943-952 / 2013
  • The Wald, Agresti-Coull, Jeffreys, and Bayes-Laplace methods, which are commonly used for confidence intervals of a binomial proportion, are applied to prediction intervals. We used coverage probability, mean coverage probability, root mean squared error, and mean expected width for numerical comparisons. From the comparisons, we found that Wald is not appropriate, just as for confidence intervals, and that Agresti-Coull is too conservative, unlike its behavior for confidence intervals. However, Jeffreys and Bayes-Laplace are good for prediction intervals, and Jeffreys is especially desirable, as it is for confidence intervals.
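
Two of the methods named in this abstract have simple closed forms in their original confidence-interval versions, sketched below with the Python standard library. This shows only those textbook confidence intervals, not the prediction-interval adaptations studied in the paper, and the example data (0 successes in 20 trials) are an assumption chosen to expose the Wald interval's weakness.

```python
from math import sqrt
from statistics import NormalDist

def wald_interval(x, n, level=0.95):
    """Wald confidence interval for a binomial proportion."""
    z = NormalDist().inv_cdf(0.5 + level / 2)
    p = x / n
    half = z * sqrt(p * (1 - p) / n)
    return p - half, p + half

def agresti_coull_interval(x, n, level=0.95):
    """Agresti-Coull interval: Wald form around an adjusted center."""
    z = NormalDist().inv_cdf(0.5 + level / 2)
    n_adj = n + z * z
    p_adj = (x + z * z / 2) / n_adj
    half = z * sqrt(p_adj * (1 - p_adj) / n_adj)
    return p_adj - half, p_adj + half

# Illustrative (assumed) data: no successes observed in 20 trials
x, n = 0, 20
wald = wald_interval(x, n)
ac = agresti_coull_interval(x, n)
print("Wald:", wald)
print("Agresti-Coull:", ac)
```

With no observed successes the Wald interval collapses to a single point, while the Agresti-Coull interval keeps a positive width, which is one reason the Wald method performs poorly near the boundaries.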