• Title/Summary/Keyword: Zero-inflated model

Search Result 66, Processing Time 0.018 seconds

An application to Multivariate Zero-Inflated Poisson Regression Model

  • Kim, Kyung-Moo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.2
    • /
    • pp.177-186
    • /
    • 2003
  • The Zero-Inflated Poisson regression is a model for count data with exess zeros. When the correlated response variables are intrested, we have to extend the univariate zero-inflated regression model to multivariate model. In this paper, we study and simulate the multivariate zero-inflated regression model. A real example was applied to this model. Regression parameters are estimated by using MLE's. We also compare the fitness of multivariate zero-inflated Poisson regression model with the decision tree model.

  • PDF

THE DEVELOPMENT OF A ZERO-INFLATED RASCH MODEL

  • Kim, Sungyeun;Lee, Guemin
    • The Pure and Applied Mathematics
    • /
    • v.20 no.1
    • /
    • pp.59-70
    • /
    • 2013
  • The purpose of this study was to develop a zero-inflated Rasch (ZI-Rasch) model, a combination of the Rasch model and the ZIP model. The ZI-Rasch model was considered in this study as an appropriate alternative to the Rasch model for zero-inflated data. To investigate the relative appropriateness of the ZI-Rasch model, several analyses were conducted using PROC NLMIXED procedures in SAS under various simulation conditions. Sets of criteria for model evaluations (-2LL, AIC, AICC, and BIC) and parameter estimations (RMSE, and $r$) from the ZI-Rasch model were compared with those from the Rasch model. In the data-model fit indices, regardless of the simulation conditions, the ZI-Rasch model produced better fit statistics than did the Rasch model, even when the response data were generated from the Rasch model. In terms of item parameter ${\lambda}$ estimations, the ZI-Rasch model produced estimates similar to those of the Rasch model.

An application to Zero-Inflated Poisson Regression Model

  • Kim, Kyung-Moo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.1
    • /
    • pp.45-53
    • /
    • 2003
  • The Zero-Inflated Poisson regression is a model for count data with exess zeros. When the reponse variables have excess zeros, it is not easy to apply the Poisson regression model. In this paper, we study and simulate the zero-inflated Poisson regression model. An real example was applied to this model. Regression parameters are estimated by using MLE's. We also compare the fitness of zero-inflated Poisson model with the Poisson regression and decision tree model.

  • PDF

Bayesian Analysis of a Zero-inflated Poisson Regression Model: An Application to Korean Oral Hygienic Data (영과잉 포아송 회귀모형에 대한 베이지안 추론: 구강위생 자료에의 적용)

  • Lim, Ah-Kyoung;Oh, Man-Suk
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.3
    • /
    • pp.505-519
    • /
    • 2006
  • We consider zero-inflated count data, which is discrete count data but has too many zeroes compared to the Poisson distribution. Zero-inflated data can be found in various areas. Despite its increasing importance in practice, appropriate statistical inference on zero-inflated data is limited. Classical inference based on a large number theory does not fit unless the sample size is very large. And regular Poisson model shows lack of St due to many zeroes. To handle the difficulties, a mixture of distributions are considered for the zero-inflated data. Specifically, a mixture of a point mass at zero and a Poisson distribution is employed for the data. In addition, when there exist meaningful covariates selected to the response variable, loglinear link is used between the mean of the response and the covariates in the Poisson distribution part. We propose a Bayesian inference for the zero-inflated Poisson regression model by using a Markov Chain Monte Carlo method. We applied the proposed method to a Korean oral hygienic data and compared the inference results with other models. We found that the proposed method is superior in that it gives small parameter estimation error and more accurate predictions.

A Bayesian zero-inflated negative binomial regression model based on Pólya-Gamma latent variables with an application to pharmaceutical data (폴랴-감마 잠재변수에 기반한 베이지안 영과잉 음이항 회귀모형: 약학 자료에의 응용)

  • Seo, Gi Tae;Hwang, Beom Seuk
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.2
    • /
    • pp.311-325
    • /
    • 2022
  • For count responses, the situation of excess zeros often occurs in various research fields. Zero-inflated model is a common choice for modeling such count data. Bayesian inference for the zero-inflated model has long been recognized as a hard problem because the form of conditional posterior distribution is not in closed form. Recently, however, Pillow and Scott (2012) and Polson et al. (2013) proposed a Pólya-Gamma data-augmentation strategy for logistic and negative binomial models, facilitating Bayesian inference for the zero-inflated model. We apply Bayesian zero-inflated negative binomial regression model to longitudinal pharmaceutical data which have been previously analyzed by Min and Agresti (2005). To facilitate posterior sampling for longitudinal zero-inflated model, we use the Pólya-Gamma data-augmentation strategy.

Safety Performance Functions for Central Business Districts Using a Zero-Inflated Model (영과잉을 고려한 중심상업지구 교통사고모형 개발에 관한 연구)

  • Lee, Sang Hyuk;Woo, Yong Han
    • International Journal of Highway Engineering
    • /
    • v.18 no.4
    • /
    • pp.83-92
    • /
    • 2016
  • PURPOSES : The purpose of this study was to develop safety performance functions (SPFs) that use zero-inflated negative binomial regression models for urban intersections in central business districts (CBDs), and to compare the statistical significance of developed models against that of regular negative binomial regression models. METHODS : To develop and analyze the SPFs of intersections in CBDs, data acquisition was conducted for dependent and independent variables in areas of study. We analyzed the SPFs using zero-inflated negative binomial regression model as well as regular negative binomial regression model. We then compared the results by analyzing the statistical significance of the models. RESULTS : SPFs were estimated for all accidents and injury accidents at intersections in CBDs in terms of variables such as AADT, Number of Lanes at Major Roads, Median Barriers, Right Turn with an Exclusive Turn Lane, Turning Guideline, and Front Signal. We also estimated the log-likelihood at convergence and the likelihood ratio of SPFs for comparing the zero-inflated model with the regular model. In he SPFs, estimated log-likelihood at convergence and the likelihood ratio of the zero-inflated model were at -836.736, 0.193 and -836.415, 0.195. Also estimated the log-likelihood at convergence and likelihood ratio of the regular model were at -843.547, 0.187 and -842.631, 0.189, respectively. These figures demonstrate that zero-inflated negative binomial regression models can better explain traffic accidents at intersections in CBDs. CONCLUSIONS : SPFs that use a zero-inflated negative binomial regression model demonstrate better statistical significance compared with those that use a regular negative binomial regression model.

Zero In ated Poisson Model for Spatial Data (영과잉 공간자료의 분석)

  • Han, Junhee;Kim, Changhoon
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.2
    • /
    • pp.231-239
    • /
    • 2015
  • A Poisson model is the first choice for counts data. Quasi Poisson or negative binomial models are usually used in cases of over (or under) dispersed data. However, these models might be unsuitable if the data consist of excessive number of zeros (zero inflated data). For zero inflated counts data, Zero Inflated Poisson (ZIP) or Zero Inflated Negative Binomial (ZINB) models are recommended to address the issue. In this paper, we further considered a situation where zero inflated data are spatially correlated. A mixed effect model with random effects that account for spatial autocorrelation is used to fit the data.

Application of Zero-Inflated Poisson Distribution to Utilize Government Quality Assurance Activity Data (정부 품질보증활동 데이터 활용을 위한 Zero-Inflated 포아송 분포 적용)

  • Kim, JH;Lee, CW
    • Journal of Korean Society for Quality Management
    • /
    • v.46 no.3
    • /
    • pp.509-522
    • /
    • 2018
  • Purpose: The purpose of this study was to propose more accurate mathematical model which can represent result of government quality assurance activity, especially corrective action and flaw. Methods: The collected data during government quality assurance activity was represented through histogram. To find out which distributions (Poisson distribution, Zero-Inflated Poisson distribution) could represent the histogram better, this study applied Pearson's correlation coefficient. Results: The result of this study is as follows; Histogram of corrective action during past 3 years and Zero-Inflated Poisson distribution had strong relationship that their correlation coefficients was over 0.94. Flaw data could not re-parameterize to Zero-Inflated Poisson distribution because its frequency of flaw occurrence was too small. However, histogram of flaw data during past 3 years and Poisson distribution showed strong relationship that their correlation coefficients was 0.99. Conclusion: Zero-Inflated Poisson distribution represented better than Poisson distribution to demonstrate corrective action histogram. However, in the case of flaw data histogram, Poisson distribution was more accurate than Zero-Inflated Poisson distribution.

A Bayesian joint model for continuous and zero-inflated count data in developmental toxicity studies

  • Hwang, Beom Seuk
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.2
    • /
    • pp.239-250
    • /
    • 2022
  • In many applications, we frequently encounter correlated multiple outcomes measured on the same subject. Joint modeling of such multiple outcomes can improve efficiency of inference compared to independent modeling. For instance, in developmental toxicity studies, fetal weight and number of malformed pups are measured on the pregnant dams exposed to different levels of a toxic substance, in which the association between such outcomes should be taken into account in the model. The number of malformations may possibly have many zeros, which should be analyzed via zero-inflated count models. Motivated by applications in developmental toxicity studies, we propose a Bayesian joint modeling framework for continuous and count outcomes with excess zeros. In our model, zero-inflated Poisson (ZIP) regression model would be used to describe count data, and a subject-specific random effects would account for the correlation across the two outcomes. We implement a Bayesian approach using MCMC procedure with data augmentation method and adaptive rejection sampling. We apply our proposed model to dose-response analysis in a developmental toxicity study to estimate the benchmark dose in a risk assessment.

Weighted zero-inflated Poisson mixed model with an application to Medicaid utilization data

  • Lee, Sang Mee;Karrison, Theodore;Nocon, Robert S.;Huang, Elbert
    • Communications for Statistical Applications and Methods
    • /
    • v.25 no.2
    • /
    • pp.173-184
    • /
    • 2018
  • In medical or public health research, it is common to encounter clustered or longitudinal count data that exhibit excess zeros. For example, health care utilization data often have a multi-modal distribution with excess zeroes as well as a multilevel structure where patients are nested within physicians and hospitals. To analyze this type of data, zero-inflated count models with mixed effects have been developed where a count response variable is assumed to be distributed as a mixture of a Poisson or negative binomial and a distribution with a point mass of zeros that include random effects. However, no study has considered a situation where data are also censored due to the finite nature of the observation period or follow-up. In this paper, we present a weighted version of zero-inflated Poisson model with random effects accounting for variable individual follow-up times. We suggested two different types of weight function. The performance of the proposed model is evaluated and compared to a standard zero-inflated mixed model through simulation studies. This approach is then applied to Medicaid data analysis.