• Title/Summary/Keyword: zero-inflated data

An Analysis on the Determinants of Employed Labour Quantity in the Fishing Industry (어가의 고용량 결정요인 분석)

  • Kim, Tae-Hyun;Park, Cheol-Hyung;Nam, Jongoh
    • Environmental and Resource Economics Review / v.27 no.3 / pp.545-567 / 2018
  • This study applied and compared the Poisson, negative binomial, zero-inflated Poisson, and zero-inflated negative binomial models to estimate the determinants of employed labour quantity. To estimate each model, the study used fisheries census data obtained from the Microdata Integrated Service run by Statistics Korea. The zero-inflated negative binomial model was selected on the basis of the Vuong test and the likelihood-ratio test. In addition, the study estimated practical changes in the employed labour quantity of fishing villages by analyzing changes from 2010 to 2015. The results showed that owning fishing vessels and a high selling price had a significant effect on decreasing labour quantity. Meanwhile, the longer the work experience of the household, the more significant the increase in labour quantity. In conclusion, the study showed that capitalized fishing households and accelerated aging had a significant impact on the change in labour quantity.
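
As a rough illustration of what the zero-inflated Poisson model in this comparison adds over a plain Poisson model, the sketch below evaluates both probability mass functions. The values of `lam` and `pi` are hypothetical and are not estimates from the study; the function names are chosen only for this example.

```python
import math


def poisson_pmf(k: int, lam: float) -> float:
    """Probability of observing count k under Poisson(lam)."""
    return math.exp(-lam) * lam**k / math.factorial(k)


def zip_pmf(k: int, lam: float, pi: float) -> float:
    """Zero-inflated Poisson: with probability pi the count is a
    structural zero, otherwise it is drawn from Poisson(lam)."""
    base = (1.0 - pi) * poisson_pmf(k, lam)
    return pi + base if k == 0 else base


# Hypothetical parameters, for illustration only.
lam, pi = 2.0, 0.3

p0_poisson = poisson_pmf(0, lam)   # zero probability without inflation
p0_zip = zip_pmf(0, lam, pi)       # zero probability with inflation
zip_total = sum(zip_pmf(k, lam, pi) for k in range(100))
zip_mean = sum(k * zip_pmf(k, lam, pi) for k in range(100))
```

The extra mass at zero is exactly what tests of zero-inflation look for; the mean of the mixture shrinks to `(1 - pi) * lam`.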

Neighborhood Environment Associated with Physical Activity among Rural Adults: Applying Zero-Inflated Negative Binominal Regression Modeling (영과잉 음이항 회귀모형을 적용한 농촌지역 성인 신체활동의 지역사회환경 요인 분석)

  • Kim, Bongjeong
    • Journal of Korean Public Health Nursing / v.29 no.3 / pp.488-502 / 2015
  • Purpose: This study was conducted to determine the neighborhood environmental factors associated with physical activity among adults living in rural communities. Methods: A cross-sectional descriptive survey was conducted with a convenience sample of 201 adults living in three Ri in Y-city, Gyeonggi-do. Data were collected through face-to-face interviews by trained interviewers and were analyzed using a zero-inflated negative binomial regression model. Results: Overall, 76.1% of participants reported engaging in moderate or vigorous physical activity; 10.5% reported meeting moderate physical activity recommendations and 14.5% reported meeting vigorous physical activity recommendations. Zero-inflated negative binomial regression analysis showed that an increasing number of days of physical activity was associated with social cohesion ($\beta = .130$, p=.005), social network ($\beta = -.096$, p=.003), and safety from crime ($\beta = -.151$, p=.036), while having no days of physical activity was associated with no educational attainment and marginally associated with increasing BMI. Conclusion: Neighborhood environmental factors including social cohesion, social network, and safety from crime were significantly associated with the physical activity of rural adults. Community health nurses should expand their approach to individual behavior change by incorporating rural adults' specific neighborhood environmental factors into physical activity interventions.

Predictors of Blood and Body Fluid Exposure and Mediating Effects of Infection Prevention Behavior in Shift-Working Nurses: Application of Analysis Method for Zero-Inflated Count Data (교대근무 간호사의 혈액과 체액 노출 사고 예측 요인과 감염예방행위의 매개효과: 영과잉 가산 자료 분석방법을 적용하여)

  • Ryu, Jae Geum;Choi-Kwon, Smi
    • Journal of Korean Academy of Nursing / v.50 no.5 / pp.658-670 / 2020
  • Purpose: This study aimed to identify the predictors of blood and body fluid exposure (BBFE) among multifaceted individual (sleep disturbance and fatigue), occupational (occupational stress), and organizational (hospital safety climate) factors, as well as infection prevention behavior. We also aimed to test the mediating effect of infection prevention behavior on the relationship between these multifaceted factors and the frequency of BBFE. Methods: This study was a secondary data analysis using data on 246 nurses from the Shift Work Nurses' Health and Turnover study. Because the BBFE frequencies were zero-inflated and over-dispersed count data, they were analyzed using zero-inflated negative binomial regression within a generalized linear model, and the mediating effect was tested, using SPSS 25.0, Stata 14.1, and the PROCESS macro. Results: We found that the frequency of BBFE increased in subjects with disturbed sleep (IRR = 1.87, p = .049), and the probability of non-BBFE increased in subjects reporting higher infection prevention behavior (IRR = 15.05, p = .006) and a better hospital safety climate (IRR = 28.46, p = .018). We also found that infection prevention behavior had mediating effects on the occupational stress-BBFE and hospital safety climate-BBFE relationships. Conclusion: Sleep disturbance is an important risk factor related to the frequency of BBFE, whereas infection prevention behavior and hospital safety climate are preventive factors. We suggest individual and systemic efforts to improve sleep, occupational stress, and hospital safety climate to prevent BBFE occurrence.

Likelihood Ratio Test for the Epidemic Alternatives on the Zero-Inflated Poisson Model (변화시점이 있는 영과잉-포아송모형에서 돌출대립가설에 대한 우도비검정)

  • Kim, Kyung-Moo
    • Journal of the Korean Data and Information Science Society / v.9 no.2 / pp.247-253 / 1998
  • In the case of the epidemic zero-inflated Poisson model, the likelihood ratio test was used for testing epidemic alternatives. Epidemic changepoints were estimated by the method of least squares, and these estimates were used as starting points for computing the maximum likelihood estimators. Several parameter estimates were then compared through Monte Carlo simulations. The results show that the maximum likelihood estimators for the epidemic changepoints and several parameters perform better than the least squares and moment estimators.

Developing Rear-End Collision Models of Roundabouts in Korea (국내 회전교차로의 추돌사고 모형 개발)

  • Park, Byung Ho;Beak, Tae Hun
    • Journal of the Korean Society of Safety / v.29 no.6 / pp.151-157 / 2014
  • This study deals with rear-end collisions at roundabouts. Its purpose is to develop accident models of rear-end collisions in Korea. To that end, the study gives particular attention to developing appropriate models using Poisson, negative binomial, ZAM, multiple linear and nonlinear regression models, and statistical analysis tools. The main results are as follows. First, the Vuong statistics and overdispersion parameters indicate that ZIP is the most appropriate model among the count data models. Second, RMSE, MPB, MAD and correlation coefficient tests show that the multiple nonlinear model is the most suitable for the rear-end collision data. Finally, independent variables such as traffic volume, ratio of heavy vehicles, number of circulatory roadway lanes, number of crosswalks and stop line are adopted in the optimal model.

Estimation of Advertising Exposure Distribution by Zero-inflation Regression Models (영과잉 회귀모형을 이용한 광고노출분포 추정)

  • Lee, Dong-Hee
    • Journal of the Korean Data Analysis Society / v.20 no.6 / pp.2841-2852 / 2018
  • This study examines a regression modeling method based on zero-inflated distributions for estimating the exposure distribution required in advertising media planning. The exposure distribution is the percentage of the audience that is exposed each time the ad is repeated. It plays a very important role in providing the basic information needed to calculate various indicators that quantitatively measure advertising effects. In particular, because advertising prices have fallen and media have diversified, the frequency of advertising, or of broadcasting specific advertisements, has increased greatly compared with the past. As a result, the frequency of exposure is decreasing in relative terms. In this situation, the number of individuals who are not exposed to the media, that is, who are structurally not exposed to advertising, is increasing. This research proposes advertising exposure distribution models using a zero-inflated regression model and conducts a comparative study using actual cases.

A Bayesian cure rate model with dispersion induced by discrete frailty

  • Cancho, Vicente G.;Zavaleta, Katherine E.C.;Macera, Marcia A.C.;Suzuki, Adriano K.;Louzada, Francisco
    • Communications for Statistical Applications and Methods / v.25 no.5 / pp.471-488 / 2018
  • In this paper, we propose extending proportional hazards frailty models to allow a discrete distribution for the frailty variable. Having zero frailty can be interpreted as being immune or cured. Thus, we develop a new survival model induced by discrete frailty with a zero-inflated power series distribution, which can account for overdispersion. This proposal also allows for a realistic description of non-risk individuals, since individuals cured due to intrinsic factors (immunes) are modeled by a deterministic fraction of zero-risk, while those cured due to an intervention are modeled by a random fraction. We put the proposed model in a Bayesian framework and use a Markov chain Monte Carlo algorithm to compute the posterior distribution. A simulation study is conducted to assess the proposed model and the computation algorithm. We also discuss model selection based on pseudo-Bayes factors and develop case influence diagnostics for the joint posterior distribution through $\psi$-divergence measures. The motivating cutaneous melanoma data are analyzed for illustration purposes.

Prediction of K-league soccer scores using bivariate Poisson distributions (이변량 포아송분포를 이용한 K-리그 골 점수의 예측)

  • Lee, Jang Taek
    • Journal of the Korean Data and Information Science Society / v.25 no.6 / pp.1221-1229 / 2014
  • In this paper we choose the best model among several bivariate Poisson models for Korean soccer data. The models considered allow for correlation between the numbers of goals scored by the two competing teams. We use an R package called bivpois for bivariate Poisson regression models and K-league data for the 1983-2012 seasons. We conclude that the best-fitting model according to the AIC and BIC is the bivariate Poisson model with constant covariance. The zero-inflated and diagonal-inflated models did not improve the model fit. The model can be used to examine the home-away effect, goodness of fit, and attack and defense parameters.
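
To make the idea of correlated goal counts concrete, here is a minimal sketch of a bivariate Poisson probability mass function in its common trivariate-reduction form, where X = W1 + W3 and Y = W2 + W3 with independent Poisson components. It is assumed here that this matches the constant-covariance model in spirit; the rates `l1`, `l2`, `l3` are hypothetical and are not taken from the paper or from the bivpois package.

```python
import math


def bivariate_poisson_pmf(x: int, y: int, l1: float, l2: float, l3: float) -> float:
    """P(X = x, Y = y) where X = W1 + W3, Y = W2 + W3 and
    W1, W2, W3 are independent Poisson(l1), Poisson(l2), Poisson(l3)."""
    front = (
        math.exp(-(l1 + l2 + l3))
        * l1**x / math.factorial(x)
        * l2**y / math.factorial(y)
    )
    ratio = l3 / (l1 * l2)
    series = sum(
        math.comb(x, k) * math.comb(y, k) * math.factorial(k) * ratio**k
        for k in range(min(x, y) + 1)
    )
    return front * series


# Hypothetical scoring rates, for illustration only.
l1, l2, l3 = 1.4, 1.1, 0.2
grid = range(41)  # truncation point; the tail beyond 40 goals is negligible here

total = sum(bivariate_poisson_pmf(x, y, l1, l2, l3) for x in grid for y in grid)
mean_x = sum(x * bivariate_poisson_pmf(x, y, l1, l2, l3) for x in grid for y in grid)
mean_y = sum(y * bivariate_poisson_pmf(x, y, l1, l2, l3) for x in grid for y in grid)
mean_xy = sum(x * y * bivariate_poisson_pmf(x, y, l1, l2, l3) for x in grid for y in grid)
covariance = mean_xy - mean_x * mean_y  # equals l3 in this construction
```

In this construction the shared component `l3` is the covariance between the two scores, so setting it to zero recovers two independent Poisson counts.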

Development of the U-turn Accident Model at 4-Legged Signalized Intersections in Urban Areas (도시부 4지 신호교차로 유턴 사고모형 개발)

  • Kang, JongHo;Kim, KyungWhan;Ha, ManBok;Kim, SeongMun
    • International Journal of Highway Engineering / v.16 no.2 / pp.119-129 / 2014
  • PURPOSES : The purpose of this study is to develop a U-turn accident model for 4-legged signalized intersections in urban areas. METHODS : To analyze the characteristics of accidents associated with U-turn operation at 4-legged signalized intersections in urban areas and to develop a U-turn accident model by regression analysis, tests of overdispersion and zero-inflation are conducted on the dependent variables, namely the number of accidents and EPDO (Equivalent Property Damage Only). RESULTS : The Poisson model fits best for the number of accidents and the ZIP (Zero Inflated Poisson) model fits best for EPDO. The variables of conflict traffic, width of opposing road, and traffic passing speed are adopted as independent variables for both models. The variables of number of bus berths and rate of U-turn signal time during which the U-turn is permitted are adopted as independent variables only for EPDO. CONCLUSIONS : These results suggest that U-turns may be permitted at intersections where the opposing road is wider than 11.9 meters, the passing vehicle speed is not high, and U-turn operation is not hindered by buses stopping at bus stops.

Prediction of the Number of Food Poisoning Occurrences by Microbes (원인균별 식중독 발생 건수 예측)

  • Yeo, In-Kwon
    • The Korean Journal of Applied Statistics / v.26 no.6 / pp.923-932 / 2013
  • This paper proposes a method to predict the number of foodborne disease outbreaks by microbe. The weekly data on food poisoning occurrences by microbe in Korea contain many zero-valued observations and exhibit dependence between outbreaks. To model both phenomena, the total number of food poisonings is predicted by an autoregressive model, and the probabilities of food poisoning occurrences by microbe (given the total number of food poisonings) are estimated by the baseline category logit model. The predicted number of foodborne disease outbreaks for a microbe is obtained by multiplying the predicted total number of outbreaks by the estimated probability of food poisoning by the corresponding microbe. The mean squared error and the mean absolute error are evaluated to compare the performance of the proposed method with that of the zero-inflated model.
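
The final multiplication step described in this abstract can be sketched as follows. The predicted total, the microbe labels, and the probabilities are hypothetical placeholders; in the paper they would come from the autoregressive model and the baseline category logit model, neither of which is reproduced here.

```python
def predict_by_microbe(predicted_total: float, probs: dict[str, float]) -> dict[str, float]:
    """Split a predicted total count across categories by multiplying
    it with each category's estimated conditional probability."""
    if abs(sum(probs.values()) - 1.0) > 1e-9:
        raise ValueError("category probabilities must sum to 1")
    return {microbe: predicted_total * p for microbe, p in probs.items()}


# Hypothetical inputs, for illustration only.
predicted_total = 8.0  # stand-in for the autoregressive forecast
probs = {"microbe_A": 0.5, "microbe_B": 0.3, "other": 0.2}  # stand-in logit output

by_microbe = predict_by_microbe(predicted_total, probs)
```

Because the probabilities sum to one, the per-microbe predictions add back up to the predicted total.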