• Title/Summary/Keyword: poisson regression models

Search Result 96, Processing Time 0.025 seconds

Accident Models of Circular Intersections in Korea (국내 원형교차로 사고모형)

  • Lee, Seung Ju;Park, Min Kyu;Park, Byung Ho
    • Journal of the Korean Society of Safety
    • /
    • v.29 no.1
    • /
    • pp.54-58
    • /
    • 2014
  • This study deals with the accidents of circular intersections in Korea. The goal is to develop the accident models for 94 circular intersections. In pursuing the above, this study gives particular attentions to collecting the data of geometric structure and accidents, and comparatively analyzing such the models as Poisson and NB regression and multiple regression model using SPSS 17.0 and LIMDEP 3.0. The main results are as follows. First, the negative binomial model among various models was analyzed to be the most appropriate. Second, 3 independent variables was adopted in the model, and these variables was analyzed to have a positive relation to the accident rate. Finally, the reduced width of circulatory roadway, removal of the parking lot within circulatory roadway and appropriate levels of approach lane were required to improve the safety of circular intersection.

Development of Accident Model by Traffic Violation Type in Korea 4-legged Circular Intersections (국내 4지 원형교차로 법규위반별 사고모형 개발)

  • Park, Byung Ho;Kim, Kyeong Yong
    • Journal of the Korean Society of Safety
    • /
    • v.30 no.2
    • /
    • pp.70-76
    • /
    • 2015
  • This study deals with the traffic accident of circular intersections. The purpose of the study is to develop the accident models by traffic violation type. In pursuing the above, this study gives particular attention to analyzing various factors that influence traffic accident and developing such the optimal models as Poisson and Negative binomial regression models. The main results are the followings. First, 4 negative binomial models which were statistically significant were developed. This was because the over-dispersion coefficients had a value greater than 1.96. Second, the common variables in these models were not adopted. The specific variables by model were analyzed to be traffic volume, conflicting ratio, number of circulatory lane, width of circulatory lane, number of traffic island by access road, number of reduction facility, feature of central island and crosswalk.

Traffic Accident Models of Domestic Rotary by Day and Nighttime (국내 로터리의 주.야간 교통사고모형)

  • Park, Byung-Ho;Lim, Jin-Kang;Back, Tae-Hun
    • Journal of the Korean Society of Safety
    • /
    • v.27 no.2
    • /
    • pp.105-110
    • /
    • 2012
  • This study deals with the accident models of rotary. The objectives is to develop the models by day and nighttime. In pursuing the above, this study gives particular attentions to collecting the data of geometric structure and accidents of 20 rotaries and developing the Poisson and negative binomial regression models using NLOGIT 4.0. The main results are as follows. First, the numbers of accident of nighttime (1.03 per 1,000 entering vehicles) were analyzed to be very higher than those of day (0.47 per 1,000 entering vehicles). Second, 4 Poisson models which were all statistically significant were developed, in which the dependent variable were both the number of accident and EPDO (equivalent property damage only). Finally, the number of entry/exit ($X_1$) and the number of entering lane ($X_5$) in the models of the number of accident, and $X_1$ in the EPDO models were adopted as the common variables. The variables were analyzed to be all positive to the dependent variables.

Variable selection in Poisson HGLMs using h-likelihoood

  • Ha, Il Do;Cho, Geon-Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.6
    • /
    • pp.1513-1521
    • /
    • 2015
  • Selecting relevant variables for a statistical model is very important in regression analysis. Recently, variable selection methods using a penalized likelihood have been widely studied in various regression models. The main advantage of these methods is that they select important variables and estimate the regression coefficients of the covariates, simultaneously. In this paper, we propose a simple procedure based on a penalized h-likelihood (HL) for variable selection in Poisson hierarchical generalized linear models (HGLMs) for correlated count data. For this we consider three penalty functions (LASSO, SCAD and HL), and derive the corresponding variable-selection procedures. The proposed method is illustrated using a practical example.

Generalized nonlinear percentile regression using asymmetric maximum likelihood estimation

  • Lee, Juhee;Kim, Young Min
    • Communications for Statistical Applications and Methods
    • /
    • v.28 no.6
    • /
    • pp.627-641
    • /
    • 2021
  • An asymmetric least squares estimation method has been employed to estimate linear models for percentile regression. An asymmetric maximum likelihood estimation (AMLE) has been developed for the estimation of Poisson percentile linear models. In this study, we propose generalized nonlinear percentile regression using the AMLE, and the use of the parametric bootstrap method to obtain confidence intervals for the estimates of parameters of interest and smoothing functions of estimates. We consider three conditional distributions of response variables given covariates such as normal, exponential, and Poisson for three mean functions with one linear and two nonlinear models in the simulation studies. The proposed method provides reasonable estimates and confidence interval estimates of parameters, and comparable Monte Carlo asymptotic performance along with the sample size and quantiles. We illustrate applications of the proposed method using real-life data from chemical and radiation epidemiological studies.

Modeling clustered count data with discrete weibull regression model

  • Yoo, Hanna
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.4
    • /
    • pp.413-420
    • /
    • 2022
  • In this study we adapt discrete weibull regression model for clustered count data. Discrete weibull regression model has an attractive feature that it can handle both under and over dispersion data. We analyzed the eighth Korean National Health and Nutrition Examination Survey (KNHANES VIII) from 2019 to assess the factors influencing the 1 month outpatient stay in 17 different regions. We compared the results using clustered discrete Weibull regression model with those of Poisson, negative binomial, generalized Poisson and Conway-maxwell Poisson regression models, which are widely used in count data analyses. The results show that the clustered discrete Weibull regression model using random intercept model gives the best fit. Simulation study is also held to investigate the performance of the clustered discrete weibull model under various dispersion setting and zero inflated probabilities. In this paper it is shown that using a random effect with discrete Weibull regression can flexibly model count data with various dispersion without the risk of making wrong assumptions about the data dispersion.

Traffic Accident Models of Arterial Road Sections by Number of Lane in the Case of Cheongju (차로수별 간선도로구간 사고모형 - 청주시를 사례로 -)

  • Lim, Jin-Kang;Na, Hee;Park, Byung-Ho
    • Journal of the Korean Society of Safety
    • /
    • v.26 no.5
    • /
    • pp.130-135
    • /
    • 2011
  • This study deals with the accident models of arterial road sections. The objectives is to develop the models by number of lane. In pursuing the above, this study gives particular emphasis to dividing the 474 small link sections, collecting the accident data of 2007, and applying the statistical programs of SPSS17.0 and NLOGIT4.0. The main results are as follows. First, the number of accidents of two-lane roads were analyzed to be 59.9% of totals and to be the most of all. Second, one Poisson and two negative binomial regression models which were all statistically significant were developed. Finally, the common variables of all models were evaluated to be ADT and number of exit/entry which were all positive to the accidents.

The Analysis of the Number of Donations Based on a Mixture of Poisson Regression Model (포아송 분포의 혼합모형을 이용한 기부 횟수 자료 분석)

  • Kim In-Young;Park Su-Bum;Kim Byung-Soo;Park Tae-Kyu
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.1
    • /
    • pp.1-12
    • /
    • 2006
  • The aim of this study is to analyse a survey data on the number of charitable donations using a mixture of two Poisson regression models. The survey was conducted in 2002 by Volunteer 21, an nonprofit organization, based on Koreans, who were older than 20. The mixture of two Poisson distributions is used to model the number of donations based on the empirical distribution of the data. The mixture of two Poisson distributions implies the whole population is subdivided into two groups, one with lesser number of donations and the other with larger number of donations. We fit the mixture of Poisson regression models on the number of donations to identify significant covariates. The expectation-maximization algorithm is employed to estimate the parameters. We computed 95% bootstrap confidence interval based on bias-corrected and accelerated method and used then for selecting significant explanatory variables. As a result, the income variable with four categories and the volunteering variable (1: experience of volunteering, 0: otherwise) turned out to be significant with the positive regression coefficients both in the lesser and the larger donation groups. However, the regression coefficients in the lesser donation group were larger than those in larger donation group.

Forecasting hierarchical time series for foodborne disease outbreaks (식중독 발생 건수에 대한 계층 시계열 예측)

  • In-Kwon Yeo
    • The Korean Journal of Applied Statistics
    • /
    • v.37 no.4
    • /
    • pp.499 -508
    • /
    • 2024
  • In this paper, we investigate hierarchical time series forecasting that adhere to a hierarchical structure when deriving predicted values by analyzing segmented data as well as aggregated datasets. The occurrences of food poisoning by a specific pathogen are analyzed using zero-inflated Poisson regression models and negative binomial regression models. The occurrences of major, miscellaneous, and overall food poisoning are analyzed using Poisson regression models and negative binomial regression models. For hierarchical time series forecasting, the MinT estimation proposed by Wickramasuriya et al. (2019) is employed. Negative predicted values resulting from hierarchical adjustments are adjusted to zero, and weights are multiplied to the remaining lowest-level variables to satisfy the hierarchical structure. Empirical analysis revealed that there is little difference between hierarchical and non-hierarchical adjustments in predictions based on pathogens. However, hierarchical adjustments generally yield superior results for predictions concerning major, miscellaneous, and overall occurrences. Without hierarchical adjustment, instances may occur where the predicted frequencies of the lowest-level variables exceed that of major or miscellaneous occurrences. However, the proposed method enables the acquisition of predictions that adhere to the hierarchical structure.

Developing the Traffic Accident Severity Models by Vehicle Type (차량유형에 따른 교통사고심각도 분석모형 개발)

  • Kim, Kyung-Hwan;Park, Byung-Ho
    • Journal of the Korean Society of Safety
    • /
    • v.25 no.3
    • /
    • pp.131-136
    • /
    • 2010
  • This study deals with the accident models of arterial link sections by vehicle type. The objectives are to analyze the characteristics of accidents, and to develop the models by type. In pursuing the above, this study uses the data of 414 accidents occurred on 24 major arterial links in 2007. The main results analyzed are as follows. First, the number of accidents is analyzed to account for about 47% in passenger car, 15% in SUV and 10% in trucks. Second, 3 Poisson regression models which are all statistically significant are developed using passenger car, SUV and truck as dependant variables. Finally, AADT and the number of traffic islands as common variables, and the number of pedestrian crossings, lanes, connecting roads, intersections(4-Leg), rate of medians and the number of bus stops as specific variables of the models are selected.