• Title/Summary/Keyword: Truncated Regression Model

Search Result 19, Processing Time 0.024 seconds

Simplicial Regression Depth with Censored and Truncated Data

  • Park, Jinho
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.1
    • /
    • pp.167-175
    • /
    • 2003
  • In this paper we develop a robust procedure to estimate regression coefficients for a linear model with censored and truncated data based on simplicial regression depth. Simplicial depth of a point is defined as the proportion of data simplices containing it. This simplicial depth can be extended to regression problem with censored and truncated data. Any line can be given a depth and the deepest regression line is the line with the maximum simplicial regression depth. We show how the proposed regression performs through analyzing AIDS incubation data.

Bayesian Analysis for a Functional Regression Model with Truncated Errors in Variables

  • Kim, Hea-Jung
    • Journal of the Korean Statistical Society
    • /
    • v.31 no.1
    • /
    • pp.77-91
    • /
    • 2002
  • This paper considers a functional regression model with truncated errors in explanatory variables. We show that the ordinary least squares (OLS) estimators produce bias in regression parameter estimates under misspecified models with ignored errors in the explanatory variable measurements, and then propose methods for analyzing the functional model. Fully parametric frequentist approaches for analyzing the model are intractable and thus Bayesian methods are pursued using a Markov chain Monte Carlo (MCMC) sampling based approach. Necessary theories involved in modeling and computation are provided. Finally, a simulation study is given to illustrate and examine the proposed methods.

Developing the Pedestrian Accident Models Using Tobit Model (토빗모형을 이용한 가로구간 보행자 사고모형 개발)

  • Lee, Seung Ju;Kim, Yun Hwan;Park, Byung Ho
    • International Journal of Highway Engineering
    • /
    • v.16 no.3
    • /
    • pp.101-107
    • /
    • 2014
  • PURPOSES : This study deals with the pedestrian accidents in case of Cheongju. The goals are to develop the pedestrian accident model. METHODS : To analyze the accident, count data models, truncated count data models and Tobit regression models are utilized in this study. The dependent variable is the number of accident. Independent variables are traffic volume, intersection geometric structure and the transportation facility. RESULTS : The main results are as follows. First, Tobit model was judged to be more appropriate model than other models. Also, these models were analyzed to be statistically significant. Second, such the main variables related to accidents as traffic volume, pedestrian volume, number of Entry/exit, number of crosswalk and bus stop were adopted in the above model. CONCLUSIONS : The optimal model for pedestrian accidents is evaluated to be Tobit model.

Efficient Score Estimation and Adaptive Rank and M-estimators from Left-Truncated and Right-Censored Data

  • Chul-Ki Kim
    • Communications for Statistical Applications and Methods
    • /
    • v.3 no.3
    • /
    • pp.113-123
    • /
    • 1996
  • Data-dependent (adaptive) choice of asymptotically efficient score functions for rank estimators and M-estimators of regression parameters in a linear regression model with left-truncated and right-censored data are developed herein. The locally adaptive smoothing techniques of Muller and Wang (1990) and Uzunogullari and Wang (1992) provide good estimates of the hazard function h and its derivative h' from left-truncated and right-censored data. However, since we need to estimate h'/h for the asymptotically optimal choice of score functions, the naive estimator, which is just a ratio of estimated h' and h, turns out to have a few drawbacks. An altermative method to overcome these shortcomings and also to speed up the algorithms is developed. In particular, we use a subroutine of the PPR (Projection Pursuit Regression) method coded by Friedman and Stuetzle (1981) to find the nonparametric derivative of log(h) for the problem of estimating h'/h.

  • PDF

Estimating the Economic Value of the Songieong Beach Using A Count Data Model: - Off-season Estimating Value of the Beach - (가산자료모형을 이용한 송정 해수욕장의 경제적 가치추정: - 비수기 해수욕장의 가치추정 -)

  • Heo, Yun-Jeong;Lee, Seung-Lae
    • The Journal of Fisheries Business Administration
    • /
    • v.38 no.2
    • /
    • pp.79-101
    • /
    • 2007
  • The purpose of this study is to estimate the economic value of the Songieong Beach in Off-season, using a Individual Travel Cost Model(ITCM). Songieong Beach is located in Busan but far away from city. These days, however, the increased rate of traffic inflow to the Songieong beach and the five-day working week are reflected in the trend analysis. Moreover, people have changed psychological value. For that reason, visitors are on the increase on the beach in off-season. The ITCM is applied to estimate non-market value or environmental Good like a Contingent Valuation Method and Hedonic Price Model etc. The ITCM was derived from the Count Data Model(i.e. Poisson and Negative Binomial model). So this paper compares Poisson and negative binomial count data models to measure the tourism demands. The data for the study were collected from the Songjeong Beach on visitors over the a week from November 1 through November 23, 2006. Interviewers were instructed to interview only individuals. So the sample was taken in 113. A dependent variable that is defined on the non-negative integers and subject to sampling truncation is the result of a truncated count data process. This paper analyzes the effects of determinants on visitors' demand for exhibition using a class of maximum-likelihood regression estimators for count data from truncated samples, The count data and truncated models are used primarily to explain non-negative integer and truncation properties of tourist trips as suggested by the economic valuation literature. The results suggest that the truncated negative binomial model is improved overdispersion problem and more preferred than the other models in the study. This paper is not the same as the others. One thing is that Estimating Value of the Beach in off-season. The other thing is this study emphasizes in particular 'travel cost' that is not only monetary cost but also including opportunity cost of 'travel time'. According to the truncated negative binomial model, estimates the Consumer Surplus(CS) values per trip of about 199,754 Korean won and the total economic value was estimated to be 1,288,680 Korean won.

  • PDF

An Efficiency Analysis of Public Enterprises Using Bootstrap DEA (부트스트랩 DEA를 이용한 공기업 효율성 분석)

  • Park, Man Hee
    • The Journal of the Korea Contents Association
    • /
    • v.15 no.5
    • /
    • pp.475-487
    • /
    • 2015
  • This study measures the managerial efficiency of Korea's 14 public enterprises using bootstrap DEA in 2013. In addition, it examines the factors that affect on the bootstrap bias-corrected efficiency using truncated regression analysis. The results and implications of this study are as follows. First, using bootstrap DEA model analysis, the results showed that the mean technical efficiency was 0.3182, the mean pure technical efficiency was 0.4994 and the mean scale efficiency was 0.6585. The main cause of technical inefficiency was due to pure technical inefficiency. Second, rank test between technical efficiency of general DEA model and bootstrap DEA model was no significant difference under CRS and VRS assumption. Third, the main cause of the inefficiency in 11 DMUs among 14 DMUs were mainly due to the pure technology and three DMUs were because of the scale efficiency. Finally, in the truncated regression analysis, cost of labor, profit, sales, return of equity, and the number of employees appeared as factors affecting the scale efficiency at the 10% significance level.

The skew-t censored regression model: parameter estimation via an EM-type algorithm

  • Lachos, Victor H.;Bazan, Jorge L.;Castro, Luis M.;Park, Jiwon
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.3
    • /
    • pp.333-351
    • /
    • 2022
  • The skew-t distribution is an attractive family of asymmetrical heavy-tailed densities that includes the normal, skew-normal and Student's-t distributions as special cases. In this work, we propose an EM-type algorithm for computing the maximum likelihood estimates for skew-t linear regression models with censored response. In contrast with previous proposals, this algorithm uses analytical expressions at the E-step, as opposed to Monte Carlo simulations. These expressions rely on formulas for the mean and variance of a truncated skew-t distribution, and can be computed using the R library MomTrunc. The standard errors, the prediction of unobserved values of the response and the log-likelihood function are obtained as a by-product. The proposed methodology is illustrated through the analyses of simulated and a real data application on Letter-Name Fluency test in Peruvian students.

A Bayesian Method for Narrowing the Scope fo Variable Selection in Binary Response t-Link Regression

  • Kim, Hea-Jung
    • Journal of the Korean Statistical Society
    • /
    • v.29 no.4
    • /
    • pp.407-422
    • /
    • 2000
  • This article is concerned with the selecting predictor variables to be included in building a class of binary response t-link regression models where both probit and logistic regression models can e approximately taken as members of the class. It is based on a modification of the stochastic search variable selection method(SSVS), intended to propose and develop a Bayesian procedure that used probabilistic considerations for selecting promising subsets of predictor variables. The procedure reformulates the binary response t-link regression setup in a hierarchical truncated normal mixture model by introducing a set of hyperparameters that will be used to identify subset choices. In this setup, the most promising subset of predictors can be identified as that with highest posterior probability in the marginal posterior distribution of the hyperparameters. To highlight the merit of the procedure, an illustrative numerical example is given.

  • PDF

A Study on the Efficiency and Its Determinants in Korea's Service Sectors Using DEA (자료포락분석(DEA)를 이용한 우리나라 서비스산업의 효율성과 결정요인 분석)

  • Bae, Se-Young
    • Journal of Digital Convergence
    • /
    • v.19 no.10
    • /
    • pp.339-348
    • /
    • 2021
  • This paper aims to analyze the production efficiency in Korea's ten service sectors using DEA and its determinants utilizing a truncated-Tobit regression model and a censored-Tobit regression model in 2010-2019. This paper found: First, the Korean service sector's production efficiency in general has been significantly low and polarized. Especially, the inefficiency resulted from the scale inefficiency in the 'sewerage waste management industry.' Second, in the determinants analysis, the results show the positive effect of the investment and R&D expenses on technical efficiency, while FDI and lobbying expenses illustrate the negative impact. Moreover, it seems that the larger the industry, the higher the efficiency. Thus, the future Korean government's economic policy for the service sectors requires a mixed and integrated policy of the macroeconomic aspect such as active investment and R&D activities with microeconomic aspect including a convergence of FDI and human capital.

Restricted maximum likelihood estimation of a censored random effects panel regression model

  • Lee, Minah;Lee, Seung-Chun
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.4
    • /
    • pp.371-383
    • /
    • 2019
  • Panel data sets have been developed in various areas, and many recent studies have analyzed panel, or longitudinal data sets. Maximum likelihood (ML) may be the most common statistical method for analyzing panel data models; however, the inference based on the ML estimate will have an inflated Type I error because the ML method tends to give a downwardly biased estimate of variance components when the sample size is small. The under estimation could be severe when data is incomplete. This paper proposes the restricted maximum likelihood (REML) method for a random effects panel data model with a censored dependent variable. Note that the likelihood function of the model is complex in that it includes a multidimensional integral. Many authors proposed to use integral approximation methods for the computation of likelihood function; however, it is well known that integral approximation methods are inadequate for high dimensional integrals in practice. This paper introduces to use the moments of truncated multivariate normal random vector for the calculation of multidimensional integral. In addition, a proper asymptotic standard error of REML estimate is given.