• Title/Summary/Keyword: robust regression model

Model-Robust G-Efficient Cuboidal Experimental Designs (입방형 영역에서의 G-효율이 높은 Model-Robust 실험설계)

  • Park, You-Jin;Yi, Yoon-Ju
    • IE interfaces / v.23 no.2 / pp.118-125 / 2010
  • Determining the regression model is an important step in the statistical design of experiments. The exact regression model is generally unknown, so experimenters assume that a certain model form will be fit, select an experimental design suited to that predetermined form, and conduct the experiment. However, the initially chosen model may be incorrect, which can lead to undesirable statistical properties. We develop model-robust experimental designs that have stable prediction variance for a family of candidate regression models over a cuboidal region, using genetic algorithms and the desirability function method. We then compare the stability of the prediction variance of these model-robust designs with that of the 3-level face-centered cube. The model-robust designs have moderately high G-efficiencies for all candidate models that the experimenter may wish to fit, and they outperform the cuboidal design for the second-order model. G-efficiencies are reported for both the model-robust designs and the face-centered cube.
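As a rough illustration of the G-efficiency criterion mentioned in this abstract, the sketch below uses the common definition: the number of model parameters p divided by the maximum scaled prediction variance N f(x)'(X'X)^{-1} f(x) over the design region. It evaluates a two-factor face-centered cube (the 3x3 factorial) under a full second-order model on a grid over the square [-1, 1]^2. The function names, the two-factor setting, and the grid search are assumptions made for illustration; the paper's genetic-algorithm search and desirability function are not reproduced here.

```python
from itertools import product


def model_row(x1, x2):
    """Full second-order model terms in two factors (illustrative choice)."""
    return [1.0, x1, x2, x1 * x2, x1 * x1, x2 * x2]


def solve(a, b):
    """Solve a @ z = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [u - f * v for u, v in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def g_efficiency(design, grid_steps=21):
    """p / max scaled prediction variance, searched on a grid over [-1, 1]^2."""
    rows = [model_row(*pt) for pt in design]
    n, p = len(rows), len(rows[0])
    # Information matrix X'X for the assumed model.
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    ticks = [-1 + 2 * t / (grid_steps - 1) for t in range(grid_steps)]
    max_spv = 0.0
    for x1, x2 in product(ticks, repeat=2):
        f = model_row(x1, x2)
        z = solve(xtx, f)  # z = (X'X)^{-1} f
        spv = n * sum(fi * zi for fi, zi in zip(f, z))
        max_spv = max(max_spv, spv)
    return p / max_spv


# Two-factor face-centered cube: 4 corners, 4 face centers, 1 center point.
fcc = list(product([-1.0, 0.0, 1.0], repeat=2))
fcc_g_eff = g_efficiency(fcc)
print(round(100 * fcc_g_eff, 1), "% G-efficiency")
```

A grid search only approximates the maximum over the region; for this small design the largest scaled prediction variance occurs at the corners, which the grid includes.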

Usage of auxiliary variable and neural network in doubly robust estimation

  • Park, Hyeonah;Park, Wonjun
    • Journal of the Korean Data and Information Science Society / v.24 no.3 / pp.659-667 / 2013
  • If either the regression model or the propensity model is correct, the estimator based on doubly robust imputation is guaranteed to be unbiased. When a neural network is used instead of a logistic regression model for the propensity model, the estimators based on doubly robust imputation are approximately unbiased even when both assumed models fail. We also propose a doubly robust estimator of ratio form that uses population information on an auxiliary variable. We verify some properties of the proposed estimators through limited simulations.
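The double-robustness idea in this abstract can be sketched with the standard augmented inverse-probability-weighted form of the mean estimator: average the regression predictions, then add the inverse-propensity-weighted residuals of the observed units. This is a generic textbook form and not necessarily the exact imputation estimator of the paper; the function name, the toy data, and the assumption that fitted values `m_hat` and propensities `pi_hat` are already available are all illustrative.

```python
def doubly_robust_mean(y, observed, m_hat, pi_hat):
    """Augmented IPW estimate of the population mean with missing responses.

    y        : responses (entries for unobserved units are ignored)
    observed : 1 if the response was observed, 0 otherwise
    m_hat    : regression-model predictions for every unit
    pi_hat   : estimated response propensities for every unit
    """
    total = 0.0
    for yi, di, mi, pi in zip(y, observed, m_hat, pi_hat):
        total += mi                   # regression prediction for every unit
        if di:
            total += (yi - mi) / pi   # bias correction from observed units
    return total / len(y)


# Toy data (hypothetical): the third response is missing.
y = [1.0, 2.0, None, 4.0]
observed = [1, 1, 0, 1]
m_hat = [1.0, 2.0, 3.0, 4.0]      # outcome model happens to fit perfectly
pi_hat = [0.5, 0.9, 0.2, 0.7]     # propensities may be badly misspecified
dr_estimate = doubly_robust_mean(y, observed, m_hat, pi_hat)
print(dr_estimate)
```

In this toy case the residuals of the observed units are all zero, so the estimate equals the mean of the predictions whatever the propensities are, which is one half of the double-robustness property.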

ROBUST FUZZY LINEAR REGRESSION BASED ON M-ESTIMATORS

  • SOHN BANG-YONG
    • Journal of applied mathematics & informatics / v.18 no.1_2 / pp.591-601 / 2005
  • The results of fuzzy linear regression are very sensitive to irregular data. When such points exist in a data set, a fuzzy linear regression model can be interpreted incorrectly. The purpose of this paper is to detect irregular data and to propose a robust fuzzy linear regression based on M-estimators with triangular fuzzy regression coefficients for crisp input-output data. A numerical example shows that irregular data can be detected by using the residuals based on M-estimators, and that the proposed robust fuzzy linear regression is very resistant to these points.
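To make the residual-based detection step concrete, here is a minimal sketch of a Huber M-estimator for an ordinary (crisp, non-fuzzy) straight-line fit, computed by iteratively reweighted least squares. The fuzzy coefficients of the paper are not modeled; the function names, the tuning constant 1.345, the scale estimate median(|r|)/0.6745, and the data are assumptions chosen for illustration.

```python
from statistics import median


def weighted_line(x, y, w):
    """Weighted least-squares fit of y = a + b*x; returns (a, b)."""
    sw = sum(w)
    xbar = sum(wi * xi for wi, xi in zip(w, x)) / sw
    ybar = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxx = sum(wi * (xi - xbar) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - xbar) * (yi - ybar) for wi, xi, yi in zip(w, x, y))
    b = sxy / sxx
    return ybar - b * xbar, b


def huber_line(x, y, c=1.345, iterations=50):
    """Huber M-estimate of a line via iteratively reweighted least squares."""
    w = [1.0] * len(x)
    for _ in range(iterations):
        a, b = weighted_line(x, y, w)
        r = [yi - a - b * xi for xi, yi in zip(x, y)]
        scale = median(abs(ri) for ri in r) / 0.6745  # robust residual scale
        if scale == 0:
            break
        # Huber weights: full weight for small residuals, reduced otherwise.
        w = [1.0 if abs(ri) <= c * scale else c * scale / abs(ri) for ri in r]
    return a, b, r


# Hypothetical data near y = 1 + 2x, with one irregular point at index 4.
x = list(range(10))
noise = [0.1, -0.2, 0.15, -0.1, 0.05, -0.15, 0.2, -0.05, 0.1, -0.1]
y = [1 + 2 * xi + ei for xi, ei in zip(x, noise)]
y[4] += 30

a, b, residuals = huber_line(x, y)
suspect = max(range(len(x)), key=lambda i: abs(residuals[i]))
print(f"intercept={a:.2f}, slope={b:.2f}, largest residual at index {suspect}")
```

The observation with the largest robust residual is the natural candidate for an irregular point, which mirrors the detection idea described in the abstract.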

Statistical Matching Techniques Using the Robust Regression Model (로버스트 회귀모형을 이용한 자료결합방법)

  • Jhun, Myoung-Shic;Jung, Ji-Song;Park, Hye-Jin
    • The Korean Journal of Applied Statistics / v.21 no.6 / pp.981-996 / 2008
  • Statistical matching techniques aim to construct a complete data file from different sources. Because the statistical matching method proposed by Rubin (1986) assumes multivariate normality of the data, applying it to data that violate this assumption can be problematic. This study proposes a statistical matching method that uses robust regression as an alternative to linear regression. We also carry out a simulation study to compare the performance of the robust regression model and the linear regression model for statistical matching.

Robust inference for linear regression model based on weighted least squares

  • Park, Jin-Pyo
    • Journal of the Korean Data and Information Science Society / v.13 no.2 / pp.271-284 / 2002
  • In this paper we consider robust inference for the parameters of a linear regression model based on weighted least squares. First, we consider a sequential test for multiple outliers. Next, we suggest a way to assign a weight to each observation $(x_i,\;y_i)$ and recommend a robust inference procedure for the linear model. Finally, to check the performance of the confidence interval for the slope under the proposed method, we conduct a Monte Carlo simulation and present numerical results and examples.

Robust extreme quantile estimation for Pareto-type tails through an exponential regression model

  • Richard Minkah;Tertius de Wet;Abhik Ghosh;Haitham M. Yousof
    • Communications for Statistical Applications and Methods / v.30 no.6 / pp.531-550 / 2023
  • The estimation of extreme quantiles is one of the main objectives of the statistics of extremes (which deals with the estimation of rare events). In this paper, a robust estimator of an extreme quantile of a heavy-tailed distribution is considered. The estimator is obtained through the minimum density power divergence criterion applied to an exponential regression model. The proposed estimator is compared in a simulation study with two extreme quantile estimators from the literature. The results show that the proposed estimator is stable with respect to the choice of the number of top order statistics and has smaller bias and mean square error than the existing extreme quantile estimators. Practical application of the proposed estimator is illustrated with data from the pedochemical and insurance industries.

Fast robust variable selection using VIF regression in large datasets (대형 데이터에서 VIF회귀를 이용한 신속 강건 변수선택법)

  • Seo, Han Son
    • The Korean Journal of Applied Statistics / v.31 no.4 / pp.463-473 / 2018
  • Variable selection algorithms for linear regression models on large data sets are considered. Many algorithms have been proposed that focus on speed and robustness. Among them, variance inflation factor (VIF) regression is fast and accurate because it uses a streamwise regression approach. However, VIF regression is susceptible to outliers because it estimates the model by the least-squares method. A robust criterion using a weighted estimator has been proposed to make the algorithm robust, and a robust VIF regression has also been proposed for the same purpose. In this article, a fast and robust variable selection method is suggested that applies VIF regression after detecting and removing potential outliers. A simulation study and an analysis of a dataset are conducted to compare the suggested method with other methods.

Parameter Calibration of Storage Function Model and Flood Forecasting (2) Comparative Study on the Flood Forecasting Methods (저류함수모형의 매개변수 보정과 홍수예측 (2) 홍수예측방법의 비교 연구)

  • Kim, Bum Jun;Song, Jae Hyun;Kim, Hung Soo;Hong, Il Pyo
    • KSCE Journal of Civil and Environmental Engineering Research / v.26 no.1B / pp.39-50 / 2006
  • In Korea, the flood control offices of the main rivers use a storage function model to forecast flood stage, and flood forecasting remains an active area of study. In this paper, the storage function model used by the flood control offices, regression models, and an artificial neural network model are applied to flood forecasting for the study watershed, and the results obtained by each method are compared. For the storage function model, this paper uses both the representative parameters of the flood control offices and optimized parameters. Regression coefficients are obtained by regression analysis, and the neural network is trained with the backpropagation algorithm on four events selected from 1995 to 2001. The results show that the optimized parameters are superior to the representative parameters for flood forecasting. The multiple, robust, and stepwise regression analyses all produce very good forecasts. Although the artificial neural network model is less accurate than the regression models, it can still be an efficient way to produce good forecasts.

A study on robust regression estimators in heteroscedastic error models

  • Son, Nayeong;Kim, Mijeong
    • Journal of the Korean Data and Information Science Society / v.28 no.5 / pp.1191-1204 / 2017
  • Weighted least squares (WLS) estimation is often used for data with heteroscedastic errors because it is intuitive and computationally inexpensive. However, the WLS estimator is sensitive to even a few outliers and can sometimes be inefficient. To overcome the robustness problem, the Box-Cox transformation, Huber's M estimation, bisquare estimation, and Yohai's MM estimation have been proposed. In addition, estimators more efficient than WLS have been suggested for heteroscedastic error models, such as Bayesian methods (Cepeda and Achcar, 2009) and semiparametric methods (Kim and Ma, 2012). Recently, Çelik (2015) proposed weighting methods applicable to heteroscedasticity patterns that include butterfly-distributed residuals and megaphone-shaped residuals. In this paper, we review heteroscedastic regression estimators related to robust or efficient estimation and describe their properties. We also analyze cost data of U.S. Electricity Producers in 1955 using the methods discussed in the paper.
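Two of the ingredients named in this abstract can be sketched briefly: WLS for a straight line with weights equal to the inverse error variances, and the Tukey bisquare weight function used in bisquare estimation. The function names, the variance pattern (standard deviation proportional to x, a megaphone shape), the tuning constant 4.685, and the data are assumptions for illustration; none of this reproduces the specific weighting schemes reviewed in the paper.

```python
def wls_line(x, y, variances):
    """Weighted least-squares fit of y = a + b*x with weights 1/variance."""
    w = [1.0 / v for v in variances]
    sw = sum(w)
    xbar = sum(wi * xi for wi, xi in zip(w, x)) / sw
    ybar = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxx = sum(wi * (xi - xbar) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - xbar) * (yi - ybar) for wi, xi, yi in zip(w, x, y))
    slope = sxy / sxx
    return ybar - slope * xbar, slope


def bisquare_weight(u, c=4.685):
    """Tukey bisquare weight for a standardized residual u."""
    if abs(u) >= c:
        return 0.0          # gross outliers are rejected entirely
    return (1.0 - (u / c) ** 2) ** 2


# Hypothetical megaphone-shaped errors: standard deviation proportional to x.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [3.1, 4.8, 7.4, 8.5, 12.0, 11.6]
variances = [xi ** 2 for xi in x]
intercept, slope = wls_line(x, y, variances)
print(f"WLS fit: intercept={intercept:.3f}, slope={slope:.3f}")
print([round(bisquare_weight(u), 3) for u in (0.0, 2.0, 5.0)])
```

Observations with small x get the most weight here because their assumed error variance is smallest, while the bisquare function shows how a robust fit would instead down-weight observations according to the size of their residuals.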

Fuzzy Theil regression Model (Theil방법을 이용한 퍼지회귀모형)

  • Yoon, Jin Hee;Lee, Woo-Joo;Choi, Seung-Hoe
    • Journal of the Korean Institute of Intelligent Systems / v.23 no.4 / pp.366-370 / 2013
  • Regression analysis is a method for analyzing a regression model that explains the statistical relationship between explanatory and response variables. This paper introduces Theil's method for finding a fuzzy regression model that explains this relationship. Theil's method is a robust method that is not sensitive to outliers. To estimate the coefficients of the fuzzy regression model, it uses medians of the rates of increment computed from randomly chosen pairs of the components of the ${\alpha}$-level sets of the fuzzy data. We present an example showing that Theil's estimator is more robust than the least squares estimator.
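The median-of-slopes idea behind Theil's method can be sketched for ordinary crisp data as follows. The slope is the median of the pairwise rates of increment, and the intercept is taken as the median of y - slope*x. Unlike the paper, this sketch uses all pairs rather than randomly chosen ones and does not handle the α-level sets of fuzzy data; the function names and the data are assumptions for illustration.

```python
from itertools import combinations
from statistics import median


def theil_line(x, y):
    """Theil estimate of y = a + b*x from the medians of pairwise slopes."""
    slopes = [
        (y[j] - y[i]) / (x[j] - x[i])
        for i, j in combinations(range(len(x)), 2)
        if x[j] != x[i]             # skip pairs with an undefined slope
    ]
    b = median(slopes)
    a = median(yi - b * xi for xi, yi in zip(x, y))
    return a, b


def ols_line(x, y):
    """Ordinary least-squares fit, for comparison."""
    n = len(x)
    xbar, ybar = sum(x) / n, sum(y) / n
    b = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) / sum(
        (xi - xbar) ** 2 for xi in x
    )
    return ybar - b * xbar, b


# Hypothetical data on the line y = 1 + 2x, except for one gross outlier.
x = [0, 1, 2, 3, 4, 5]
y = [1, 3, 5, 7, 9, 100]

theil_a, theil_b = theil_line(x, y)
ols_a, ols_b = ols_line(x, y)
print("Theil:", theil_a, theil_b)
print("OLS:  ", round(ols_a, 2), round(ols_b, 2))
```

Because most of the pairwise slopes are unaffected by the single outlier, the median stays at the slope of the underlying line, whereas the least squares slope is pulled strongly toward the outlier.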