• Title/Summary/Keyword: multiple outliers


A Score test for Detection of Outliers in Nonlinear Regression

  • Kahng, Myung-Wook
    • Journal of the Korean Statistical Society, v.22 no.2, pp.201-208, 1993
  • Under a mean shift outlier model, the score test for multiple outliers in nonlinear regression is discussed as an alternative to the likelihood ratio test, and a geometric interpretation of the score statistic is presented. A rough sketch of a score-type statistic follows this entry.

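As a rough illustration of the mean shift outlier model, not Kahng's derivation, the sketch below computes a score-type statistic for a set of suspected outliers in a nonlinear fit, using the Jacobian of the fitted model as a local linear approximation. The exponential model, the data, and the suspect indices are invented for the example.

```python
# Score-type statistic for suspected outliers under a mean-shift model,
# using the Jacobian of the fitted nonlinear model as a local linear approximation.
import numpy as np
from scipy.optimize import curve_fit

def model(x, a, b):
    return a * np.exp(b * x)

rng = np.random.default_rng(0)
x = np.linspace(0, 2, 40)
y = model(x, 2.0, 1.1) + rng.normal(scale=0.3, size=x.size)
y[[10, 25]] += 4.0                                # plant two mean-shift outliers

theta, _ = curve_fit(model, x, y, p0=[1.0, 1.0])
resid = y - model(x, *theta)

# Numerical Jacobian of the mean function with respect to the parameters
eps = 1e-6
J = np.column_stack([
    (model(x, theta[0] + eps, theta[1]) - model(x, *theta)) / eps,
    (model(x, theta[0], theta[1] + eps) - model(x, *theta)) / eps,
])
H = J @ np.linalg.inv(J.T @ J) @ J.T              # approximate hat matrix
sigma2 = resid @ resid / (x.size - len(theta))

suspects = [10, 25]                               # indices under test (assumed given)
M = (np.eye(x.size) - H)[np.ix_(suspects, suspects)]
stat = resid[suspects] @ np.linalg.solve(M, resid[suspects]) / sigma2
print(f"score-type statistic: {stat:.2f}")        # compare with a chi-square(2) quantile
```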

Robust multiple imputation method for missings with boundary and outliers (한계와 이상치가 있는 결측치의 로버스트 다중대체 방법)

  • Park, Yousung;Oh, Do Young;Kwon, Tae Yeon
    • The Korean Journal of Applied Statistics, v.32 no.6, pp.889-898, 2019
  • Imputing missing values in survey variables becomes complicated when outliers and logical boundary conditions relating the items cannot be ignored. If a variable with missing values also has outliers and boundaries, values imputed by conventional regression-based methods are likely to be biased and to violate the boundary conditions. In this paper we address these difficulties by combining various robust regression models with multiple imputation methods. Through a simulation study covering various scenarios of outliers and boundaries, we identify and discuss the best-performing combination of robust regression and multiple imputation. A rough sketch of such a combination follows this entry.
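
A minimal sketch of one such combination, not the authors' exact procedure: a Huber regression fitted to the observed cases generates multiple imputations whose draws are clipped to an assumed logical boundary. The data, the boundary, and the number of imputations (M = 5) are invented for the example.

```python
# Robust-regression-based multiple imputation with a boundary constraint (sketch).
import numpy as np
from sklearn.linear_model import HuberRegressor

rng = np.random.default_rng(1)
n = 200
x = rng.normal(size=(n, 1))
y = 3 + 2 * x[:, 0] + rng.normal(scale=1.0, size=n)
y[:5] += 15                                   # outliers in the observed part
lower, upper = 0.0, 25.0                      # assumed logical boundary for y
y = np.clip(y, lower, upper)
miss = rng.random(n) < 0.2                    # about 20% item nonresponse

def impute_once(x, y, miss):
    fit = HuberRegressor().fit(x[~miss], y[~miss])                # robust to the outliers
    resid = y[~miss] - fit.predict(x[~miss])
    scale = 1.4826 * np.median(np.abs(resid - np.median(resid)))  # robust sigma
    draws = fit.predict(x[miss]) + rng.normal(scale=scale, size=miss.sum())
    return np.clip(draws, lower, upper)       # respect the boundary condition

imputations = [impute_once(x, y, miss) for _ in range(5)]         # M = 5 imputed data sets
print("pooled mean of imputed values:", np.mean([imp.mean() for imp in imputations]))
```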

Robust Generalized Labeled Multi-Bernoulli Filter and Smoother for Multiple Target Tracking using Variational Bayesian

  • Li, Peng;Wang, Wenhui;Qiu, Junda;You, Congzhe;Shu, Zhenqiu
    • KSII Transactions on Internet and Information Systems (TIIS), v.16 no.3, pp.908-928, 2022
  • Multiple target tracking focuses on tracking an unknown number of targets in complex environments with clutter and missed detections. The generalized labeled multi-Bernoulli (GLMB) filter has been shown to be an effective approach and has attracted extensive attention. However, in scenarios where the clutter rate is high or measurement outliers occur frequently, the performance of the GLMB filter declines significantly because its Gaussian-based likelihood function is sensitive to clutter. To solve this problem, this paper presents a robust GLMB filter and smoother that improve tracking performance under high clutter rates, low detection probability, and measurement outliers. First, a Student-t distribution variational Bayesian (TDVB) filtering technique is employed to update the targets' states. Then, the likelihood weight used in the tracking process is re-derived. Finally, a trajectory smoothing method is proposed to improve overall tracking performance. The proposed method is compared with recent multiple target tracking filters, and the simulation results show that it effectively improves tracking accuracy in scenarios with a high clutter rate, low detection rate, and measurement outliers. Code is published on GitHub. A toy illustration of the Student-t down-weighting idea follows this entry.
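
A toy illustration of the underlying intuition only, not the paper's GLMB/TDVB update: when measurement noise is modeled with a Student-t distribution, each measurement carries a latent precision weight w = (ν + 1) / (ν + r²/σ²), so a large residual r (an outlier) is strongly down-weighted in the state update, unlike the constant weight implied by a Gaussian model. The degrees of freedom ν = 3 and the residuals are arbitrary choices.

```python
# Student-t down-weighting of outlying residuals (toy sketch).
import numpy as np

nu = 3.0                                  # assumed degrees of freedom
sigma = 1.0                               # nominal measurement standard deviation
residuals = np.array([0.2, 0.9, 6.0])     # the last residual is an outlier

weights = (nu + 1.0) / (nu + (residuals / sigma) ** 2)
for r, w in zip(residuals, weights):
    print(f"residual = {r:4.1f}   effective weight = {w:.3f}")
```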

Robust inference for linear regression model based on weighted least squares

  • Park, Jin-Pyo
    • Journal of the Korean Data and Information Science Society, v.13 no.2, pp.271-284, 2002
  • In this paper we consider robust inference for the parameters of a linear regression model based on weighted least squares. First we consider a sequential test for multiple outliers. Next we suggest a way to assign a weight to each observation $(x_i,\;y_i)$ and recommend the resulting robust inference for the linear model. Finally, to check the performance of the confidence interval for the slope obtained with the proposed method, we conduct a Monte Carlo simulation and present numerical results and examples. A minimal weighted-least-squares sketch follows this entry.

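A minimal sketch in the same spirit, not Park's exact weighting scheme: Huber-type weights computed from an initial least squares fit feed a weighted least squares refit, and the confidence interval for the slope comes from statsmodels. The data and the tuning constant c = 1.345 are standard illustrative choices.

```python
# Weighted least squares with robustness weights, plus a slope confidence interval.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
x = rng.uniform(0, 10, 60)
y = 1.0 + 0.5 * x + rng.normal(scale=1.0, size=60)
y[:3] += 10                                    # a few outliers

X = sm.add_constant(x)
ols = sm.OLS(y, X).fit()
scale = 1.4826 * np.median(np.abs(ols.resid - np.median(ols.resid)))  # robust sigma
u = ols.resid / scale
c = 1.345                                      # Huber tuning constant
w = np.where(np.abs(u) <= c, 1.0, c / np.abs(u))   # down-weight large residuals

wls = sm.WLS(y, X, weights=w).fit()
print("95% CI for the slope:", wls.conf_int()[1])
```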

Outlier Identification in Regression Analysis using Projection Pursuit

  • Kim, Hyojung;Park, Chongsun
    • Communications for Statistical Applications and Methods, v.7 no.3, pp.633-641, 2000
  • In this paper we propose a method to identify multiple outliers in regression analysis under the sole assumption that the regression function is smooth. Our method uses the single-linkage clustering algorithm and Projection Pursuit Regression (PPR). Compared with existing methods on several simulated and real examples, it proves very useful in regression problems where the regression function is far from linear. A rough sketch of the clustering step follows this entry.

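A rough sketch of the clustering step only: residuals from a flexible fit are grouped by single-linkage clustering and the smaller group is flagged as outlying. A LOWESS smoother stands in here for the paper's projection pursuit regression, and the two-cluster cut is an assumption of the example.

```python
# Single-linkage clustering of residuals from a flexible (nonlinear) fit.
import numpy as np
import statsmodels.api as sm
from scipy.cluster.hierarchy import linkage, fcluster

rng = np.random.default_rng(3)
x = np.sort(rng.uniform(0, 10, 80))
y = np.sin(x) + rng.normal(scale=0.2, size=80)
y[[15, 40, 41]] += 3.0                            # planted outliers

fitted = sm.nonparametric.lowess(y, x, frac=0.3, return_sorted=False)
resid = y - fitted

Z = linkage(resid.reshape(-1, 1), method="single")
labels = fcluster(Z, t=2, criterion="maxclust")   # split residuals into two groups
minority = labels == (np.argmin(np.bincount(labels)[1:]) + 1)
print("flagged indices:", np.where(minority)[0])
```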

Robust Estimation and Outlier Detection

  • Myung Geun Kim
    • Communications for Statistical Applications and Methods, v.1 no.1, pp.33-40, 1994
  • The conditional expectation of a component of a multivariate normal random vector is a multiple linear regression on its predecessors. Using this fact, the least median of squares estimation method developed for multiple linear regression is adapted to multivariate data to identify influential observations. The resulting method clearly detects outliers and avoids the masking effect. A bare-bones LMS sketch follows this entry.

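A bare-bones sketch of least median of squares (LMS) fitting by random two-point subsets for a single simple regression; the paper applies this kind of LMS fit to each conditional regression of a component on its predecessors. The data, the number of trials, and the 2.5 cutoff are illustrative choices.

```python
# Least median of squares for a line by random elemental subsets (sketch).
import numpy as np

def lms_line(x, y, n_trials=500, seed=0):
    rng = np.random.default_rng(seed)
    best = None
    for _ in range(n_trials):
        i, j = rng.choice(len(x), size=2, replace=False)   # exact fit to two points
        if x[i] == x[j]:
            continue
        b = (y[j] - y[i]) / (x[j] - x[i])
        a = y[i] - b * x[i]
        med = np.median((y - a - b * x) ** 2)              # median squared residual
        if best is None or med < best[0]:
            best = (med, a, b)
    return best[1], best[2]

rng = np.random.default_rng(4)
x = rng.uniform(0, 10, 50)
y = 2 + 0.7 * x + rng.normal(scale=0.5, size=50)
y[:10] += 8                                                # a cluster of outliers

a, b = lms_line(x, y)
resid = y - a - b * x
cut = 2.5 * 1.4826 * np.median(np.abs(resid))              # robust residual cutoff
print("flagged indices:", np.where(np.abs(resid) > cut)[0])
```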

V-mask Type Criterion for Identification of Outliers In Logistic Regression

  • Kim Bu-Yong
    • Communications for Statistical Applications and Methods, v.12 no.3, pp.625-634, 2005
  • A procedure is proposed to identify multiple outliers in logistic regression. It detects leverage points by hierarchical clustering of robust distances based on the minimum covariance determinant estimator, and then employs a V-mask type criterion on the scatter plot of robust residuals against robust distances to classify the observations into vertical outliers, bad leverage points, good leverage points, and regular points. The effectiveness of the proposed procedure is evaluated on classic and artificial data sets, and it is shown that the procedure deals very well with masking and swamping effects. A rough sketch of the classification idea follows this entry.
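
A rough sketch of the classification idea, with plain cutoffs standing in for the paper's V-mask criterion and an ordinary logistic fit standing in for its robust residuals: MCD-based robust distances flag leverage points in the predictor space, and large Pearson residuals flag vertical outliers. The data and the cutoffs (chi-square 0.975 quantile, |residual| > 2.5) are invented for the example.

```python
# Leverage/outlier classification from robust distances and residuals (sketch).
import numpy as np
from scipy.stats import chi2
from sklearn.covariance import MinCovDet
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(5)
X = rng.normal(size=(100, 2))
X[:3] += 6                                            # leverage points in x-space
p = 1 / (1 + np.exp(-(0.8 * X[:, 0] - 0.5 * X[:, 1])))
y = rng.binomial(1, p)

rd = np.sqrt(MinCovDet(random_state=0).fit(X).mahalanobis(X))   # robust distances
phat = LogisticRegression().fit(X, y).predict_proba(X)[:, 1]
pearson = (y - phat) / np.sqrt(phat * (1 - phat))               # Pearson residuals

lev = rd > np.sqrt(chi2.ppf(0.975, df=X.shape[1]))
out = np.abs(pearson) > 2.5
print("bad leverage points :", np.where(lev & out)[0])
print("good leverage points:", np.where(lev & ~out)[0])
print("vertical outliers   :", np.where(~lev & out)[0])
```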

Identification of Regression Outliers Based on Clustering of LMS-residual Plots

  • Kim, Bu-Yong;Oh, Mi-Hyun
    • Communications for Statistical Applications and Methods, v.11 no.3, pp.485-494, 2004
  • An algorithm is proposed to identify multiple outliers in linear regression. It is based on clustering the residuals from least median of squares estimation. A cut-height criterion for the hierarchical cluster tree is suggested, which yields an optimal clustering of the regression outliers. The effectiveness of the procedures is compared on classic and artificial data sets, and the proposed algorithm is shown to be superior to one based on least squares estimation; in particular, it deals very well with masking and swamping effects while the other does not. A sketch of the clustering-and-cut step follows this entry.
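
A sketch of the clustering-and-cut step, with a Huber M-estimator fit standing in for the paper's LMS fit and a fixed cut height standing in for its cut-height criterion: standardized robust residuals are clustered hierarchically, the tree is cut, and the small cluster far from the bulk is declared outlying. The data and the cut height of 3 are invented for the example.

```python
# Hierarchical clustering of robust residuals with a fixed cut height (sketch).
import numpy as np
import statsmodels.api as sm
from scipy.cluster.hierarchy import linkage, fcluster

rng = np.random.default_rng(6)
x = rng.uniform(0, 10, 60)
y = 1 + 2 * x + rng.normal(scale=1.0, size=60)
y[:6] -= 12                                        # a group of outliers

rlm = sm.RLM(y, sm.add_constant(x), M=sm.robust.norms.HuberT()).fit()
r = rlm.resid / rlm.scale                          # standardized robust residuals

Z = linkage(r.reshape(-1, 1), method="single")
labels = fcluster(Z, t=3.0, criterion="distance")  # cut the tree at height 3
counts = np.bincount(labels)[1:]
print("flagged indices:", np.where(labels == np.argmin(counts) + 1)[0])
```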

An Outlier Data Analysis using Support Vector Regression (Support Vector Regression을 이용한 이상치 데이터분석)

  • Jun, Sung-Hae
    • Journal of the Korean Institute of Intelligent Systems, v.18 no.6, pp.876-880, 2008
  • Outliers are observations that are much larger or smaller than most observations in a given data set and can arise from various sources. The result of an analysis may depend heavily on them, so data are usually analyzed after removing outliers. In data mining applications such as fraud detection and intrusion detection, however, outliers are kept in the training data because they carry crucial information. In simple and multiple regression models, outliers are typically eliminated from the training data using standardized and studentized residuals in order to build a good model. In this paper we instead use support vector regression (SVR), based on statistical learning theory, to analyze regression data that contain outliers. We verify the improved performance of our approach through experiments on synthetic data sets. A minimal SVR sketch follows this entry.
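
A minimal sketch of the idea: an ε-insensitive support vector regression is fitted directly to training data that still contains outliers and compared with ordinary least squares. The data and the SVR hyperparameters (C = 1, ε = 0.5) are arbitrary choices.

```python
# SVR versus ordinary least squares on training data that keeps its outliers.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.svm import SVR

rng = np.random.default_rng(7)
X = rng.uniform(0, 10, (80, 1))
y = 3 + 1.5 * X[:, 0] + rng.normal(scale=0.5, size=80)
y[:5] += 20                                   # outliers kept in the training set

svr = SVR(kernel="linear", C=1.0, epsilon=0.5).fit(X, y)
ols = LinearRegression().fit(X, y)
print("SVR slope:", svr.coef_[0][0], "  OLS slope:", ols.coef_[0])  # true slope is 1.5
```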

Unmasking Multiple Outliers in Multivariate Data

  • Yoo Jong-Young
    • Communications for Statistical Applications and Methods, v.13 no.1, pp.29-38, 2006
  • We propose a procedure for detecting multiple outliers in multivariate data. Rousseeuw and van Zomeren (1990) suggested the robust distance $RD_i$ computed with a resampling algorithm, but $RD_i$ is based on the assumption that X is in general position (X is said to be in general position when every subsample of size p+1 has rank p). From a practical point of view this is clearly unrealistic. In this paper we propose a computing method for approximating the MVE that is not subject to this problem. The procedure is easy to compute and works well even when a subsample is singular or nearly singular. A crude resampling sketch follows this entry.
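
A crude sketch of the resampling idea, not Yoo's exact algorithm: draw (p+1)-point subsets, inflate each candidate ellipsoid until it covers about half of the data, keep the smallest volume, and simply skip singular subsamples instead of requiring the data to be in general position. The sample size, number of trials, and chi-square cutoffs are illustrative.

```python
# Resampling approximation to the minimum volume ellipsoid and robust distances (sketch).
import numpy as np
from scipy.stats import chi2

rng = np.random.default_rng(8)
n, p = 100, 3
X = rng.normal(size=(n, p))
X[:10] += 5                                     # a cluster of multivariate outliers

h = (n + p + 1) // 2                            # cover roughly half of the data
best = None
for _ in range(500):
    idx = rng.choice(n, size=p + 1, replace=False)
    mu = X[idx].mean(axis=0)
    S = np.cov(X[idx], rowvar=False)
    if np.linalg.matrix_rank(S) < p:            # singular subsample: just skip it
        continue
    d2 = np.einsum("ij,jk,ik->i", X - mu, np.linalg.inv(S), X - mu)
    m2 = np.sort(d2)[h - 1]                     # inflate the ellipsoid to cover h points
    volume = np.sqrt(np.linalg.det(S) * m2 ** p)
    if best is None or volume < best[0]:
        best = (volume, mu, S * m2 / chi2.ppf(0.5, p))   # consistency rescaling

_, mu, S = best
rd2 = np.einsum("ij,jk,ik->i", X - mu, np.linalg.inv(S), X - mu)
print("flagged indices:", np.where(rd2 > chi2.ppf(0.975, p))[0])
```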