Search | Korea Science

Influence Analysis of Constrained Regression Models

Kim, Myung-Geun
- Communications for Statistical Applications and Methods
- /
- v.14 no.2
- /
- pp.281-286
- /
- 2007
Cook's distance is generalized to the multiple linear regression with linear constraints on regression coefficients. It is used for identifying influential observations in constrained regression models. A numerical example is provided for illustration.
https://doi.org/10.5351/CKSS.2007.14.2.281 인용 PDF KSCI

The disparity profile of working conditions by the type of employment according to the economic sectors and occupations (임금근로자의 고용형태별 유해요인 노출 격차의 업종별 직종별 분포 특성)

Rhee, Kyung-Yong;Kim, Ki-Sik;Yoon, Young-Shik
- Journal of the Korea Safety Management & Science
- /
- v.15 no.4
- /
- pp.197-207
- /
- 2013
OSHA(Occupational Safety and Health Act) generally regulates employer's business principles in the workplace to maintain safety environment. This act has the fundamental purpose to protect employee's safety and health in the workplace by reducing industrial accidents. Authors tried to investigate the correlation between 'occupational injuries and illnesses' and level of regulation compliance using Survey on Current Status of Occupational Safety & Health data by the various statistical methods, such as generalized regression analysis, logistic regression analysis and poison regression analysis in order to compare the results of those methods. The results have shown that the significant affecting compliance factors were different among those statistical methods. This means that specific interpretation should be considered based on each statistical method. In the future, relevant statistical technique will be developed considering the distribution type of occupational injuries.
https://doi.org/10.12812/ksms.2013.15.4.197 인용 PDF KSCI

Multivariate Statistical Analysis and Prediction for the Flash Points of Binary Systems Using Physical Properties of Pure Substances (순수 성분의 물성 자료를 이용한 2성분계 혼합물의 인화점에 대한 다변량 통계 분석 및 예측)

Lee, Bom-Sock;Kim, Sung-Young
- Journal of the Korean Institute of Gas
- /
- v.11 no.3
- /
- pp.13-18
- /
- 2007
The multivariate statistical analysis, using the multiple linear regression(MLR), have been applied to analyze and predict the flash points of binary systems. Prediction for the flash points of flammable substances is important for the examination of the fire and explosion hazards in the chemical process design. In this paper, the flash points are predicted by MLR based on the physical properties of pure substances and the experimental flash points data. The results of regression and prediction by MLR are compared with the values calculated by Raoult's law and Van Laar equation.
PDF

A Note on a Fuzzy Linear Regression Model for Fuzzy Input-output Date Using Real Coefficients

Hong, Dug-Hun
- Communications for Statistical Applications and Methods
- /
- v.8 no.2
- /
- pp.319-325
- /
- 2001
In this note, we propose a simple fuzzy linear regression model for fuzzy input-output data based on Tanaka's approach. Then an LP-based method to derived the satisfying solution of the decision making is developed.
PDF

A New Deletion Criterion of Principal Components Regression with Orientations of the Parameters

Lee, Won-Woo
- Journal of the Korean Statistical Society
- /
- v.16 no.2
- /
- pp.55-70
- /
- 1987
The principal components regression is one of the substitues for least squares method when there exists multicollinearity in the multiple linear regression model. It is observed graphically that the performance of the principal components regression is strongly dependent upon the values of the parameters. Accordingly, a new deletion criterion which determines proper principal components to be deleted from the analysis is developed and its usefulness is checked by simulations.
PDF

Training for Huge Data set with On Line Pruning Regression by LS-SVM

Kim, Dae-Hak;Shim, Joo-Yong;Oh, Kwang-Sik
- Proceedings of the Korean Statistical Society Conference
- /
- 2003.10a
- /
- pp.137-141
- /
- 2003
LS-SVM(least squares support vector machine) is a widely applicable and useful machine learning technique for classification and regression analysis. LS-SVM can be a good substitute for statistical method but computational difficulties are still remained to operate the inversion of matrix of huge data set. In modern information society, we can easily get huge data sets by on line or batch mode. For these kind of huge data sets, we suggest an on line pruning regression method by LS-SVM. With relatively small number of pruned support vectors, we can have almost same performance as regression with full data set.
PDF

Efficiency of Aggregate Data in Non-linear Regression

Huh, Jib
- Communications for Statistical Applications and Methods
- /
- v.8 no.2
- /
- pp.327-336
- /
- 2001
This work concerns estimating a regression function, which is not linear, using aggregate data. In much of the empirical research, data are aggregated for various reasons before statistical analysis. In a traditional parametric approach, a linear estimation of the non-linear function with aggregate data can result in unstable estimators of the parameters. More serious consequence is the bias in the estimation of the non-linear function. The approach we employ is the kernel regression smoothing. We describe the conditions when the aggregate data can be used to estimate the regression function efficiently. Numerical examples will illustrate our findings.
PDF

Support Vector Machine for Interval Regression

Hong Dug Hun;Hwang Changha
- Proceedings of the Korean Statistical Society Conference
- /
- 2004.11a
- /
- pp.67-72
- /
- 2004
Support vector machine (SVM) has been very successful in pattern recognition and function estimation problems for crisp data. This paper proposes a new method to evaluate interval linear and nonlinear regression models combining the possibility and necessity estimation formulation with the principle of SVM. For data sets with crisp inputs and interval outputs, the possibility and necessity models have been recently utilized, which are based on quadratic programming approach giving more diverse spread coefficients than a linear programming one. SVM also uses quadratic programming approach whose another advantage in interval regression analysis is to be able to integrate both the property of central tendency in least squares and the possibilistic property In fuzzy regression. However this is not a computationally expensive way. SVM allows us to perform interval nonlinear regression analysis by constructing an interval linear regression function in a high dimensional feature space. In particular, SVM is a very attractive approach to model nonlinear interval data. The proposed algorithm here is model-free method in the sense that we do not have to assume the underlying model function for interval nonlinear regression model with crisp inputs and interval output. Experimental results are then presented which indicate the performance of this algorithm.
PDF

On Sensitivity Analysis in Principal Component Regression

Kim, Soon-Kwi;Park, Sung H.
- Journal of the Korean Statistical Society
- /
- v.20 no.2
- /
- pp.177-190
- /
- 1991
In this paper, we discuss and review various measures which have been presented for studying outliers. high-leverage points, and influential observations when principal component regression is adopted. We suggest several diagnostics measures when principal component regression is used. A numerical example is illustrated. Some individual data points may be flagged as outliers, high-leverage point, or influential points.
PDF

Application of Statistical Models for Default Probability of Loans in Mortgage Companies

Jung, Jin-Whan
- Communications for Statistical Applications and Methods
- /
- v.7 no.2
- /
- pp.605-616
- /
- 2000
Three primary interests frequently raised by mortgage companies are introduced and the corresponding statistical approaches for the default probability in mortgage companies are examined. Statistical models considered in this paper are time series, logistic regression, decision tree, neural network, and discrete time models. Usage of the models is illustrated using an artificially modified data set and the corresponding models are evaluated in appropriate manners.
PDF

Search Result 3,423, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)