• Title/Summary/Keyword: 회귀추정법

Search Result 387, Processing Time 0.027 seconds

回歸分析에 있어서의 多共線性과 名稱을 保全시키는 資料變換 技法

  • 兪浣
    • Journal of the Korean Statistical Society
    • /
    • v.8 no.2
    • /
    • pp.109-116
    • /
    • 1979
  • 두 개의 변수의 대체효과(substitution effect)를 연구하기 위하여 수요 또는 공급의 모형을 만들었을 경우 이에 관련된 변수들의 이름이 중요시 된다. 실제 관측 자료를 사용하였을 경우 흔히 일어나는 다공선성(multicollinearity) 문제를 다루기 위한 대안으로써 선형회귀선을 예로 들어 능형회귀기법(ridge regression technique)과 요인분석기법(factor analytic technique)을 소개하였으며 이에서 얻어지는 계수(coefficient)를 OLS 추정치로 설명하기 위하여 원래의 자료를 변환하였다. 실지 수요와 공급의 모형이 비선형일 경우 일반적으로 능형회귀나 요인분석을 쓰지 못한다는 점을 감안, 이러한 방법을 자료의 변환방법으로 설명함으로써 비선형모형에서도 다공선성문제를 위하여 능형회귀분석법이나 요인분석기법을 사용할 수 있도록 하였다.

  • PDF

무응답 보정에서 변수 선택을 이용한 보조정보의 결정에 관한 연구

  • 손창균;홍기학;이기성
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2001.11a
    • /
    • pp.63-68
    • /
    • 2001
  • 조사과정에서 필연적으로 발생하는 무응답을 보정하기 위해 보조정보를 사용한다. 이 때, 이용 가능한 보조정보의 차원이 크면, 계산과정에서 많은 시간이 소요되며 데이터를 다루기가 매우 어렵다. 또한 추정량의 분산이 보조정보의 차원에 의존하기 때문에 과소추정의 문제가 발생한다. 이러한 문제를 해결하기 위해 무응답 보정에서 적절한 보조정보의 선택 방법을 제안하였고, 이에 대한 효율성을 모의실험을 통해 살펴보았다.

  • PDF

Outlier Detection of Autoregressive Models Using Robust Regression Estimators (로버스트 추정법을 이용한 자기상관회귀모형에서의 특이치 검출)

  • Lee Dong-Hee;Park You-Sung;Kim Kee-Whan
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.2
    • /
    • pp.305-317
    • /
    • 2006
  • Outliers adversely affect model identification, parameter estimation, and forecast in time series data. In particular, when outliers consist of a patch of additive outliers, the current outlier detection procedures suffer from the masking and swamping effects which make them inefficient. In this paper, we propose new outlier detection procedure based on high breakdown estimators, called as the dual robust filtering. Empirical and simulation studies in the autoregressive model with orders p show that the proposed procedure is effective.

The Estimation of Software Development Effort Using Multiple Regression Method (다중회귀 분석을 이용한 소프트웨어 개발노력추정)

  • Jung Hye-Jung;Yang Hae-Sool;Shin Seok-Kyoo;Lee Sang-Un
    • The KIPS Transactions:PartD
    • /
    • v.11D no.7 s.96
    • /
    • pp.1483-1490
    • /
    • 2004
  • To accomplish a project successfuly, we have to estimate develpment effort accurately. But, development effort is different to software size and operation environment. Usually, we made use of function point for estimating development effort. In this paper. we make use of 789 project data. It is related to development projects in 1990`s. We investigate the variable affecting development effort. Also, we exedcute multiple liner regression analysis for looking linear relation about variables. We find the regression equation for multistage by dividing PDR that influ-enced development effort step by step.

Estimation of Wood Oven-Dry Density by Using a Portable Dielectric Moisture Meter (휴대용 유전율식 수분계를 이용한 목재의 전건밀도 추정)

  • Kang, Chun-Won;Lim, Ho-Mook;Kang, Ho-Yang
    • Journal of the Korean Wood Science and Technology
    • /
    • v.45 no.5
    • /
    • pp.629-639
    • /
    • 2017
  • Tripitaka Koreana in Haein Temple, Hapcheon Province is the most historical and largest heritage in this country, however the species of their blocks have yet been unknown. A nondestructive test method is necessary to investigate their species. The oven-dry density of wood was measured by inversely using the principle of high frequency moisture meter. The oven-dry densities of more than 100 domestic species of specimens estimated by measurement method and high frequency moisture meter were compared and following conclusions were obtained. There was highly close correlation between the oven-dry density estimated by measurement method and the oven-dry density estimated by high frequency moisture meter. The densities of Tripitaka Koreana that was a global cultural heritage could be correctly estimated by using high frequency moisture meter and the equilibrium moisture content under which Tripitaka Koreana equilibrated, thus, it was expected to provide the key to species identification.

First-time estimation of HCHO column in major cities over Asia using multiple regression with satellite data (위성자료와 다중회귀분석법을 이용한 아시아 주요도시의 포름알데하이드 칼럼농도 추정연구)

  • Choi, Wonei;Hong, Hyunkee;Park, Junsung;Lee, Hanlim
    • Korean Journal of Remote Sensing
    • /
    • v.31 no.6
    • /
    • pp.523-530
    • /
    • 2015
  • A Multiple Regression Method (MRM) is used for the first time with Ozone Monitoring Instrument (OMI) and Moderate Resolution Imaging Spectroradiometer (MODIS) data to estimate formaldehyde (HCHO) Vertical Column Density (VCD). For a 3.5-year period from January 2005 through July 2008, HCHO VCD estimation is investigated in cities over Asia in two categorized areas: (1) Major cities in Northeast Asia (Beijing, Seoul, and Tokyo), (2) Major cities in Southeast Asia (New Delhi, Dhaka, and Bangkok). In the Major cities in Northeast Asia, there are good agreements between HCHO estimated by the multiple linear regression method ($HCHO_{MRM}$) and HCHO measured by OMI ($HCHO_{OMI}$) (0.78 < $R^2$ < 0.82). However, in Major cities in Southeast Asia, there were poor agreements between $HCHO_{OMI}$ and $HCHO_{MRM}$ (0.24 < $R^2$ < 0.39). In addition, an unbiased assessment of the MRM performance using modeling and validation groups shows that the performance of the MRM based on separate modeling and validation groups is comparable to that using all the data for deriving Multiple Regression Equations (MREs). This study demonstrates that MRM can be an alternative tool for HCHO estimation in certain areas over Asia.

Hydrologic Response Estimation Using Mallows' $C_L$ Statistics (Mallows의 $C_L$ 통계량을 이용한 수문응답 추정)

  • Seong, Gi-Won;Sim, Myeong-Pil
    • Journal of Korea Water Resources Association
    • /
    • v.32 no.4
    • /
    • pp.437-445
    • /
    • 1999
  • The present paper describes the problem of hydrologic response estimation using non-parametric ridge regression method. The method adapted in this work is based on the minimization of the $C_L$ statistics, which is an estimate of the mean square prediction error. For this method, effects of using both the identity matrix and the Laplacian matrix were considered. In addition, we evaluated methods for estimating the error variance of the impulse response. As a result of analyzing synthetic and real data, a good estimation was made when the Laplacian matrix for the weighting matrix and the bias corrected estimate for the error variance were used. The method and procedure presented in present paper will play a robust and effective role on separating hydrologic response.

  • PDF

Fitting Distribution of Accident Frequency of Freeway Horizontal Curve Sections & Development of Negative Binomial Regression Models (고속도로 평면선형상 사고빈도분포 추정을 통한 음이항회귀모형 개발 (기하구조요인을 중심으로))

  • 강민욱;도철웅;손봉수
    • Journal of Korean Society of Transportation
    • /
    • v.20 no.7
    • /
    • pp.197-204
    • /
    • 2002
  • 교통사고예측 및 예방을 위해서는 실제적으로 도로설계과정에서 제어가 가능한 도로 기하구조요소에 대한 사고관계를 파악함이 타당하다. 즉, 도로의 설계자는 도로건설에 앞서 기하구조요소와 사고와의 관계를 현장자료를 통해 정확히 밝혀 도로설계에 반영해야 한다. 이를 위해, 교통사고의 빈도분포를 박히는 것은 가장 기본이 되는 일이며, 교통사고 예측모형개발에 선행되어야 한다. 일반적으로 교통사고건수의 경우 분산이 평균보다 큰 과분산(overdispersion)의 특징을 가지고 있어 음이항 분포를 따른다고 알려져 있다. 따라서 본 논문은 사고모형의 개발에 앞서, 사고발생지점에 대한 도로설계요소와 기타 잠재적인 사고발생 관련요인이 비교적 잘 파악되어있는 호남고속도로를 중심으로 평면 선형상 곡선부에 대하여 교통사고의 분포를 적합도 검정을 통해 알아보고자 하였다. 사고자료는 한국도로송사의 호남고속도로 5년(1996∼2000)간 자료를 분석에 맞게 정리하였으며, 강민욱과 송봉수(2002)에서 제시한 평면선형에 있어서의 구간분할법을 이용하여 배향곡선구간과 단일곡선구간에 대한 사고분석을 하였다. 적합도 분석결과, 예상대로 음이항분포가 사고건수를 설명하기에 가장 적합한 확률분포로 제시되었으며, 이를 통해 최우추정법을 이용한 음이항회귀모형을 개발하였다. 구간분할법을 적용한 음이항회귀모형의 경우, 기존의 확률회귀토형에 비하여 높은 결정계수를 갖았으며, 모형에서 적용된 기하구조요소로는 차량 노출계수, 곡선반경, 단위거리 당 편경사변화값 등이다.

Time series analysis for Korean COVID-19 confirmed cases: HAR-TP-T model approach (한국 COVID-19 확진자 수에 대한 시계열 분석: HAR-TP-T 모형 접근법)

  • Yu, SeongMin;Hwang, Eunju
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.2
    • /
    • pp.239-254
    • /
    • 2021
  • This paper studies time series analysis with estimation and forecasting for Korean COVID-19 confirmed cases, based on the approach of a heterogeneous autoregressive (HAR) model with two-piece t (TP-T) distributed errors. We consider HAR-TP-T time series models and suggest a step-by-step method to estimate HAR coefficients as well as TP-T distribution parameters. In our proposed step-by-step estimation, the ordinary least squares method is utilized to estimate the HAR coefficients while the maximum likelihood estimation (MLE) method is adopted to estimate the TP-T error parameters. A simulation study on the step-by-step method is conducted and it shows a good performance. For the empirical analysis on the Korean COVID-19 confirmed cases, estimates in the HAR-TP-T models of order p = 2, 3, 4 are computed along with a couple of selected lags, which include the optimal lags chosen by minimizing the mean squares errors of the models. The estimation results by our proposed method and the solely MLE are compared with some criteria rules. Our proposed step-by-step method outperforms the MLE in two aspects: mean squares error of the HAR model and mean squares difference between the TP-T residuals and their densities. Moreover, forecasting for the Korean COVID-19 confirmed cases is discussed with the optimally selected HAR-TP-T model. Mean absolute percentage error of one-step ahead out-of-sample forecasts is evaluated as 0.0953% in the proposed model. We conclude that our proposed HAR-TP-T time series model with optimally selected lags and its step-by-step estimation provide an accurate forecasting performance for the Korean COVID-19 confirmed cases.