• 제목/요약/키워드: Linear regression models

검색결과 944건 처리시간 0.026초

Residuals Plots for Repeated Measures Data

  • 박태성
    • 한국통계학회:학술대회논문집
    • /
    • 한국통계학회 2000년도 추계학술발표회 논문집
    • /
    • pp.187-191
    • /
    • 2000
  • In the analysis of repeated measurements, multivariate regression models that account for the correlations among the observations from the same subject are widely used. Like the usual univariate regression models, these multivariate regression models also need some model diagnostic procedures. In this paper, we propose a simple graphical method to detect outliers and to investigate the goodness of model fit in repeated measures data. The graphical method is based on the quantile-quantile(Q-Q) plots of the $X^2$ distribution and the standard normal distribution. We also propose diagnostic measures to detect influential observations. The proposed method is illustrated using two examples.

  • PDF

3차원 박판형성 공정 유한요소해석용 드로우비드 모델 (Drawbead Model for 3-Dimensional Finite Element Analysis of Sheet Metal Forming Processess)

  • 금영탁;김준환;차지혜
    • 소성∙가공
    • /
    • 제11권5호
    • /
    • pp.394-404
    • /
    • 2002
  • The drawbead model for a three-dimensional a finite element analysis of sheet metal forming processes is developed. The mathematical models of the basic drawbeads like circular drawbead, stepped drawbead, and squared drawbaed are first derived using the bending theory, belt-pulley equation, and Coulomb friction law. Next, the experiments for finding the drawing characteristics of the drawbead are performed. Based on mathematical models and drawing test results, expert models of basic drawbeads are then developed employing a linear multiple regression method. For the expert models of combined drawbeads such as the double circular drawbead, double stepped drawbead, circular-and-stepped drawbead, etc., those of the basic drawbeads are summed. Finally, in order to verify the expert models developed, the drawing characteristics calculated by the expert models of the double circular drawbead and circular-and-stepped drawbead are compared with those obtained from the experiments. The predictions by expert models agree well with the measurements by experiments.

호우피해자료에서의 고차원 자료 및 다중공선성 문제를 해소한 회귀모형 개발 (Development of Regression Models Resolving High-Dimensional Data and Multicollinearity Problem for Heavy Rain Damage Data)

  • 김정환;박지현;최창현;김형수
    • 대한토목학회논문집
    • /
    • 제38권6호
    • /
    • pp.801-808
    • /
    • 2018
  • 선형회귀모형의 학습은 일반적으로 자료의 개수가 설명변수의 개수보다 충분히 크고, 설명변수들 사이에 심각한 다중공선성이 없다는 가정 하에서 안정적으로 이루어진다. 본 연구에서는 이러한 가정이 위배되었을 경우 모형 학습의 어려움을 실제 호우피해자료를 분석함으로써 조명하였고, 이를 해결하기 위해 자료를 통합한 다음 주성분회귀모형 또는 능형회귀모형을 사용할 것을 검토하였다. 모형의 학습에 사용된 자료와 별도의 독립된 자료에서 제안된 모형들의 예측력을 평가하였고, 제안된 방법이 선형회귀모형보다 더 나은 예측력을 보이는 것을 확인하였다.

다변량 선형회귀분석을 이용한 증발접시계수 산정방법 적용성 검토 (Evaluation of applicability of pan coefficient estimation method by multiple linear regression analysis)

  • 임창수
    • 한국수자원학회논문집
    • /
    • 제55권3호
    • /
    • pp.229-243
    • /
    • 2022
  • 우리나라 11개 기상관측지역의 월별 기상자료가 증발접시계수에 미치는 영향을 분석하고, 증발접시계수 산정을 위한 4가지 형태의 다변량 선형회귀모형의 적용성을 검토하였다. 개발된 증발접시계수 산정모형의 적용성을 평가하기 위해서 기존에 다른 연구자들에 의해서 제안된 6가지의 모형과 비교 평가하였다. 우리나라 11개 기상관측지역에서 증발접시계수는 1, 2, 3, 7, 11, 12월은 기온에 가장 큰 영향을 받고, 다른 월들은 일사량에 가장 큰 영향을 받는 것으로 나타났다. 전반적으로 모든 월에서 풍속과 상대습도는 기온이나 일사량과 비교해서 증발접시계수에 큰 영향을 미치지 않는 것으로 나타났다. 모든 지역과 월에서 각 지역별로 5개의 독립변수(풍속, 상대습도, 기온, 일조시간과 가조시간의 비, 일사량)를 적용하여 유도된 모형이 가장 양호한 증발량 산정 결과를 보였다. 모형 검증결과에 의하면 다변량 선형회귀분석을 적용하여 증발접시계수를 산정하는 경우 일부 지역과 월에서 제한적으로 적용할 수 있을 것으로 판단된다.

통계적 축소법을 이용한 한반도 인근해역의 미래 표층수온 추정 (Prediction of Future Sea Surface Temperature around the Korean Peninsular based on Statistical Downscaling)

  • 함희정;김상수;윤우석
    • 산업기술연구
    • /
    • 제31권B호
    • /
    • pp.107-112
    • /
    • 2011
  • Recently, climate change around the world due to global warming has became an important issue and damages by climate change have a bad effect on human life. Changes of Sea Surface Temperature(SST) is associated with natural disaster such as Typhoon and El Nino. So we predicted daily future SST using Statistical Downscaling Method and CGCM 3.1 A1B scenario. 9 points of around Korea peninsular were selected to predict future SST and built up a regression model using Multiple Linear Regression. CGCM 3.1 was simulated with regression model, and that comparing Probability Density Function, Box-Plot, and statistical data to evaluate suitability of regression models, it was validated that regression models were built up properly.

  • PDF

고차원 선형 및 로지스틱 회귀모형에 대한 변분 베이즈 방법 소개 (Introduction to variational Bayes for high-dimensional linear and logistic regression models)

  • 장인송;이경재
    • 응용통계연구
    • /
    • 제35권3호
    • /
    • pp.445-455
    • /
    • 2022
  • 본 논문에서는 고차원 희소 회귀분석을 위한 기존의 베이지안 방법들을 소개하고, 다양한 모의실험 세팅에서 성능을 비교한다. 특히, 확장 가능하고 정확한 베이지안 추론을 가능하게 하는 변분 베이즈 방법(variational Bayes method) (Ray와 Szabó, 2021) 에 중점을 둔다. 시뮬레이션 자료를 기반으로 한 희소 고차원 선형회귀분석을 실시하고 변분 베이즈 방법의 성능을 다른 베이지안 및 빈도론 방법들과 비교한다. 로지스틱 회귀분석에서 변분 베이즈 방법의 실제 성능을 확인하기 위해 백혈병 유전자 발현 자료를 사용하여 실자료 분석을 수행한다.

Comparison of different post-processing techniques in real-time forecast skill improvement

  • Jabbari, Aida;Bae, Deg-Hyo
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2018년도 학술발표회
    • /
    • pp.150-150
    • /
    • 2018
  • The Numerical Weather Prediction (NWP) models provide information for weather forecasts. The highly nonlinear and complex interactions in the atmosphere are simplified in meteorological models through approximations and parameterization. Therefore, the simplifications may lead to biases and errors in model results. Although the models have improved over time, the biased outputs of these models are still a matter of concern in meteorological and hydrological studies. Thus, bias removal is an essential step prior to using outputs of atmospheric models. The main idea of statistical bias correction methods is to develop a statistical relationship between modeled and observed variables over the same historical period. The Model Output Statistics (MOS) would be desirable to better match the real time forecast data with observation records. Statistical post-processing methods relate model outputs to the observed values at the sites of interest. In this study three methods are used to remove the possible biases of the real-time outputs of the Weather Research and Forecast (WRF) model in Imjin basin (North and South Korea). The post-processing techniques include the Linear Regression (LR), Linear Scaling (LS) and Power Scaling (PS) methods. The MOS techniques used in this study include three main steps: preprocessing of the historical data in training set, development of the equations, and application of the equations for the validation set. The expected results show the accuracy improvement of the real-time forecast data before and after bias correction. The comparison of the different methods will clarify the best method for the purpose of the forecast skill enhancement in a real-time case study.

  • PDF

작품 가격 추정을 위한 기계 학습 기법의 응용 및 가격 결정 요인 분석 (Price Determinant Factors of Artworks and Prediction Model Based on Machine Learning)

  • 장동률;박민재
    • 품질경영학회지
    • /
    • 제47권4호
    • /
    • pp.687-700
    • /
    • 2019
  • Purpose: The purpose of this study is to investigate the interaction effects between price determinants of artworks. We expand the methodology in art market by applying machine learning techniques to estimate the price of artworks and compare linear regression and machine learning in terms of prediction accuracy. Methods: Moderated regression analysis was performed to verify the interaction effects of artistic characteristics on price. The moderating effects were studied by confirming the significance level of the interaction terms of the derived regression equation. In order to derive price estimation model, we use multiple linear regression analysis, which is a parametric statistical technique, and k-nearest neighbor (kNN) regression, which is a nonparametric statistical technique in machine learning methods. Results: Mostly, the influences of the price determinants of art are different according to the auction types and the artist 's reputation. However, the auction type did not control the influence of the genre of the work on the price. As a result of the analysis, the kNN regression was superior to the linear regression analysis based on the prediction accuracy. Conclusion: It provides a theoretical basis for the complexity that exists between pricing determinant factors of artworks. In addition, the nonparametric models and machine learning techniques as well as existing parameter models are implemented to estimate the artworks' price.

Quasi-Likelihood Approach for Linear Models with Censored Data

  • Ha, Il-Do;Cho, Geon-Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • 제9권2호
    • /
    • pp.219-225
    • /
    • 1998
  • The parameters in linear models with censored normal responses are usually estimated by the iterative maximum likelihood and least square methods. However, the iterative least square method is simple but hardly has theoretical justification, and the iterative maximum likelihood estimating equations are complicatedly derived. In this paper, we justify these methods via Wedderburn (1974)'s quasi-likelihood approach. This provides an explicit justification for the iterative least square method and also directly the iterative maximum likelihood method for estimating the regression coefficients.

  • PDF

간척지 재배 근채류의 최대 엽장과 엽폭을 이용한 지하부 생체중 추정용 회귀 모델 결정 (Determination of Regression Model for Estimating Root Fresh Weight Using Maximum Leaf Length and Width of Root Vegetables Grown in Reclaimed Land)

  • 정대호;이평호;이인복
    • 한국환경농학회지
    • /
    • 제39권3호
    • /
    • pp.204-213
    • /
    • 2020
  • BACKGROUND: Since the number of crops cultivated in reclaimed land is huge, it is very difficult to quantify the total crop production. Therefore, a non-destructive method for predicting crop production is needed. Salt tolerant root vegetables such as red beets and sugar beet are suitable for cultivation in reclaimed land. If their underground biomass can be predicted, it helps to estimate crop productivity. Objectives of this study are to investigate maximum leaf length and weight of red beet, sugar beet, and turnips grown in reclaimed land, and to determine optimal model with regression analysis for linear and allometric growth models. METHODS AND RESULTS: Maximum leaf length, width, and root fresh weight of red beets, sugar beets, and turnips were measured. Ten linear models and six allometric growth models were selected for estimation of root fresh weight and non-linear regression analysis was conducted. The allometric growth model, which have a variable multiplied by square of maximum leaf length and maximum leaf width, showed highest R2 values of 0.67, 0.70, and 0.49 for red beets, sugar beets, and turnips, respectively. Validation results of the models for red beets and sugar beets showed the R2 values of 0.63 and 0.65, respectively. However, the model for turnips showed the R2 value of 0.48. The allometric growth model was suitable for estimating the root fresh weight of red beets and sugar beets, but the accuracy for turnips was relatively low. CONCLUSION: The regression models established in this study may be useful to estimate the total production of root vegetables cultivated in reclaimed land, and it will be used as a non-destructive method for prediction of crop information.