• 제목/요약/키워드: Linear regression models

검색결과 947건 처리시간 0.029초

지구 통계 모형을 이용한 양파 재배지 농업기상정보 생성 방법 (Production of Agrometeorological Information in Onion Fields using Geostatistical Models)

  • 임지은;윤상후
    • 한국환경과학회지
    • /
    • 제27권7호
    • /
    • pp.509-518
    • /
    • 2018
  • Weather is the most influential factor for crop cultivation. Weather information for cultivated areas is necessary for growth and production forecasting of agricultural crops. However, there are limitations in the meteorological observations in cultivated areas because weather equipment is not installed. This study tested methods of predicting the daily mean temperature in onion fields using geostatistical models. Three models were considered: inverse distance weight method, generalized additive model, and Bayesian spatial linear model. Data were collected from the AWS (automatic weather system), ASOS (automated synoptic observing system), and an agricultural weather station between 2013 and 2016. To evaluate the prediction performance, data from AWS and ASOS were used as the modeling data, and data from the agricultural weather station were used as the validation data. It was found that the Bayesian spatial linear regression performed better than other models. Consequently, high-resolution maps of the daily mean temperature of Jeonnam were generated using all observed weather information.

구조방정식을 이용한 도시부 4지 신호교차로의 사고원인 분석 (A Causational Study for Urban 4-legged Signalized Intersections using Structural Equation Method)

  • 오주택;이상규;허태영;황정원
    • 한국도로학회논문집
    • /
    • 제14권6호
    • /
    • pp.121-129
    • /
    • 2012
  • PURPOSES : Traffic accidents at intersections have been increased annually so that it is required to examine the causations to reduce the accidents. However, the current existing accident models were developed mainly with non-linear regression models such as Poisson methods. These non-linear regression methods lack to reveal complicated causations for traffic accidents, though they are right choices to study randomness and non-linearity of accidents. Therefore, to reveal the complicated causations of traffic accidents, this study used structural equation methods(SEM). METHODS : SEM used in this study is a statistical technique for estimating causal relations using a combination of statistical data and qualitative causal assumptions. SEM allow exploratory modeling, meaning they are suited to theory development. The method is tested against the obtained measurement data to determine how well the model fits the data. Among the strengths of SEM is the ability to construct latent variables: variables which are not measured directly, but are estimated in the model from several measured variables. This allows the modeler to explicitly capture the unreliability of measurement in the model, which allows the structural relations between latent variables to be accurately estimated. RESULTS : The study results showed that causal factors could be grouped into 3. Factor 1 includes traffic variables, and Factor 2 contains turning traffic variables. Factor 3 consists of other road element variables such as speed limits or signal cycles. CONCLUSIONS : Non-linear regression models can be used to develop accident predictions models. However, they lack to estimate causal factors, because they select only few significant variables to raise the accuracy of the model performance. Compared to the regressions, SEM has merits to estimate causal factors affecting accidents, because it allows the structural relations between latent variables. Therefore, this study used SEM to estimate causal factors affecting accident at urban signalized intersections.

A Comparative Study of Estimation by Analogy using Data Mining Techniques

  • Nagpal, Geeta;Uddin, Moin;Kaur, Arvinder
    • Journal of Information Processing Systems
    • /
    • 제8권4호
    • /
    • pp.621-652
    • /
    • 2012
  • Software Estimations provide an inclusive set of directives for software project developers, project managers, and the management in order to produce more realistic estimates based on deficient, uncertain, and noisy data. A range of estimation models are being explored in the industry, as well as in academia, for research purposes but choosing the best model is quite intricate. Estimation by Analogy (EbA) is a form of case based reasoning, which uses fuzzy logic, grey system theory or machine-learning techniques, etc. for optimization. This research compares the estimation accuracy of some conventional data mining models with a hybrid model. Different data mining models are under consideration, including linear regression models like the ordinary least square and ridge regression, and nonlinear models like neural networks, support vector machines, and multivariate adaptive regression splines, etc. A precise and comprehensible predictive model based on the integration of GRA and regression has been introduced and compared. Empirical results have shown that regression when used with GRA gives outstanding results; indicating that the methodology has great potential and can be used as a candidate approach for software effort estimation.

ARTIFICIAL NEURAL NETWORK FOR PREDICTION OF WATER QUALITY IN PIPELINE SYSTEMS

  • Kim, Ju-Hwan;Yoon, Jae-Heung
    • Water Engineering Research
    • /
    • 제4권2호
    • /
    • pp.59-68
    • /
    • 2003
  • The applicabilities and validities of two methodologies fur the prediction of THM (trihalomethane) formation in a water pipeline system were proposed and discussed. One is the multiple regression technique and the other is an artificial neural network technique. There are many factors which influence water quality, especially THMs formations in water pipeline systems. In this study, the prediction models of THM formation in water pipeline systems are developed based on the independent variables proposed by American Water Works Association(AWWA). Multiple linear/nonlinear regression models are estimated and three layer feed-forward artificial neural networks have been used to predict the THM formation in a water pipeline system. Input parameters of the models consist of organic compounds measured in water pipeline systems such as TOC, DOC and UV254. Also, the reaction time to each measuring site along pipeline is used as input parameter calculated by a hydraulic analysis. Using these variables as model parameters, four models are developed. And the predicted results from the four developed models are compared statistically to the measured THMs data set. It is shown that the artificial neural network approaches are much superior to the conventional regression approaches and that the developed models by neural network can be used more efficiently and reproduce more accurately the THMs formation in water pipeline systems, than the conventional regression methods proposed by AWWA.

  • PDF

감마 일반화 선형 모형에서의 가능도비 검정과 F-검정 비교연구 (Comparing the performance of likelihood ratio test and F-test for gamma generalized linear models)

  • 조성일;한정섭;이우주
    • 응용통계연구
    • /
    • 제31권4호
    • /
    • pp.475-484
    • /
    • 2018
  • 감마 일반화 선형모형은 음이 아니며 치우침이 있는 반응변수에 유용한 모형으로 알려져 있다. 그러나 포아송 분포 또는 이항 분포에 기반한 일반화 선형모형에 비해 적은 관심을 받아왔다. 특히, 회귀계수의 유의성 검정에 대해서는 연구가 면밀히 되어 있지 않다. 본 논문에서는 감마 일반화 선형 모형의 검정에 대해 다양한 통계량들을 알아보고 수치 연구를 통해 그들의 성능을 비교한다. 수치 실험의 결과 부분 이탈도 검정 방법의 문제점이 나타났으며, 가능도비 검정 방법과 F-검정 방법이 좋은 성능을 보임을 확인하였다.

More on directional regression

  • Kim, Kyongwon;Yoo, Jae Keun
    • Communications for Statistical Applications and Methods
    • /
    • 제28권5호
    • /
    • pp.553-562
    • /
    • 2021
  • Directional regression (DR; Li and Wang, 2007) is well-known as an exhaustive sufficient dimension reduction method, and performs well in complex regression models to have linear and nonlinear trends. However, the extension of DR is not well-done upto date, so we will extend DR to accommodate multivariate regression and large p-small n regression. We propose three versions of DR for multivariate regression and discuss how DR is applicable for the latter regression case. Numerical studies confirm that DR is robust to the number of clusters and the choice of hierarchical-clustering or pooled DR.

회귀모형의 기울기에 대한 품행성 검정 (Parallelism Test of Slope in Simple Linear Regression Models)

  • 박현욱;김동재
    • Communications for Statistical Applications and Methods
    • /
    • 제16권1호
    • /
    • pp.75-83
    • /
    • 2009
  • 단순선형 회귀모형의 기울기에 대한 평행성 검정법을 제안하였다. 세 군 이상에서 기울기에 대하여 Tukey (1953)가 제안한 HSD방법을 이용한 모수적 검정법과 Kruskal-Wallis (1952) 검정법을 이용한 비모수적 검정법을 각각 제안하였다. 또한 모의실험을 통하여 기존의 검정법과 제안한 검정법의 검정력을 비교하였다.

일반화 선형모형을 통한 품질개선실험 자료분석 (Generalized Linear Models for the Analysis of Data from the Quality-Improvement Experiments)

  • 이영조;임용빈
    • 품질경영학회지
    • /
    • 제24권2호
    • /
    • pp.128-141
    • /
    • 1996
  • The advent of the quality-improvement movement caused a great expansion in the use of statistically designed experiments in industry. The regression method is often used for the analysis of data from such experiments. However, the data for a quality characterstic often takes the form of counts or the ratio of counts, e.g. fraction of defectives. For such data the analysis using generalized linear models is preferred to that using the simple regression model. In this paper we introduce the generalized linear model and show how it can be used for the analysis of non-normal data from quality-improvement experiments.

  • PDF

Statistical analysis of KNHANES data with measurement error models

  • Hwang, Jinseub
    • Journal of the Korean Data and Information Science Society
    • /
    • 제26권3호
    • /
    • pp.773-779
    • /
    • 2015
  • We study a statistical analysis about the fifth wave data of the Korea National Health and Nutrition Examination Survey based on linear regression models with measurement errors. The data is obtained from a national population-based complex survey. To demonstrate the availability of measurement error models, two results between the general linear regression model and measurement error model are compared based on the model selection criteria which are Akaike information criterion and Bayesian information criterion. For our study, we use the simulation extrapolation algorithm for measurement error model and the jackknife method for the estimation of standard errors.

3지와 4지 회전교차로의 사고분석 (Accident Analysis of 3-legged and 4-legged Roundabouts)

  • 박민규;박병호
    • 한국안전학회지
    • /
    • 제27권3호
    • /
    • pp.161-166
    • /
    • 2012
  • This study deals with the accident of roundabout. The objective is to analyze the traffic accidents occurred in 3-legged and 4-legged roundabouts through the developed models. In developing the multiple linear regression models, this study uses the number of traffic accidents as a dependent variable and such the variables as geometric structures, traffic characters and others as the independent variables. The correlation and multicollinearity of variables were analyzed using SPSS17.0. The main results are as follows. First, R-square value of developed models were analyzed to be 0.851(3-leg) and 0.689(4-leg), respectively. Second, the independent variables in the 3-legged roundabout accident model were analyzed to be the traffic volume and number of crosswalk, and the variables in the 4-legged roundabouts were evaluated to be the traffic volume and signal. Finally, the paired t-test shows that the predicted values and observed values are not statistically different.