• Title, Summary, Keyword: Multiple regression analysis

### Multivariate Analysis for Clinicians (임상의를 위한 다변량 분석의 실제)

• Oh, Joo Han;Chung, Seok Won
• Clinics in Shoulder and Elbow
• /
• v.16 no.1
• /
• pp.63-72
• /
• 2013
• In medical research, multivariate analysis, especially multiple regression analysis, is used to analyze the influence of multiple variables on the result. Multiple regression analysis should include variables in the model and the problem of multi-collinearity as there are many variables as well as the basic assumption of regression analysis. The multiple regression model is expressed as the coefficient of determination, $R^2$ and the influence of independent variables on result as a regression coefficient, ${\beta}$. Multiple regression analysis can be divided into multiple linear regression analysis, multiple logistic regression analysis, and Cox regression analysis according to the type of dependent variables (continuous variable, categorical variable (binary logit), and state variable, respectively), and the influence of variables on the result is evaluated by regression coefficient${\beta}$, odds ratio, and hazard ratio, respectively. The knowledge of multivariate analysis enables clinicians to analyze the result accurately and to design the further research efficiently.

### ALC(Autoclaved Lightweight Concrete) Hardness Prediction Research By Multiple Regression Analysis (다중회귀분석을 이용한 ALC 경도예측에 관한 연구)

• Kim, Gwang-Su;Baek, Seung-Hun
• Proceedings of the Safety Management and Science Conference
• /
• /
• pp.117-137
• /
• 2012
• In the ALC(Autoclaved lightweight concrete) manufacturing process, if the pre-cured semi-cake is removed after proper time is passed, it will be hard to retain the moisture and be easily cracked. Therefore, in this research, we took the research by multiple regression analysis to find relationship between variables for the prediction the hardness that is the control standard of the removal time. We study the relationship between Independent variables such as the V/T(Vibration Time), V/T movement, expansion height, curing time, placing temperature, Rising and C/S ratio and the Dependent variables, the hardness by multiple regression analysis. In this study, first, we calculated regression equation by the regression analysis, then we tried phased regression analysis, best subset regression analysis and residual analysis. At last, we could verify curing time, placing temperature, Rising and C/S ratio influence to the hardness by the estimated regression equation.

### A Study on Stochastic Estimation of Monthly Runoff by Multiple Regression Analysis (다중회귀분석에 의한 하천 월 유출량의 추계학적 추정에 관한 연구)

• 김태철;정하우
• Magazine of the Korean Society of Agricultural Engineers
• /
• v.22 no.3
• /
• pp.75-87
• /
• 1980
• Most hydro]ogic phenomena are the complex and organic products of multiple causations like climatic and hydro-geological factors. A certain significant correlation on the run-off in river basin would be expected and foreseen in advance, and the effect of each these causual and associated factors (independant variables; present-month rainfall, previous-month run-off, evapotranspiration and relative humidity etc.) upon present-month run-off(dependent variable) may be determined by multiple regression analysis. Functions between independant and dependant variables should be treated repeatedly until satisfactory and optimal combination of independant variables can be obtained. Reliability of the estimated function should be tested according to the result of statistical criterion such as analysis of variance, coefficient of determination and significance-test of regression coefficients before first estimated multiple regression model in historical sequence is determined. But some error between observed and estimated run-off is still there. The error arises because the model used is an inadequate description of the system and because the data constituting the record represent only a sample from a population of monthly discharge observation, so that estimates of model parameter will be subject to sampling errors. Since this error which is a deviation from multiple regression plane cannot be explained by first estimated multiple regression equation, it can be considered as a random error governed by law of chance in nature. This unexplained variance by multiple regression equation can be solved by stochastic approach, that is, random error can be stochastically simulated by multiplying random normal variate to standard error of estimate. Finally hybrid model on estimation of monthly run-off in nonhistorical sequence can be determined by combining the determistic component of multiple regression equation and the stochastic component of random errors. Monthly run-off in Naju station in Yong-San river basin is estimated by multiple regression model and hybrid model. And some comparisons between observed and estimated run-off and between multiple regression model and already-existing estimation methods such as Gajiyama formula, tank model and Thomas-Fiering model are done. The results are as follows. (1) The optimal function to estimate monthly run-off in historical sequence is multiple linear regression equation in overall-month unit, that is; Qn=0.788Pn+0.130Qn-1-0.273En-0.1 About 85% of total variance of monthly runoff can be explained by multiple linear regression equation and its coefficient of determination (R2) is 0.843. This means we can estimate monthly runoff in historical sequence highly significantly with short data of observation by above mentioned equation. (2) The optimal function to estimate monthly runoff in nonhistorical sequence is hybrid model combined with multiple linear regression equation in overall-month unit and stochastic component, that is; Qn=0. 788Pn+0. l30Qn-1-0. 273En-0. 10+Sy.t The rest 15% of unexplained variance of monthly runoff can be explained by addition of stochastic process and a bit more reliable results of statistical characteristics of monthly runoff in non-historical sequence are derived. This estimated monthly runoff in non-historical sequence shows up the extraordinary value (maximum, minimum value) which is not appeared in the observed runoff as a random component. (3) "Frequency best fit coefficient" (R2f) of multiple linear regression equation is 0.847 which is the same value as Gaijyama's one. This implies that multiple linear regression equation and Gajiyama formula are theoretically rather reasonable functions.

### Water Demand Forecasting by Characteristics of City Using Principal Component and Cluster Analyses

• Choi, Tae-Ho;Kwon, O-Eun;Koo, Ja-Yong
• Environmental Engineering Research
• /
• v.15 no.3
• /
• pp.135-140
• /
• 2010
• With the various urban characteristics of each city, the existing water demand prediction, which uses average liter per capita day, cannot be used to achieve an accurate prediction as it fails to consider several variables. Thus, this study considered social and industrial factors of 164 local cities, in addition to population and other directly influential factors, and used main substance and cluster analyses to develop a more efficient water demand prediction model that considers unique localities of each city. After clustering, a multiple regression model was developed that proved that the $R^2$ value of the inclusive multiple regression model was 0.59; whereas, those of Clusters A and B were 0.62 and 0.74, respectively. Thus, the multiple regression model was considered more reasonable and valid than the inclusive multiple regression model. In summary, the water demand prediction model using principal component and cluster analyses as the standards to classify localities has a better modification coefficient than that of the inclusive multiple regression model, which does not consider localities.

### ALC(Autoclaved Lightweight Concrete) Hardness Prediction by Multiple Regression Analysis (다중회귀분석을 이용한 ALC 경도예측에 관한 연구)

• Kim, Kwang-Soo;Baek, Seung-Hoon;Chung, Soon-Suk
• Asia-Pacific Journal of Business Venturing and Entrepreneurship
• /
• v.7 no.2
• /
• pp.101-111
• /
• 2012
• In the ALC(Autoclaved lightweight concrete) manufacturing process, if the pre-cured semi-cake is removed after proper time is passed, it will be hard to retain the moisture and be easily cracked. Therefore, in this research, we took the research by multiple regression analysis to find relationship between variables for the prediction the hardness that is the control standard of the removal time. We study the relationship between Independent variables such as the V/T(Vibration Time), V/T movement, expansion height, curing time, placing temperature, Rising and C/S ratio and the Dependent variables, the hardness by multiple regression analysis. In this study, first, we calculated regression equation by the regression analysis, then we tried phased regression analysis, best subset regression analysis and residual analysis. At last, we could verify curing time, placing temperature, Rising and C/S ratio influence to the hardness by the estimated regression equation.

### Quantitative Analysis by Derivative Spectrophotometry (III) -Simultaneous quantitation of vitamin B group and vitamin C in by multiple linear regression analysis-

• Park, Man-Ki;Cho, Jung-Hwan
• Archives of Pharmacal Research
• /
• v.11 no.1
• /
• pp.45-51
• /
• 1988
• The feature of resolution enhancement by derivative operation is linked to one of the multivariate analysis, which is multiple linear regression with two options, all possible and stepwise regression. Examined samples were synthetic mixtures of 5 vitamins, thiamine mononitrate, riboflavin phosphate, nicotinamide, pyridoxine hydrochloride and ascorbic acid. All components in mixture were quantified with reasonably good accuracy and precision. Whole data processing procedure was accomplished on-line by the development of three computer programs written in APPLESOFT BASIC language.

### Correlation Analysis of Water Quality According to Land Use Types of Reservoir Watershed (유역 토지이용과 저수지 수질의 상관관계 분석)

• Youn, Dong-Koun;Chung, Sang-Ok
• Proceedings of the Korean Society of Agricultural Engineers Conference
• /
• /
• pp.614-619
• /
• 2005
• The object of this study was to presented regression equations for obtaining simply and quickly values of water quality items, BOD, COD, T-N, and T-P. Regression equations obtained to analyze relationships for water quality items to land use types in agricultural reservoir watersheds. In order to derive regression equations, a multiple linear regression analysis was used in this studying reservoirs. In this regression analysis, a independent values used land used types and dependent values used BOD, COD, T-N, T-P values in water quality items. The results showed that numbers of regression equation ranging above 0.90 in a multiple correlation coefficient (MCC) was not found, ranging from 0.70 to 0.90 in the MCC was 6, ranging from 0.40 to 0.70 in the MCC was 20, and ranging from 0.20 to 0.40 in the MCC was 4. The results of this study can be used as a basic information for evaluating simply and quickly water quality for proposing and designing steps in water quality policy.

### Correlation Analysis between Climate and Contamination Degree through Multiple Regression Analysis (다중회귀 분석을 통한 기후 및 오손도 간의 상관관계 분석)

• Kim, Do-Young;Lee, Won-Young;Shim, Kyu-Il;Han, Sang-Ok;Park, Kang-Sik
• Proceedings of the Korean Institute of Electrical and Electronic Material Engineers Conference
• /
• /
• pp.49-52
• /
• 2003
• The performance of insulators under contaminated conditions is the underlying and the most factor that determines insulation design for outdoor applications, Among the contamination factors, The sea salt is the most dangerous factor, and the salt factor have closed relation with climatic conditions, such as wind, temperature, humidity and so on, Effect of these factors to insulation system is different of each other, and need to show the correlation by multiple regression analysis techniques. In this paper, predicted and analyzed equivalent salt deposit density (ESDD) by change climatic condition through multiple regression analysis.

### The Development of the DEA-AR Model using Multiple Regression Analysis and Efficiency Evaluation of Regional Corporation in Korea (다중회귀분석을 이용한 DEA-AR 모형 개발 및 국내 지방공사의 효율성 평가)

• Sim, Gwang-Sic;Kim, Jae-Yun
• Journal of the Korean Operations Research and Management Science Society
• /
• v.37 no.1
• /
• pp.29-43
• /
• 2012
• We design a DEA-AR model using multiple regression analysis with new methods which limit weights. When there are multiple input and single output variables, our model can be used, and the weights of input variables use the regression coefficient and coefficient of determination. To verify the effectiveness of the new model, we evaluate the efficiency of the Regional Corporations in Korea. Accordance with statistical analysis, it proved that there is no difference between the efficiency value of the DEA-AR using AHP and our DEA-AR model. Our model can be applied to a lot of research by substituting DEA-AR model relying on AHP in the future.

### A Comparison of Construction Cost Estimation Using Multiple Regression Analysis and Neural Network in Elementary School Project

• Cho, Hong-Gyu;Kim, Kyong-Gon;Kim, Jang-Young;Kim, Gwang-Hee
• Journal of the Korea Institute of Building Construction
• /
• v.13 no.1
• /
• pp.66-74
• /
• 2013
• In the early stages of a construction project, the most important thing is to predict construction costs in a rational way. For this reason, many studies have been performed on the estimation of construction costs for apartment housing and office buildings at early stage using artificial intelligence, statistics, and the like. In this study, cost data held by a provincial Office of Education on elementary schools constructed from 2004 to 2007 were used to compare the multiple regression model with an artificial neural network model. A total of 96 historical data were classified into 76 historical data for constructing models and 20 historical data for comparing the constructed regression model with the artificial neural network model. The results of an analysis of predicted construction costs were that the error rate of the artificial neural network model is lower than that of the multiple regression model.