• Title/Summary/Keyword: Linear Regression Algorithm

Search Result 285, Processing Time 0.025 seconds

EPB-TBM performance prediction using statistical and neural intelligence methods

  • Ghodrat Barzegari;Esmaeil Sedghi;Ata Allah Nadiri
    • Geomechanics and Engineering
    • /
    • v.37 no.3
    • /
    • pp.197-211
    • /
    • 2024
  • This research studies the effect of geotechnical factors on EPB-TBM performance parameters. The modeling was performed using simple and multivariate linear regression methods, artificial neural networks (ANNs), and Sugeno fuzzy logic (SFL) algorithm. In ANN, 80% of the data were randomly allocated to training and 20% to network testing. Meanwhile, in the SFL algorithm, 75% of the data were used for training and 25% for testing. The coefficient of determination (R2) obtained between the observed and estimated values in this model for the thrust force and cutterhead torque was 0.19 and 0.52, respectively. The results showed that the SFL outperformed the other models in predicting the target parameters. In this method, the R2 obtained between observed and predicted values for thrust force and cutterhead torque is 0.73 and 0.63, respectively. The sensitivity analysis results show that the internal friction angle (φ) and standard penetration number (SPT) have the greatest impact on thrust force. Also, earth pressure and overburden thickness have the highest effect on cutterhead torque.

Estimation and variable selection in censored regression model with smoothly clipped absolute deviation penalty

  • Shim, Jooyong;Bae, Jongsig;Seok, Kyungha
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.6
    • /
    • pp.1653-1660
    • /
    • 2016
  • Smoothly clipped absolute deviation (SCAD) penalty is known to satisfy the desirable properties for penalty functions like as unbiasedness, sparsity and continuity. In this paper, we deal with the regression function estimation and variable selection based on SCAD penalized censored regression model. We use the local linear approximation and the iteratively reweighted least squares algorithm to solve SCAD penalized log likelihood function. The proposed method provides an efficient method for variable selection and regression function estimation. The generalized cross validation function is presented for the model selection. Applications of the proposed method are illustrated through the simulated and a real example.

Prediction of Melting Point for Drug-like Compounds Using Principal Component-Genetic Algorithm-Artificial Neural Network

  • Habibi-Yangjeh, Aziz;Pourbasheer, Eslam;Danandeh-Jenagharad, Mohammad
    • Bulletin of the Korean Chemical Society
    • /
    • v.29 no.4
    • /
    • pp.833-841
    • /
    • 2008
  • Principal component-genetic algorithm-multiparameter linear regression (PC-GA-MLR) and principal component-genetic algorithm-artificial neural network (PC-GA-ANN) models were applied for prediction of melting point for 323 drug-like compounds. A large number of theoretical descriptors were calculated for each compound. The first 234 principal components (PC’s) were found to explain more than 99.9% of variances in the original data matrix. From the pool of these PC’s, the genetic algorithm was employed for selection of the best set of extracted PC’s for PC-MLR and PC-ANN models. The models were generated using fifteen PC’s as variables. For evaluation of the predictive power of the models, melting points of 64 compounds in the prediction set were calculated. Root-mean square errors (RMSE) for PC-GA-MLR and PC-GA-ANN models are 48.18 and $12.77{^{\circ}C}$, respectively. Comparison of the results obtained by the models reveals superiority of the PC-GA-ANN relative to the PC-GA-MLR and the recently proposed models (RMSE = $40.7{^{\circ}C}$). The improvements are due to the fact that the melting point of the compounds demonstrates non-linear correlations with the principal components.

QSPR Study of the Absorption Maxima of Azobenzene Dyes

  • Xu, Jie;Wang, Lei;Liu, Li;Bai, Zikui;Wang, Luoxin
    • Bulletin of the Korean Chemical Society
    • /
    • v.32 no.11
    • /
    • pp.3865-3872
    • /
    • 2011
  • A quantitative structure-property relationship (QSPR) study was performed for the prediction of the absorption maxima of azobenzene dyes. The entire set of 191 azobenzenes was divided into a training set of 150 azobenzenes and a test set of 41 azobenzenes according to Kennard and Stones algorithm. A seven-descriptor model, with squared correlation coefficient ($R^2$) of 0.8755 and standard error of estimation (s) of 14.476, was developed by applying stepwise multiple linear regression (MLR) analysis on the training set. The reliability of the proposed model was further illustrated using various evaluation techniques: leave-many-out crossvalidation procedure, randomization tests, and validation through the test set.

Statistical analysis of KNHANES data with measurement error models

  • Hwang, Jinseub
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.3
    • /
    • pp.773-779
    • /
    • 2015
  • We study a statistical analysis about the fifth wave data of the Korea National Health and Nutrition Examination Survey based on linear regression models with measurement errors. The data is obtained from a national population-based complex survey. To demonstrate the availability of measurement error models, two results between the general linear regression model and measurement error model are compared based on the model selection criteria which are Akaike information criterion and Bayesian information criterion. For our study, we use the simulation extrapolation algorithm for measurement error model and the jackknife method for the estimation of standard errors.

Is it Possible to Predict the ADI of Pesticides using the QSAR Approach?

  • Kim, Jae Hyoun
    • Journal of Environmental Health Sciences
    • /
    • v.38 no.6
    • /
    • pp.550-560
    • /
    • 2012
  • Objectives: QSAR methodology was applied to explain two different sets of acceptable daily intake (ADI) data of 74 pesticides proposed by both the USEPA and WHO in terms of setting guidelines for food and drinking water. Methods: A subset of calculated descriptors was selected from Dragon$^{(R)}$ software. QSARs were then developed utilizing a statistical technique, genetic algorithm-multiple linear regression (GA-MLR). The differences in each specific model in the prediction of the ADI of the pesticides were discussed. Results: The stepwise multiple linear regression analysis resulted in a statistically significant QSAR model with five descriptors. Resultant QSAR models were robust, showing good utility across multiple classes of pesticide compounds. The applicability domain was also defined. The proposed models were robust and satisfactory. Conclusions: The QSAR model could be a feasible and effective tool for predicting ADI and for the comparison of logADIEPA to logADIWHO. The statistical results agree with the fact that USEPA focuses on more subtle endpoints than does WHO.

An Experimental Study on Mathematical Model to Predict Bead Width in GMA Weldment (GMA 용접부의 비드폭 예측을 위한 수학적 모델에 관한 실험적 연구)

  • Kim, Ill Soo;Park, Min Ho;Kim, Hak Hyoung;Lee, Jong Pyo;Park, Cheol Kyun;Shim, Ji Yeon
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.32 no.2
    • /
    • pp.209-217
    • /
    • 2015
  • Generally welding is one of the most important processes to have a strong influence on the quality and productivity from a manufacture-based industry such as shipbuilding, automotive and machinery. The GMA(Gas Metal Arc) welding process involves large number of interdependent welding parameters which may affect product quality, productivity and cost effectiveness. To solve such problems, mathematical models are required to select the welding parameters for GMA welding process. In this study, the GMA welding process was studied using the information generated during the welding. The statistical analysis of a generalized regression approach was conducted by the following three methods: Firstly using the mathematical model (linear regression, 2nd regression); Secondly GA(Genetic Algorithm) with intelligent models; And finally using response surface analysis of models to develop the relationships between welding parameters and bead width as welding quality.

Prediction of California bearing ratio (CBR) for coarse- and fine-grained soils using the GMDH-model

  • Mintae Kim;Seyma Ordu;Ozkan Arslan;Junyoung Ko
    • Geomechanics and Engineering
    • /
    • v.33 no.2
    • /
    • pp.183-194
    • /
    • 2023
  • This study presents the prediction of the California bearing ratio (CBR) of coarse- and fine-grained soils using artificial intelligence technology. The group method of data handling (GMDH) algorithm, an artificial neural network-based model, was used in the prediction of the CBR values. In the design of the prediction models, various combinations of independent input variables for both coarse- and fine-grained soils have been used. The results obtained from the designed GMDH-type neural networks (GMDH-type NN) were compared with other regression models, such as linear, support vector, and multilayer perception regression methods. The performance of models was evaluated with a regression coefficient (R2), root-mean-square error (RMSE), and mean absolute error (MAE). The results showed that GMDH-type NN algorithm had higher performance than other regression methods in the prediction of CBR value for coarse- and fine-grained soils. The GMDH model had an R2 of 0.938, RMSE of 1.87, and MAE of 1.48 for the input variables {G, S, and MDD} in coarse-grained soils. For fine-grained soils, it had an R2 of 0.829, RMSE of 3.02, and MAE of 2.40, when using the input variables {LL, PI, MDD, and OMC}. The performance evaluations revealed that the GMDH-type NN models were effective in predicting CBR values of both coarse- and fine-grained soils.

Improvement of Recognition Performance for Limabeam Algorithm by using MLLR Adaptation

  • Nguyen, Dinh Cuong;Choi, Suk-Nam;Chung, Hyun-Yeol
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.8 no.4
    • /
    • pp.219-225
    • /
    • 2013
  • This paper presents a method using Maximum-Likelihood Linear Regression (MLLR) adaptation to improve recognition performance of Limabeam algorithm for speech recognition using microphone array. From our investigation on Limabeam algorithm, we can see that the performance of filtering optimization depends strongly on the supporting optimal state sequence and this sequence is created by using Viterbi algorithm trained with HMM model. So we propose an approach using MLLR adaptation for the recognition of speech uttered in a new environment to obtain better optimal state sequence that support for the filtering parameters' optimal step. Experimental results show that the system embedded with MLLR adaptation presents the word correct recognition rate 2% higher than that of original calibrate Limabeam and also present 7% higher than that of Delay and Sum algorithm. The best recognition accuracy of 89.4% is obtained when we use 4 microphones with 5 utterances for adaptation.

Fuzzy Polynomial Neural Networks based on GMDH algorithm and Polynomial Fuzzy Inference (GMDH 알고리즘과 다항식 퍼지추론에 기초한 퍼지 다항식 뉴럴 네트워크)

  • 박호성;윤기찬;오성권
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2000.05a
    • /
    • pp.130-133
    • /
    • 2000
  • In this paper, a new design methodology named FNNN(Fuzzy Polynomial Neural Network) algorithm is proposed to identify the structure and parameters of fuzzy model using PNN(Polynomial Neural Network) structure and a fuzzy inference method. The PNN is the extended structure of the GMDH(Group Method of Data Handling), and uses several types of polynomials such as linear, quadratic and modified quadratic besides the biquadratic polynomial used in the GMDH. The premise of fuzzy inference rules defines by triangular and gaussian type membership function. The fuzzy inference method uses simplified and regression polynomial inference method which is based on the consequence of fuzzy rule expressed with a polynomial such as linear, quadratic and modified quadratic equation are used. Each node of the FPNN is defined as fuzzy rules and its structure is a kind of neuro-fuzzy architecture Several numerical example are used to evaluate the performance of out proposed model. Also we used the training data and testing data set to obtain a balance between the approximation and generalization of proposed model.

  • PDF