• Title/Summary/Keyword: Regression Analysis Method

Search Result 4,614, Processing Time 0.036 seconds

Prediction of Dietary Knowledge using Multiple Regression Analysis for Preventing Stomach Diseases (위장질환 예방을 위한 다중회귀분석을 이용한 식이지식 예측)

  • Choi, So-Young;Kim, Joo-Chang;Chung, Kyungyong
    • Journal of the Korea Convergence Society
    • /
    • v.10 no.7
    • /
    • pp.1-6
    • /
    • 2019
  • Modern society is undergoing nutritional imbalance according to the diet as the number of one person increases. This is increasing the incidence of chronic diseases such as gastrointestinal diseases and digestive diseases. This study suggests the prediction of dietary knowledge using multiple regression analysis for preventing chronic stomach diseases. The proposed method manages user's stomach diseases and dietary nutrition through the prediction of nutrition knowledge. It collects user's PHR through smart device and integrates in the health platform. The integrated data analyzes the dietary and activity of the user through multiple regression analysis. It predicts the required nutrients and provides services to users through applications. Therefore, it suggests recommended dietary components and consumed calories, appropriate dietary components based on the user's basal metabolism, and gastrointestinal levels. With the personalized health management, modern people can manage gastrointestinal diseases through a balanced diet.

Development of a soil total carbon prediction model using a multiple regression analysis method

  • Jun-Hyuk, Yoo;Jwa-Kyoung, Sung;Deogratius, Luyima;Taek-Keun, Oh;Jaesung, Cho
    • Korean Journal of Agricultural Science
    • /
    • v.48 no.4
    • /
    • pp.891-897
    • /
    • 2021
  • There is a need for a technology that can quickly and accurately analyze soil carbon contents. Existing soil carbon analysis methods are cumbersome in terms of professional manpower requirements, time, and cost. It is against this background that the present study leverages the soil physical properties of color and water content levels to develop a model capable of predicting the carbon content of soil sample. To predict the total carbon content of soil, the RGB values, water content of the soil, and lux levels were analyzed and used as statistical data. However, when R, G, and B with high correlations were all included in a multiple regression analysis as independent variables, a high level of multicollinearity was noted and G was thus excluded from the model. The estimates showed that the estimation coefficients for all independent variables were statistically significant at a significance level of 1%. The elastic values of R and B for the soil carbon content, which are of major interest in this study, were -2.90 and 1.47, respectively, showing that a 1% increase in the R value was correlated with a 2.90% decrease in the carbon content, whereas a 1% increase in the B value tallied with a 1.47% increase in the carbon content. Coefficient of determination (R2), root mean square error (RMSE), and mean absolute percentage error (MAPE) methods were used for regression verification, and calibration samples showed higher accuracy than the validation samples in terms of R2 and MAPE.

A Study on the Simplified Model for the Weight Estimation of Floating Offshore Plant using the Statistical Method (통계적 방법을 이용한 부유식 해양 플랜트의 중량 추정용 간이 모델 연구)

  • Seo, Seong-Ho;Roh, Myung-Il;Ku, Nam-Kug;Shin, Hyun-Kyung
    • Journal of the Society of Naval Architects of Korea
    • /
    • v.50 no.6
    • /
    • pp.373-382
    • /
    • 2013
  • The weight of floating offshore plant, such as an FPSO(Floating, Production, Storage, and Off-loading unit) and an offshore wind turbine, is important for estimating the amount of production material and for determining the production method. Furthermore, the weight is a factor which affects in the building cost and production time of the floating offshore plant. Although the importance of the weight has long been recognized, the weight has been roughly estimated by using the existing design and production data, and designer's experience. To solve this problem, a simplified model for the weight estimation of the floating offshore plant using the statistical method was proposed in this study. To do this, various data for estimating the weight of the floating offshore plant were collected through the literature survey, and then the correlation analysis and the multiple regression analysis were performed to generate the simplified model for the weight estimation. Finally, to examine the applicability of the developed model, it was applied to examples of the weight estimation of an FPSO topsides and an offshore wind turbine. As a result, it was shown that the developed model can be applied the weight estimation process of the floating offshore plant at the early design stage.

Fuzzy Nonlinear Regression Model (퍼지비선형회귀모형)

  • Hwang, Seung-Gook;Park, Young-Man;Seo, Yoo-Jin;Park, Kwang-Pak
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.8 no.6
    • /
    • pp.99-105
    • /
    • 1998
  • This paper is to propose the fuzzy regression model using genetic algorithm which is fuzzy nonlinear regression model. Genetic algorithm is used to classify the input data for better fuzzy regression analysis. From this partition. each data can be have the grade of membership function which is belonged to a divided data group. The data group, from optimal partition of the region of each variable, have different fuzzy parameters of fuzzy linear regression model one another. We compound the fuzzy output of each data group so as to obtain the final fuzzy number for a data. We show the efficiency of this method by means of demonstration of a case study.

  • PDF

ARTIFICIAL NEURAL NETWORK FOR PREDICTION OF WATER QUALITY IN PIPELINE SYSTEMS

  • Kim, Ju-Hwan;Yoon, Jae-Heung
    • Water Engineering Research
    • /
    • v.4 no.2
    • /
    • pp.59-68
    • /
    • 2003
  • The applicabilities and validities of two methodologies fur the prediction of THM (trihalomethane) formation in a water pipeline system were proposed and discussed. One is the multiple regression technique and the other is an artificial neural network technique. There are many factors which influence water quality, especially THMs formations in water pipeline systems. In this study, the prediction models of THM formation in water pipeline systems are developed based on the independent variables proposed by American Water Works Association(AWWA). Multiple linear/nonlinear regression models are estimated and three layer feed-forward artificial neural networks have been used to predict the THM formation in a water pipeline system. Input parameters of the models consist of organic compounds measured in water pipeline systems such as TOC, DOC and UV254. Also, the reaction time to each measuring site along pipeline is used as input parameter calculated by a hydraulic analysis. Using these variables as model parameters, four models are developed. And the predicted results from the four developed models are compared statistically to the measured THMs data set. It is shown that the artificial neural network approaches are much superior to the conventional regression approaches and that the developed models by neural network can be used more efficiently and reproduce more accurately the THMs formation in water pipeline systems, than the conventional regression methods proposed by AWWA.

  • PDF

Introduction to variational Bayes for high-dimensional linear and logistic regression models (고차원 선형 및 로지스틱 회귀모형에 대한 변분 베이즈 방법 소개)

  • Jang, Insong;Lee, Kyoungjae
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.3
    • /
    • pp.445-455
    • /
    • 2022
  • In this paper, we introduce existing Bayesian methods for high-dimensional sparse regression models and compare their performance in various simulation scenarios. Especially, we focus on the variational Bayes approach proposed by Ray and Szabó (2021), which enables scalable and accurate Bayesian inference. Based on simulated data sets from sparse high-dimensional linear regression models, we compare the variational Bayes approach with other Bayesian and frequentist methods. To check the practical performance of the variational Bayes in logistic regression models, a real data analysis is conducted using leukemia data set.

New fuzzy method in choosing Ground Motion Prediction Equation (GMPE) in probabilistic seismic hazard analysis

  • Mahmoudi, Mostafa;Shayanfar, MohsenAli;Barkhordari, Mohammad Ali;Jahani, Ehsan
    • Earthquakes and Structures
    • /
    • v.10 no.2
    • /
    • pp.389-408
    • /
    • 2016
  • Recently, seismic hazard analysis has become a very significant issue. New systems and available data have been also developed that could help scientists to explain the earthquakes phenomena and its physics. Scientists have begun to accept the role of uncertainty in earthquake issues and seismic hazard analysis. However, handling the existing uncertainty is still an important problem and lack of data causes difficulties in precisely quantifying uncertainty. Ground Motion Prediction Equation (GMPE) values are usually obtained in a statistical method: regression analysis. Each of these GMPEs uses the preliminary data of the selected earthquake. In this paper, a new fuzzy method was proposed to select suitable GMPE at every intensity (earthquake magnitude) and distance (site distance to fault) according to preliminary data aggregation in their area using ${\alpha}$ cut. The results showed that the use of this method as a GMPE could make a significant difference in probabilistic seismic hazard analysis (PSHA) results instead of selecting one equation or using logic tree. Also, a practical example of this new method was described in Iran as one of the world's earthquake-prone areas.

Penalized least distance estimator in the multivariate regression model (다변량 선형회귀모형의 벌점화 최소거리추정에 관한 연구)

  • Jungmin Shin;Jongkyeong Kang;Sungwan Bang
    • The Korean Journal of Applied Statistics
    • /
    • v.37 no.1
    • /
    • pp.1-12
    • /
    • 2024
  • In many real-world data, multiple response variables are often dependent on the same set of explanatory variables. In particular, if several response variables are correlated with each other, simultaneous estimation considering the correlation between response variables might be more effective way than individual analysis by each response variable. In this multivariate regression analysis, least distance estimator (LDE) can estimate the regression coefficients simultaneously to minimize the distance between each training data and the estimates in a multidimensional Euclidean space. It provides a robustness for the outliers as well. In this paper, we examine the least distance estimation method in multivariate linear regression analysis, and furthermore, we present the penalized least distance estimator (PLDE) for efficient variable selection. The LDE technique applied with the adaptive group LASSO penalty term (AGLDE) is proposed in this study which can reflect the correlation between response variables in the model and can efficiently select variables according to the importance of explanatory variables. The validity of the proposed method was confirmed through simulations and real data analysis.

Mathematical Model of the Edge Sealing Parameters for Vacuum Glazing Panel Using Multiple Regression Method (다중회귀분석법을 이용한 진공유리패널 모서리 접합부와 공정변수간의 수학적 모델 개발)

  • Kim, Young-Shin;Jeon, Euy-Sik
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.13 no.3
    • /
    • pp.961-966
    • /
    • 2012
  • The concern about vacuum glass is enhanced as society gets greener and becomes more concerned about energy savings due to the rising cost of oil. The glass edge sealing process needs the high reliability among the main process for the vacuum glass development in order to maintain between the two glass by the vacuum. In this paper, the process of the edge sealing was performed by using the hydrogen mixture gas which is the high density heat source unlike the traditional method glass edge sealing by using the frit as the soldering process. The ambient temperature in the electric furnace was set in the edge sealing to prevents the thermal impact and transformation of the glasses and the temperature distribution uniformity was measured. The parameter of the edge sealing was set through the basic test and the mathematical relation with the area of the glass edge parts according to the parameter was drawn using the multiple regression analysis method.

Pre-processing and Bias Correction for AMSU-A Radiance Data Based on Statistical Methods (통계적 방법에 근거한 AMSU-A 복사자료의 전처리 및 편향보정)

  • Lee, Sihye;Kim, Sangil;Chun, Hyoung-Wook;Kim, Ju-Hye;Kang, Jeon-Ho
    • Atmosphere
    • /
    • v.24 no.4
    • /
    • pp.491-502
    • /
    • 2014
  • As a part of the KIAPS (Korea Institute of Atmospheric Prediction Systems) Package for Observation Processing (KPOP), we have developed the modules for Advanced Microwave Sounding Unit-A (AMSU-A) pre-processing and its bias correction. The KPOP system calculates the airmass bias correction coefficients via the method of multiple linear regression in which the scan-corrected innovation and the thicknesses of 850~300, 200~50, 50~5, and 10~1 hPa are respectively used for dependent and independent variables. Among the four airmass predictors, the multicollinearity has been shown by the Variance Inflation Factor (VIF) that quantifies the severity of multicollinearity in a least square regression. To resolve the multicollinearity, we adopted simple linear regression and Principal Component Regression (PCR) to calculate the airmass bias correction coefficients and compared the results with those from the multiple linear regression. The analysis shows that the order of performances is multiple linear, principal component, and simple linear regressions. For bias correction for the AMSU-A channel 4 which is the most sensitive to the lower troposphere, the multiple linear regression with all four airmass predictors is superior to the simple linear regression with one airmass predictor of 850~300 hPa. The results of PCR with 95% accumulated variances accounted for eigenvalues showed the similar results of the multiple linear regression.