• Title/Summary/Keyword: the multiple regression analysis

Search Result 9,700, Processing Time 0.04 seconds

Is it Possible to Predict the ADI of Pesticides using the QSAR Approach?

  • Kim, Jae Hyoun
    • Journal of Environmental Health Sciences
    • /
    • v.38 no.6
    • /
    • pp.550-560
    • /
    • 2012
  • Objectives: QSAR methodology was applied to explain two different sets of acceptable daily intake (ADI) data of 74 pesticides proposed by both the USEPA and WHO in terms of setting guidelines for food and drinking water. Methods: A subset of calculated descriptors was selected from Dragon$^{(R)}$ software. QSARs were then developed utilizing a statistical technique, genetic algorithm-multiple linear regression (GA-MLR). The differences in each specific model in the prediction of the ADI of the pesticides were discussed. Results: The stepwise multiple linear regression analysis resulted in a statistically significant QSAR model with five descriptors. Resultant QSAR models were robust, showing good utility across multiple classes of pesticide compounds. The applicability domain was also defined. The proposed models were robust and satisfactory. Conclusions: The QSAR model could be a feasible and effective tool for predicting ADI and for the comparison of logADIEPA to logADIWHO. The statistical results agree with the fact that USEPA focuses on more subtle endpoints than does WHO.

Bayesian test for the differences of survival functions in multiple groups

  • Kim, Gwangsu
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.2
    • /
    • pp.115-127
    • /
    • 2017
  • This paper proposes a Bayesian test for the equivalence of survival functions in multiple groups. Proposed Bayesian test use the model of Cox's regression with time-varying coefficients. B-spline expansions are used for the time-varying coefficients, and the proposed test use only the partial likelihood, which provides easier computations. Various simulations of the proposed test and typical tests such as log-rank and Fleming and Harrington tests were conducted. This result shows that the proposed test is consistent as data size increase. Specifically, the power of the proposed test is high despite the existence of crossing hazards. The proposed test is based on a Bayesian approach, which is more flexible when used in multiple tests. The proposed test can therefore perform various tests simultaneously. Real data analysis of Larynx Cancer Data was conducted to assess applicability.

A Study on increasing the fitness of forecasts using Dynamic Model (동적 모형에 의한 예측치의 정도 향상에 관한 연구)

  • 윤석환;윤상원;신용백
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.19 no.40
    • /
    • pp.1-14
    • /
    • 1996
  • We develop a dynamic demand forecasting model compared to regression analysis model and AutoRegressive Integrated Moving Average(ARIMA) model. The dynamic model can apply to the current dynamic data to forecasts through introducing state equation. A multiple regression model and ARIMA model using given data are designed via the model analysis. The forecasting fitness evaluation between the designed models and the dynamic model is compared with the criterion of sum of squared error.

  • PDF

A Study on the Participation toward Voluntary Activities for Elderly Women's (여성노인의 자원봉사참여 활성화에 관한 연구)

  • 심미영;정정숙;염동문
    • Korean Journal of Human Ecology
    • /
    • v.7 no.2
    • /
    • pp.115-126
    • /
    • 2004
  • The purpose of this study was to examine the influential variables of participation toward voluntary activities of elderly women's. For this purpose, the data collected 312 women's elderly in Jinju city. Statistics employed for the analysis were frequencies, logit regression and multiple regression analysis. The major results of this study were as follows; In participation intention toward voluntary activities for women's elderly, the variables which influence were job, state of health, level of life, need of voluntary activities, and satisfaction of voluntary activities in the past.

  • PDF

Mediation Effect of Cognitive Emotion-regulation Strategy in the Relationship between Family Stress and Marital Satisfaction -focused on the comparison of husbands and wives- (중년기 부부의 가족스트레스와 결혼만족도 관계에서 인지적 정서조절전략의 매개효과 -남편과 아내의 비교를 중심으로-)

  • Kim, Du-Gil;You, Young-Dal
    • The Journal of the Korea Contents Association
    • /
    • v.15 no.11
    • /
    • pp.177-191
    • /
    • 2015
  • This study was designed to investigate the differences in the mediation effects of cognitive emotion-regulation in the relationships between family stress and marital satisfaction of middle-aged husbands and wives. The data were underwent through the SPSS 19. Win for frequency, paired-t-test, multiple regression analysis, and for the further analysis of hiearchical multiple regression and the analysis for the mediation effect proposed by Baron & Kenny(1996) and Sobel test(Sobel, 1982). The results were as follows. First, hiearchical multiple regression analysis showed that education level, couples' problems, replanning strategies were proved to be significant for the marital satisfaction of the husbands, while age, couples' problems, acceptance strategies were significant for the marital satisfaction of the wives. Second, cognitive emotion-regulation strategies were proved to partially mediate in the relationship between family stress and marital satisfaction. The limitations of the study and implications for the further research were discussed based on the results.

Comparison of Different Multiple Linear Regression Models for Real-time Flood Stage Forecasting (실시간 수위 예측을 위한 다중선형회귀 모형의 비교)

  • Choi, Seung Yong;Han, Kun Yeun;Kim, Byung Hyun
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.32 no.1B
    • /
    • pp.9-20
    • /
    • 2012
  • Recently to overcome limitations of conceptual, hydrological and physics based models for flood stage forecasting, multiple linear regression model as one of data-driven models have been widely adopted for forecasting flood streamflow(stage). The objectives of this study are to compare performance of different multiple linear regression models according to regression coefficient estimation methods and determine most effective multiple linear regression flood stage forecasting models. To do this, the time scale was determined through the autocorrelation analysis of input data and different flood stage forecasting models developed using regression coefficient estimation methods such as LS(least square), WLS(weighted least square), SPW(stepwise) was applied to flood events in Jungrang stream. To evaluate performance of established models, fours statistical indices were used, namely; Root mean square error(RMSE), Nash Sutcliffe efficiency coefficient (NSEC), mean absolute error (MAE), adjusted coefficient of determination($R^{*2}$). The results show that the flood stage forecasting model using SPW(stepwise) parameter estimation can carry out the river flood stage prediction better in comparison with others, and the flood stage forecasting model using LS(least square) parameter estimation is also found to be slightly better than the flood stage forecasting model using WLS(weighted least square) parameter estimation.

Multiple Regression Analysis for Piercing Punch Profile Optimization to Prevent Tearing During Tee Pipe Burring (다중 회귀 분석을 활용한 Tee-Pipe 버링 공정에서 찢어짐 방지를 위한 피어싱 펀치 형상 최적 설계)

  • Lee, Y.S.;Kim, J.Y.;Kang, J.S.;Hong, S.
    • Transactions of Materials Processing
    • /
    • v.26 no.5
    • /
    • pp.271-276
    • /
    • 2017
  • A tee is the most common pipefitting used to combine or divide fluid flow. Tees can connect pipes of different diameters or change the direction of a pipe run. To manufacture tee type of stainless steel pipe, combinations of punch piercing and burr forming have been widely used in the industry. However, such method is considerably time consuming with regard to performing empirical work necessary to attain process conditions to prevent upper end tearing of the tee product and meet target tee height. Numerous experiments have shown that the piercing profile is the main cause of defects mentioned above. Furthermore, the mold design is formed through trial and error according to pipe diameters and changes in requirements. Thus, the objective of this study was to perform piercing and burring process analysis via finite element analysis using DYNAFORM to resolve problems mentioned above. An optimization design method was used to determine the piercing punch profile. Three radii of the piercing punch (i.e., large, small, and joined radii) were selected as design variables to minimize thinning of a tee pipe. Based on results of correlation and multiple regression analyses, we developed a predictive approximation model to satisfy requirements for both thickness reduction and target height. The new piercing punch profile was then applied to actual tee forming using the developed prediction equation. Model results were found to be in good agreement with experimental results.

Relationship between Plant Species Covers and Soil Chemical Properties in Poorly Controlled Waste Landfill Sites

  • Kim, Kee-Dae;Lee, Eun-Ju
    • Journal of Ecology and Environment
    • /
    • v.30 no.1
    • /
    • pp.39-47
    • /
    • 2007
  • The relationships between the cover of herbaceous species and 15 soil chemical properties (organic carbon contents, total N, available P, exchangeable K, Na, Ca and Mg, HCl-extractable Cd, Cr, Cu, Fe, Mn, Ni, Pb and Zn) in nine poorly controlled waste landfill sites in Korea were examined by correlation analysis and multiple regression equations. Species showed different patterns of correlation between their cover values and soil chemical properties. The cover of Ambrosia artemisiifolia var. elatior, Aster subulatus var. sandwicensis and Erechtites hieracifolia were negatively correlated with the contents of Fe, Mn and Ni within landfill soils. Total cover of all species in quadrats was positively correlated with the contents of Cd and negatively correlated with the contents of Mn and Fe from stepwise regression analysis with 15 soil properties. Canonical correspondence analysis demonstrated that the distribution of native and exotic plants on poorly controlled landfills was significantly influenced by the contents of Na and Ca in soils, respectively.

ARTIFICIAL NEURAL NETWORK FOR PREDICTION OF WATER QUALITY IN PIPELINE SYSTEMS

  • Kim, Ju-Hwan;Yoon, Jae-Heung
    • Water Engineering Research
    • /
    • v.4 no.2
    • /
    • pp.59-68
    • /
    • 2003
  • The applicabilities and validities of two methodologies fur the prediction of THM (trihalomethane) formation in a water pipeline system were proposed and discussed. One is the multiple regression technique and the other is an artificial neural network technique. There are many factors which influence water quality, especially THMs formations in water pipeline systems. In this study, the prediction models of THM formation in water pipeline systems are developed based on the independent variables proposed by American Water Works Association(AWWA). Multiple linear/nonlinear regression models are estimated and three layer feed-forward artificial neural networks have been used to predict the THM formation in a water pipeline system. Input parameters of the models consist of organic compounds measured in water pipeline systems such as TOC, DOC and UV254. Also, the reaction time to each measuring site along pipeline is used as input parameter calculated by a hydraulic analysis. Using these variables as model parameters, four models are developed. And the predicted results from the four developed models are compared statistically to the measured THMs data set. It is shown that the artificial neural network approaches are much superior to the conventional regression approaches and that the developed models by neural network can be used more efficiently and reproduce more accurately the THMs formation in water pipeline systems, than the conventional regression methods proposed by AWWA.

  • PDF

Applications of response dimension reduction in large p-small n problems

  • Minjee Kim;Jae Keun Yoo
    • Communications for Statistical Applications and Methods
    • /
    • v.31 no.2
    • /
    • pp.191-202
    • /
    • 2024
  • The goal of this paper is to show how multivariate regression analysis with high-dimensional responses is facilitated by the response dimension reduction. Multivariate regression, characterized by multi-dimensional response variables, is increasingly prevalent across diverse fields such as repeated measures, longitudinal studies, and functional data analysis. One of the key challenges in analyzing such data is managing the response dimensions, which can complicate the analysis due to an exponential increase in the number of parameters. Although response dimension reduction methods are developed, there is no practically useful illustration for various types of data such as so-called large p-small n data. This paper aims to fill this gap by showcasing how response dimension reduction can enhance the analysis of high-dimensional response data, thereby providing significant assistance to statistical practitioners and contributing to advancements in multiple scientific domains.