• Title/Summary/Keyword: Multiple-Linear-Regression

Search Result 1,745, Processing Time 0.025 seconds

Resampling-based Test of Hypothesis in L1-Regression

  • Kim, Bu-Yong
    • Communications for Statistical Applications and Methods
    • /
    • v.11 no.3
    • /
    • pp.643-655
    • /
    • 2004
  • L$_1$-estimator in the linear regression model is widely recognized to have superior robustness in the presence of vertical outliers. While the L$_1$-estimation procedures and algorithms have been developed quite well, less progress has been made with the hypothesis test in the multiple L$_1$-regression. This article suggests computer-intensive resampling approaches, jackknife and bootstrap methods, to estimating the variance of L$_1$-estimator and the scale parameter that are required to compute the test statistics. Monte Carlo simulation studies are performed to measure the power of tests in small samples. The simulation results indicate that bootstrap estimation method is the most powerful one when it is employed to the likelihood ratio test.

Auditory Perception Experiment on Attribute of Road Traffic Noise Causing Annoyance with Identical Linear Sound Pressure Level (동일한 선형 음압 레벨의 도로교통소음의 성가심 유발 인자에 관한 연구)

  • An, Jang-Ho;Schang, Seo-Il;Ko, J.H.;Chun, Hyung-Jun
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2006.11a
    • /
    • pp.641-648
    • /
    • 2006
  • This study investigates which sound quality indices except SPL raise annoyance response. For investigation, auditory perception experiments for road traffic noise with identical linear SPL were performed by Paired Comparison Method. The numerical results of a Paired Comparison experiment express relative preference about annoyance. So that these relative preference scores are to be correlated to sound quality indices, which are absolute, a transformation is required to go from the relative domain to an absolute and linear scale of preference. The results of the transformation will be the 'merit values,' which quantifies the annoyance(in this case) of the road traffic noise on a linear scale. Using multiple regression, a formula is established that can calculate predicted merit values. Furthermore, This investigation offers a method selecting sound samples that represent various sound quality indices values to use experiment.

  • PDF

Chewing difficulty and multiple chronic conditions in Korean elders: KNHANES IV (임상가를 위한 특집 3 - 한국 노인에서 저작불편감과 복합만성질 환의 연관성: 제4기 국민건강영양조사)

  • Han, Dong-Hun
    • The Journal of the Korean dental association
    • /
    • v.51 no.9
    • /
    • pp.511-517
    • /
    • 2013
  • To assess the association between oral health and general health, this study examined the relationship between chewing difficulty and twelve chronic health conditions such as hypertension, hyperlipidemia, diabetes, cerebro- and cardiovascular disease, musculoskeletal disease, respiratory disease, eye/nose/throat disease, stomach/intestinal ulcer, renal dysfunction, thyroid disease, depression, and cancer in Korea. The study population was 3,066 elders aged 65 years old and more from the fourth Korean National Health and Nutrition Examination Survey. Chewing difficulty was measured on a 5-point Likert scale. Chronic conditions were assessed by self-reported questionnaire. Confounders were age, gender, education, income, smoking, drinking, and obesity. Chi-square test, general linear model, and multiple logistic regression model were done with complex sampling design. Musculoskeletal disease (adjusted odds ratio=1.33), respiratory disease (adjusted odds ratio=1.52), and cancer (adjusted odds ratio=1.58) were independently associated with chewing difficulty. Multiple chronic conditions with more than 4 chronic disease showed significant association with chewing difficulty (adjusted odds ratio=1.37).

A New Calibration Method Based on the Recursive Linear Regression with Variables Selection

  • Park, Kwang-Su;Jun, Chi-Hyuck
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • 2001.06a
    • /
    • pp.1241-1241
    • /
    • 2001
  • We propose a new calibration method, which uses the linearization method for spectral responses and the repetitive adoptions of the linearization weight matrices to construct a frature. Weight matrices are estimated through multiple linear regression (or principal component regression or partial least squares) with forward variable selection. The proposed method is applied to three data sets. The first is FTIR spectral data set for FeO content from sinter process and the second is NIR spectra from trans-alkylation process having two constituent variables. The third is NIR spectra of crude oil with three physical property variables. To see the calibration performance, we compare the new method with the PLS. It is found that the new method gives a little better performance than the PLS and the calibration result is stable in spite of the collinearity among each selected spectral responses. Furthermore, doing the repetitive adoptions of linearization matrices in the proposed methods, uninformative variables are disregarded. That is, the new methods include the effect of variables subset selection, simultaneously.

  • PDF

Prediction of the transfer length of prestressing strands with neural networks

  • Marti-Vargas, Jose R.;Ferri, Francesc J.;Yepes, Victor
    • Computers and Concrete
    • /
    • v.12 no.2
    • /
    • pp.187-209
    • /
    • 2013
  • This paper presents a study on the prediction of transfer length of 13 mm seven-wire prestressing steel strand in pretensioned prestressed concrete members with rectangular cross-section including several material properties and design and manufacture parameters. To this end, a carefully selected database consisting of 207 different cases coming from 18 different sources spanning a variety of practical transfer length prediction situations was compiled. 16 single input features and 5 combined input features are analyzed. A widely used feedforward neural regression model was considered as a representative of several machine learning methods that have already been used in the engineering field. Classical multiple linear regression was also considered in order to comparatively assess performance and robustness in this context. The results show that the implemented model has good prediction and generalization capacity when it is used on large input data sets of practical interest from the engineering point of view. In particular, a neural model is proposed -using only 4 hidden units and 10 input variables-which significantly reduces in 30% and 60% the errors in transfer length prediction when using standard linear regression or fixed formulas, respectively.

Concrete properties prediction based on database

  • Chen, Bin;Mao, Qian;Gao, Jingquan;Hu, Zhaoyuan
    • Computers and Concrete
    • /
    • v.16 no.3
    • /
    • pp.343-356
    • /
    • 2015
  • 1078 sets of mixtures in total that include fly ash, slag, and/or silica fume have been collected for prediction on concrete properties. A new database platform (Compos) has been developed, by which the stepwise multiple linear regression (SMLR) and BP artificial neural networks (BP ANNs) programs have been applied respectively to identify correlations between the concrete properties (strength, workability, and durability) and the dosage and/or quality of raw materials'. The results showed obvious nonlinear relations so that forecasting by using nonlinear method has clearly higher accuracy than using linear method. The forecasting accuracy rises along with the increasing of age and the prediction on cubic compressive strength have the best results, because the minimum average relative error (MARE) for 60-day cubic compressive strength was less than 8%. The precision for forecasting of concrete workability takes the second place in which the MARE is less than 15%. Forecasting on concrete durability has the lowest accuracy as its MARE has even reached 30%. These conclusions have been certified in a ready-mixed concrete plant that the synthesized MARE of 7-day/28-day strength and initial slump is less than 8%. The parameters of BP ANNs and its conformation have been discussed as well in this study.

Analysis of Regional Fertility Gap Factors Using Explainable Artificial Intelligence (설명 가능한 인공지능을 이용한 지역별 출산율 차이 요인 분석)

  • Dongwoo Lee;Mi Kyung Kim;Jungyoon Yoon;Dongwon Ryu;Jae Wook Song
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.47 no.1
    • /
    • pp.41-50
    • /
    • 2024
  • Korea is facing a significant problem with historically low fertility rates, which is becoming a major social issue affecting the economy, labor force, and national security. This study analyzes the factors contributing to the regional gap in fertility rates and derives policy implications. The government and local authorities are implementing a range of policies to address the issue of low fertility. To establish an effective strategy, it is essential to identify the primary factors that contribute to regional disparities. This study identifies these factors and explores policy implications through machine learning and explainable artificial intelligence. The study also examines the influence of media and public opinion on childbirth in Korea by incorporating news and online community sentiment, as well as sentiment fear indices, as independent variables. To establish the relationship between regional fertility rates and factors, the study employs four machine learning models: multiple linear regression, XGBoost, Random Forest, and Support Vector Regression. Support Vector Regression, XGBoost, and Random Forest significantly outperform linear regression, highlighting the importance of machine learning models in explaining non-linear relationships with numerous variables. A factor analysis using SHAP is then conducted. The unemployment rate, Regional Gross Domestic Product per Capita, Women's Participation in Economic Activities, Number of Crimes Committed, Average Age of First Marriage, and Private Education Expenses significantly impact regional fertility rates. However, the degree of impact of the factors affecting fertility may vary by region, suggesting the need for policies tailored to the characteristics of each region, not just an overall ranking of factors.

Effects of Environmental Correlates on Alcohol-related Problems among Colleges (대학교의 환경적 특성이 음주폐해에 미친 영향)

  • Kim, Kwang-Kee;Jang, Seung-Ock;JeKarl, Jung
    • Korean Journal of Health Education and Promotion
    • /
    • v.23 no.3
    • /
    • pp.65-83
    • /
    • 2006
  • Objectives: This is one of the first efforts to describe incidence of alcohol-related problems and to identify environmental correlates associated with them among colleges. Methods: Date were collected by a sample of 105 college administrators who are in charge of student affairs in colleges nationwide through self-administrated questionnaire. Both logistic and linear multiple regression analyses were employed to identify the correlates associated with alcohol-related problems. Results: Most of colleges(76.6%) under study reported to have at least one alcohol-related problem in previous years. Interpersonal violence was alcohol-related problem taken placed most frequently, followed by making noise episode, having property damaged and motor vehicle accidents. Logistic regression analysis identified factors associated with incidents of alcohol related problems. They included being private colleges, numbers of prevention activities, product promotion and marketing by alcohol industry and alcohol accessibility to drinking context. Multiple regression analyses showed that correlates associated with numbers of alcohol-related problems included being a private college, being located in rural area, having drinking density, product promotion and availability of alternative activities to drinking. Conclusions: Environmental correlates were associated with incidence of alcohol related problems in colleges nationwide. Policy implications were discussed.

Analysis of Accident Characteristics and Development of Accident Models in the Signalized Intersections of Cheongju and Cheongwon (지방부 신호교차로 사고특성분석 및 모형개발 (청주.청원을 중심으로))

  • Park, Byung-Ho;Yoo, Doo-Seon;Yang, Jeong-Mo;Lee, Young-Min
    • Journal of Korean Society of Transportation
    • /
    • v.26 no.2
    • /
    • pp.35-46
    • /
    • 2008
  • The purposes of this study are to analyze the characteristics and to develop the models of traffic accidents. In pursuing the above, this study gives particular attentions to developing the models(multiple linear, poisson and negative binomial regression) using the data of Cheongju and Cheongwon signalized intersections. The main results analyzed are as follows. First, the accident characteristics of rural area were defined by factor. Second, 4 accident models which are all statistically significant were developed. Finally, such the variables as $X_2$ and $X_{11}$ were evaluated to be specific variables which reflect the characteristics of rural area.

The Estimation of Software Development Effort Using Multiple Regression Method (다중회귀 분석을 이용한 소프트웨어 개발노력추정)

  • Jung Hye-Jung;Yang Hae-Sool;Shin Seok-Kyoo;Lee Sang-Un
    • The KIPS Transactions:PartD
    • /
    • v.11D no.7 s.96
    • /
    • pp.1483-1490
    • /
    • 2004
  • To accomplish a project successfuly, we have to estimate develpment effort accurately. But, development effort is different to software size and operation environment. Usually, we made use of function point for estimating development effort. In this paper. we make use of 789 project data. It is related to development projects in 1990`s. We investigate the variable affecting development effort. Also, we exedcute multiple liner regression analysis for looking linear relation about variables. We find the regression equation for multistage by dividing PDR that influ-enced development effort step by step.