• Title/Summary/Keyword: Correlation regression analysis

Search Result 6,870, Processing Time 0.03 seconds

Canonical Correlation: Permutation Tests and Regression

  • Yoo, Jae-Keun;Kim, Hee-Youn;Um, Hye-Yeon
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.3
    • /
    • pp.471-478
    • /
    • 2012
  • In this paper, we present a permutation test to select the number of pairs of canonical variates in canonical correlation analysis. The existing chi-squared test is known to be limited to normality in use. We compare the existing test with the proposed permutation test and study their asymptotic behaviors through numerical studies. In addition, we connect canonical correlation analysis to regression and we we show that certain inferences in regression can be done through canonical correlation analysis. A regression analysis of real data through canonical correlation analysis is illustrated.

Analysis of the Correlation and Regression Analysis Studies from the Korean Journal of Women Health Nursing over the Past Three Years (2007~2009) (최근 3년간(2007~2009년) 여성건강간호학회지의 상관분석과 회귀분석 통계활용 논문 분석)

  • Lee, Eun-Joo;Lee, Eun-Hee;Kim, Jeung-Im;Kang, Hee-Sun;Oh, Hyun-Ei;Jun, Eun-Mi;Cheon, Suk-Hee
    • Women's Health Nursing
    • /
    • v.17 no.2
    • /
    • pp.187-194
    • /
    • 2011
  • Purpose: This study investigated the statistical methods and the results had reported correlation/regression analysis in the studies of Korean Journal of Women Health Nursing (KJWHN). Methods: We reviewed 45 studies using correlation/regression analysis for the suitability of the statistical methods and the research purposes, the criteria for analysis of figures, tables and charts had published in the KJWHN from vol 13 (1) in 2007 to vol 15 (4) in 2009. Results: Forty three studies were fitted to their statistical methodology and their research purposes. Eleven studies considered the minimum sample size. Fourteen regression studies used multiple regression and 12 studies used forward method for variable entry. Only one study among the 17 regression studies accomplished scatter plots and residuals examination. Sixteen studies in correlation studies and six studies in regression studies showed some errors in either the title, variables, category of figures, tables and charts. In the regression study, all reported $R^2$ and ${\beta}$ values except one. Conclusion: It was found that there were still statistical errors or articulation errors in the statistical analysis. All reviewers need to be reviewed more closely for detecting errors not only during reviewing process of the manuscript but also periodic publication for the quality of this academic journal.

Statistical Study on Correlation Between Design Variable and Shape Error in Flexible Stretch Forming (가변스트레치성형 설계변수와 성형오차의 상관관계에 대한 통계적 연구)

  • Seo, Y.H.;Heo, S.C.;Kang, B.S.;Kim, J.
    • Transactions of Materials Processing
    • /
    • v.20 no.2
    • /
    • pp.124-131
    • /
    • 2011
  • A flexible stretch forming process is useful for small quantity batch production because various shape changes of the flexible die can be achieved conveniently. In this study, the design variables, namely, the punch size, curvature radius and elastic pad thickness, were quantitatively evaluated to understand their influence on sheet formability using statistical methods such as the correlation and regression analyses. Forming simulations were designed and conducted by a three-way factorial design to obtain numerical values of a shape error. Linear relationships between the design variables and the shape error resulted from the Pearson correlation analysis. Subsequently, a regression analysis was also conducted between the design variables and the shape error. A regression equation was derived and used in the flexible die design stage to estimate the shape error.

Correlations and Regression Analysis Between Reservoir Water Quality Parameters (농업용저수지 수질인자간 상관성 및 획귀분석)

  • Choi, Eun-Hee;Park, Youmg-Suk
    • KCID journal
    • /
    • v.18 no.1
    • /
    • pp.25-32
    • /
    • 2011
  • In order to effectively manage the reservoir, reservoir water quality management should be based on physicochemical and configurational characteristics. In this research, correlation between factors affecting the reservoir water quality was examined. Chl-a and COD shows the highest positive correlation. Chl-a and T-P also has a high positive correlation, however Chl-a and T-N show lower correlation relatively. Even though T-N is an important factor for phytoplankton growth which increase Chl-a concentration, corelation of Ch1-a and T-N shows that enough nitrogen in the reservoir isn't no longer limiting factor. The age of reservoir can cause of increasing COD and SS. Embankment height and elevation of reservoirs shows strong negative correlation to water quality. That means reservoir which is higher embankment height and locate in higher elevations is less contaminated. Regression expression was derived with Chl-a and water quality parameters, and height of reservoir. Finally Chl-a was simulated using regression expression and it was a good approach to predict the Chl-a concentration.

  • PDF

Understanding of the Misuse Cases of Quantitative and Qualitative Regression Analysis (정량적, 정성적 회귀분석의 오적용과 이해)

  • Choe, Seong-Un
    • Proceedings of the Safety Management and Science Conference
    • /
    • 2011.11a
    • /
    • pp.213-217
    • /
    • 2011
  • The research shows misuse cases of quantitative regression analysis used in QC circle activity and six sigma movement which presents guidelines of correct use for quality practitioners. Additionally, the qualitative regression analysis that responses nonconforming ratio of variable y, is reviewed based on misuse cases for proper use by practitioners in the field. In most cases, there are frequent errors that involve the correlation analysis or ANOVA, regardless of using quantitative regression analysis. In addition, qualitative regression analysis for the nonconforming ratio that has dependent variable of discrete and categorical data, is often applied with quantitative regression and result in ineffective quality improvement.

  • PDF

Analysis for Insulating Degradation Characteristics with Aging Time for Oil-filled Transformers and/or Correlation between using Linear Regression Method (유입식 변압기의 열화시간에 따른 절연 열화특성 및 선형회귀법을 이용한 상관관계 분석)

  • Lee, Seung-Min
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.59 no.4
    • /
    • pp.693-699
    • /
    • 2010
  • General transformer's life is known as paper insulation' life. If a transformer is degraded by these aging factors, it is known that electrical, mechanical and chemical characteristics for transformer's oil-paper are changed. When the kraft paper is aged, the cellulose polymer chains break down into shorter lengths. It causes decrease in both tensile strength and degree of polymerization of paper insulation. The paper breakdown is accompanied by an increase in the content of furanic compounds within the dielectric liquid. In this paper it is aimed at analysis on correlation between aging characteristics for insulating diagnosis of thermally aged paper. For investigating the accelerated aging process of oil-paper samples accelerating aging cell was manufactured for estimating variation of paper insulation during 500 hours at $140^{\circ}C$ temperature. To derive the results, it was performed analysis such as tensile strength(TS), depolymerization(DP), dielectric strength(DS), relative permittivity, water content(WC) and furan compound(FC) for aged paper. Also for analyzing correlation between insulating degradation characteristics, we used linear regression method. As as results of linear regression analysis, there was a close correlation between TS and DP. WC, FC. But dielectric strength was a weak correlation with aging time.

Forecasting Energy Consumption of Steel Industry Using Regression Model (회귀 모델을 활용한 철강 기업의 에너지 소비 예측)

  • Sung-Ho KANG;Hyun-Ki KIM
    • Journal of Korea Artificial Intelligence Association
    • /
    • v.1 no.2
    • /
    • pp.21-25
    • /
    • 2023
  • The purpose of this study was to compare the performance using multiple regression models to predict the energy consumption of steel industry. Specific independent variables were selected in consideration of correlation among various attributes such as CO2 concentration, NSM, Week Status, Day of week, and Load Type, and preprocessing was performed to solve the multicollinearity problem. In data preprocessing, we evaluated linear and nonlinear relationships between each attribute through correlation analysis. In particular, we decided to select variables with high correlation and include appropriate variables in the final model to prevent multicollinearity problems. Among the many regression models learned, Boosted Decision Tree Regression showed the best predictive performance. Ensemble learning in this model was able to effectively learn complex patterns while preventing overfitting by combining multiple decision trees. Consequently, these predictive models are expected to provide important information for improving energy efficiency and management decision-making at steel industry. In the future, we plan to improve the performance of the model by collecting more data and extending variables, and the application of the model considering interactions with external factors will also be considered.

Regression and Correlation Analysis via Dynamic Graphs

  • Kang, Hee Mo;Sim, Songyong
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.3
    • /
    • pp.695-705
    • /
    • 2003
  • In this article, we propose a regression and correlation analysis via dynamic graphs and implement them in Java Web Start. For the polynomial relations between dependent and independent variables, dynamic graphics are implemented for both polynomial regression and spline estimates for an instant model selection. The results include basic statistics. They are available both as a web-based service and an application.

Correlation analysis of primal cuts weight, fat contents, and auction prices in Landrace × Yorkshire × Duroc pig carcasses by VCS2000

  • Youngho Lim;Yunhwan Park;Gwantae Kim;Jaeyoung Kim;Jongtae Seo;Jaesik Lee;Jungseok Choi
    • Journal of Animal Science and Technology
    • /
    • v.66 no.4
    • /
    • pp.834-845
    • /
    • 2024
  • Currently, in pork auctions in Korea, only carcass weight and backfat thickness provide information on meat quantity, while the production volume of primal cuts and fat contents remains largely unknown. This study aims to predict the production of primal cuts in pigs and investigate how these carcass traits affect pricing. Using the VCS2000, the production of shoulder blade, loin, belly, shoulder picnic, and ham was measured for gilts (17,257 pigs) and barrows (16,365 pigs) of LYD (Landrace × Yorkshire × Duroc) pigs. Single and multiple regression analysis were conducted to analyze the relationship between the primal cuts and carcass weight. The study also examined the correlation between each primal cut, backfat thickness (1st thoracic vertebra backfat thickness, grading backfat thickness, and Multi-brached muscle middle backfat thickness), pork belly fat percentage, total fat yield, and auction price. A multiple regression analysis was conducted between the carcass traits that showed a high correlation and the auction price. After conducting a single regression analysis on the primal cuts of gilt and barrow, all coefficients of determination (R2) were 0.77 or higher. In the multiple regression analysis, the R2 value was 0.98 or higher. The correlation coefficient between the carcass weights and the auction price exceeded 0.70, while the correlation coefficients between the primal cuts and the auction prices were above 0.65. In terms of fat content, the backfat thickness of gilt exhibited a correlation coefficient of 0.70, and all other items had a correlation coefficient of 0.47 or higher. The correlation coefficients between the Forequarter, Middle, and Hindquarter and the auction price were 0.62 or higher. The R2 values of the multiple regression analysis between carcass traits and auction price were 0.5 or higher for gilts and 0.4 or higher for barrows. The regression equations between carcass weight and primal cuts derived in this study exhibited high determination coefficients, suggesting that they could serve as reliable means to predict primal cut production from pig carcasses. Elucidating the correlation between primal cuts, fat contents and auction prices can provide economic indicators for pork and assist in guiding the direction of pig farming.

Correlation and Simple Linear Regression (상관성과 단순선형회귀분석)

  • Pak, Son-Il;Oh, Tae-Ho
    • Journal of Veterinary Clinics
    • /
    • v.27 no.4
    • /
    • pp.427-434
    • /
    • 2010
  • Correlation is a technique used to measure the strength or the degree of closeness of the linear association between two quantitative variables. Common misuses of this technique are highlighted. Linear regression is a technique used to identify a relationship between two continuous variables in mathematical equations, which could be used for comparison or estimation purposes. Specifically, regression analysis can provide answers for questions such as how much does one variable change for a given change in the other, how accurately can the value of one variable be predicted from the knowledge of the other. Regression does not give any indication of how good the association is while correlation provides a measure of how well a least-squares regression line fits the given set of data. The better the correlation, the closer the data points are to the regression line. In this tutorial article, the process of obtaining a linear regression relationship for a given set of bivariate data was described. The least square method to obtain the line which minimizes the total error between the data points and the regression line was employed and illustrated. The coefficient of determination, the ratio of the explained variation of the values of the independent variable to total variation, was described. Finally, the process of calculating confidence and prediction interval was reviewed and demonstrated.