• Title/Summary/Keyword: Regression Analysis Method

Search Result 4,600, Processing Time 0.028 seconds

Orographic Precipitation Analysis with Regional Frequency Analysis and Multiple Linear Regression (지역빈도해석 및 다중회귀분석을 이용한 산악형 강수해석)

  • Yun, Hye-Seon;Um, Myoung-Jin;Cho, Won-Cheol;Heo, Jun-Haeng
    • Journal of Korea Water Resources Association
    • /
    • v.42 no.6
    • /
    • pp.465-480
    • /
    • 2009
  • In this study, single and multiple linear regression model were used to derive the relationship between precipitation and altitude, latitude and longitude in Jejudo. The single linear regression analysis was focused on whether orographic effect was existed in Jejudo by annual average precipitation, and the multiple linear regression analysis on whether orographic effect was applied to each duration and return period of quantile from regional frequency analysis by index flood method. As results of the regression analysis, it shows the relationship between altitude and precipitation strongly form a linear relationship as the length of duration and return period increase. The multiple linear regression precipitation estimates(which used altitude, latitude, and longitude information) were found to be more reasonable than estimates obtained using altitude only or altitude-latitude and altitude-longitude. Especially, as results of spatial distribution analysis by kriging method using GIS, it also provides realistic estimates for precipitation that the precipitation was occurred the southeast region as real climate of Jejudo. However, the accuracy of regression model was decrease which derived a short duration of precipitation or estimated high region precipitation even had long duration. Consequently the other factor caused orographic effect would be needed to estimate precipitation to improve accuracy.

On Bootstrapping; Bartlett Adjusted Empirical Likelihood Ratio Statistic in Regression Analysis

  • Woochul Kim;Duk-Hyun Ko;Keewon Lee
    • Journal of the Korean Statistical Society
    • /
    • v.25 no.2
    • /
    • pp.205-216
    • /
    • 1996
  • The bootstrap calibration method for empirical likelihood is considered to make a confidence region for the regression coefficients. Asymptotic properties are studied regarding the coverage probability. Small sample simulation results reveal that the bootstrap calibration works quite well.

  • PDF

Mixed Effects Kernel Binomial Regression

  • Hwang, Chang-Ha
    • Journal of the Korean Data and Information Science Society
    • /
    • v.19 no.4
    • /
    • pp.1327-1334
    • /
    • 2008
  • Mixed effect binomial regression models are widely used for analysis of correlated count data in which the response is the result of a series of one of two possible disjoint outcomes. In this paper, we consider kernel extensions with nonparametric fixed effects and parametric random effects. The estimation is through the penalized likelihood method based on kernel trick, and our focus is on the efficient computation and the effective hyperparameter selection. For the selection of hyperparameters, cross-validation techniques are employed. Examples illustrating usage and features of the proposed method are provided.

  • PDF

Preventing the Musculoskeletal Disorders using Association Rule - Based on Result of Multiple Logistic Regression - (연관규칙을 이용한 근골격계 질환 예방 - 다변량 로지스틱 회귀분석의 결과를 기반으로 -)

  • Park, Seung-Hun;Lee, Seog-Hwan
    • Journal of the Korea Safety Management & Science
    • /
    • v.9 no.4
    • /
    • pp.29-38
    • /
    • 2007
  • We adapted association rules of data mining in order to investigate the relation among the factors of musculoskeletal disorders and proposed the method of preventing the musculoskeletal disorders associated with multiple logistic regression in previous study. This multiple logistic regression was difficult to establish the method of preventing musculoskeletal disorders in case factors can't be managed by worker himself, i.e., age, gender, marital status. In order to solve this problem, we devised association rules of factors of musculoskeletal disorders and proposed the interactive method of preventing the musculoskeletal disorders, by applying association rules with the result of multiple logistic regression in previous study. The result of correlation analysis showed that prevention method of one part also prevents musculoskeletal disorders of other parts of body.

A modified partial least squares regression for the analysis of gene expression data with survival information

  • Lee, So-Yoon;Huh, Myung-Hoe;Park, Mira
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.5
    • /
    • pp.1151-1160
    • /
    • 2014
  • In DNA microarray studies, the number of genes far exceeds the number of samples and the gene expression measures are highly correlated. Partial least squares regression (PLSR) is one of the popular methods for dimensional reduction and known to be useful for the classifications of microarray data by several studies. In this study, we suggest a modified version of the partial least squares regression to analyze gene expression data with survival information. The method is designed as a new gene selection method using PLSR with an iterative procedure of imputing censored survival time. Mean square error of prediction criterion is used to determine the dimension of the model. To visualize the data, plot for variables superimposed with samples are used. The method is applied to two microarray data sets, both containing survival time. The results show that the proposed method works well for interpreting gene expression microarray data.

Analyzing Operation Deviation in the Deasphalting Process Using Multivariate Statistics Analysis Method

  • Park, Joo-Hwang;Kim, Jong-Soo;Kim, Tai-Suk
    • Journal of Korea Multimedia Society
    • /
    • v.17 no.7
    • /
    • pp.858-865
    • /
    • 2014
  • In the case of system like MES, various sensors collect the data in real time and save it as a big data to monitor the process. However, if there is big data mining in distributed computing system, whole processing process can be improved. In this paper, system to analyze the cause of operation deviation was built using the big data which has been collected from deasphalting process at the two different plants. By applying multivariate statistical analysis to the big data which has been collected through MES(Manufacturing Execution System), main cause of operation deviation was analyzed. We present the example of analyzing the operation deviation of deasphalting process using the big data which collected from MES by using multivariate statistics analysis method. As a result of regression analysis of the forward stepwise method, regression equation has been found which can explain 52% increase of performance compare to existing model. Through this suggested method, the existing petrochemical process can be replaced which is manual analysis method and has the risk of being subjective according to the tester. The new method can provide the objective analysis method based on numbers and statistic.

Analysis of the Correlation and Regression Analysis Studies from the Korean Journal of Women Health Nursing over the Past Three Years (2007~2009) (최근 3년간(2007~2009년) 여성건강간호학회지의 상관분석과 회귀분석 통계활용 논문 분석)

  • Lee, Eun-Joo;Lee, Eun-Hee;Kim, Jeung-Im;Kang, Hee-Sun;Oh, Hyun-Ei;Jun, Eun-Mi;Cheon, Suk-Hee
    • Women's Health Nursing
    • /
    • v.17 no.2
    • /
    • pp.187-194
    • /
    • 2011
  • Purpose: This study investigated the statistical methods and the results had reported correlation/regression analysis in the studies of Korean Journal of Women Health Nursing (KJWHN). Methods: We reviewed 45 studies using correlation/regression analysis for the suitability of the statistical methods and the research purposes, the criteria for analysis of figures, tables and charts had published in the KJWHN from vol 13 (1) in 2007 to vol 15 (4) in 2009. Results: Forty three studies were fitted to their statistical methodology and their research purposes. Eleven studies considered the minimum sample size. Fourteen regression studies used multiple regression and 12 studies used forward method for variable entry. Only one study among the 17 regression studies accomplished scatter plots and residuals examination. Sixteen studies in correlation studies and six studies in regression studies showed some errors in either the title, variables, category of figures, tables and charts. In the regression study, all reported $R^2$ and ${\beta}$ values except one. Conclusion: It was found that there were still statistical errors or articulation errors in the statistical analysis. All reviewers need to be reviewed more closely for detecting errors not only during reviewing process of the manuscript but also periodic publication for the quality of this academic journal.

A Study on Improving the predict accuracy rate of Hybrid Model Technique Using Error Pattern Modeling : Using Logistic Regression and Discriminant Analysis

  • Cho, Yong-Jun;Hur, Joon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.2
    • /
    • pp.269-278
    • /
    • 2006
  • This paper presents the new hybrid data mining technique using error pattern, modeling of improving classification accuracy. The proposed method improves classification accuracy by combining two different supervised learning methods. The main algorithm generates error pattern modeling between the two supervised learning methods(ex: Neural Networks, Decision Tree, Logistic Regression and so on.) The Proposed modeling method has been applied to the simulation of 10,000 data sets generated by Normal and exponential random distribution. The simulation results show that the performance of proposed method is superior to the existing methods like Logistic regression and Discriminant analysis.

  • PDF

RS-based method for estimating statistical moments and its application to reliability analysis (반응표면을 활용한 통계적 모멘트 추정 방법과 신뢰도해석에 적용)

  • Huh, Jae-Sung;Kwak, Byung-Man
    • Proceedings of the KSME Conference
    • /
    • 2004.11a
    • /
    • pp.852-857
    • /
    • 2004
  • A new and efficient method for estimating the statistical moments of a system performance function has been developed. The method consists of two steps: (1) An approximate response surface is generated by a quadratic regression model, and (2) the statistical moments of the regression model are then calculated by experimental design techniques proposed by Seo and $Kwak^{(4)}$. In this approach, the size of experimental region affects the accuracy of the statistical moments. Therefore, the region size should be selected suitably. The D-optimal design and the central composite design are adopted over the selected experimental region for the regression model. Finally, the Pearson system is adopted to decide the distribution type of the system performance function and to analyze structural reliability.

  • PDF

Estimation of Users대 Benefit Value for Woobang Tower Land in Taegu Using Travel Cost Method (여행비용접근법을 통한 대구 우방타워랜드의 편익가치 측정)

  • 김수봉;심애경;권기찬
    • Journal of Environmental Science International
    • /
    • v.10 no.3
    • /
    • pp.173-178
    • /
    • 2001
  • The aim of this paper is to evaluate users benefit values of theme park using Travel Cost Method with special reference to Woobang Tower Land in Taegu for the estimation of economic values. This research is mainly based on questionnaire survey of 100 users of the theme park. Socio-economic factors such as income, year of education, annual income, age and money(travel cost) are analysed from 5 residential areas of the respondents. Multiple regression analysis was used for the evaluation of annual number of park visitings based on the analysis. The regression model shows NV = $\alpha$+$\beta_1$TC+$\beta_2$INC+$\beta_3$EDU+$\beta_4$AGE (NV : Annual Number of Visitings, TC : Travel Cost, INC : Annual Income, EDU : Years of Education, AGE : Age). Regarding to visitors demand curve based on the equation showed that annual economic values of Woobang Tower Land was estimated as 50billion Korean Won.

  • PDF