• 제목/요약/키워드: multivariate regression analysis

검색결과 1,078건 처리시간 0.029초

INFLUENCE ANALYSIS FOR A LINEAR HYPOTHESIS IN MULTIVARIATE REGRESSION MODEL

  • Kim, Myung-Geun
    • Journal of applied mathematics & informatics
    • /
    • 제13권1_2호
    • /
    • pp.479-485
    • /
    • 2003
  • The influence of observations on the Wilks' lambda test of a linear hypothesis in multivariate regression is investigated using the local influence method. The perturbation scheme of case-weights is considered. A numerical example is given to show the effectiveness of the local influence method in identifying the influential observations.

2000년 미국대선 플로리다주의 투표결과 분석 (Statistical Outliers in Florida Counties at the Presidential Election 2000)

  • 김현철
    • 응용통계연구
    • /
    • 제15권1호
    • /
    • pp.21-32
    • /
    • 2002
  • We searched out in the votes data of the State of Florida at presidential election 2000. We used a multivariate regression analysis. We got there were several outliers including Palm Beach County. It means that we should analyze the number of disqualified ballots which were double-punched as well as the votes, to insist the " Butterfly Ballot" made Palm Beach outlier.

A multivariate adaptive regression splines model for estimation of maximum wall deflections induced by braced excavation

  • Xiang, Yuzhou;Goh, Anthony Teck Chee;Zhang, Wengang;Zhang, Runhong
    • Geomechanics and Engineering
    • /
    • 제14권4호
    • /
    • pp.315-324
    • /
    • 2018
  • With rapid economic growth, numerous deep excavation projects for high-rise buildings and subway transportation networks have been constructed in the past two decades. Deep excavations particularly in thick deposits of soft clay may cause excessive ground movements and thus result in potential damage to adjacent buildings and supporting utilities. Extensive plane strain finite element analyses considering small strain effect have been carried out to examine the wall deflections for excavations in soft clay deposits supported by diaphragm walls and bracings. The excavation geometrical parameters, soil strength and stiffness properties, soil unit weight, the strut stiffness and wall stiffness were varied to study the wall deflection behaviour. Based on these results, a multivariate adaptive regression splines model was developed for estimating the maximum wall deflection. Parametric analyses were also performed to investigate the influence of the various design variables on wall deflections.

Fused inverse regression with multi-dimensional responses

  • Cho, Youyoung;Han, Hyoseon;Yoo, Jae Keun
    • Communications for Statistical Applications and Methods
    • /
    • 제28권3호
    • /
    • pp.267-279
    • /
    • 2021
  • A regression with multi-dimensional responses is quite common nowadays in the so-called big data era. In such regression, to relieve the curse of dimension due to high-dimension of responses, the dimension reduction of predictors is essential in analysis. Sufficient dimension reduction provides effective tools for the reduction, but there are few sufficient dimension reduction methodologies for multivariate regression. To fill this gap, we newly propose two fused slice-based inverse regression methods. The proposed approaches are robust to the numbers of clusters or slices and improve the estimation results over existing methods by fusing many kernel matrices. Numerical studies are presented and are compared with existing methods. Real data analysis confirms practical usefulness of the proposed methods.

Analyzing Operation Deviation in the Deasphalting Process Using Multivariate Statistics Analysis Method

  • Park, Joo-Hwang;Kim, Jong-Soo;Kim, Tai-Suk
    • 한국멀티미디어학회논문지
    • /
    • 제17권7호
    • /
    • pp.858-865
    • /
    • 2014
  • In the case of system like MES, various sensors collect the data in real time and save it as a big data to monitor the process. However, if there is big data mining in distributed computing system, whole processing process can be improved. In this paper, system to analyze the cause of operation deviation was built using the big data which has been collected from deasphalting process at the two different plants. By applying multivariate statistical analysis to the big data which has been collected through MES(Manufacturing Execution System), main cause of operation deviation was analyzed. We present the example of analyzing the operation deviation of deasphalting process using the big data which collected from MES by using multivariate statistics analysis method. As a result of regression analysis of the forward stepwise method, regression equation has been found which can explain 52% increase of performance compare to existing model. Through this suggested method, the existing petrochemical process can be replaced which is manual analysis method and has the risk of being subjective according to the tester. The new method can provide the objective analysis method based on numbers and statistic.

다변수모델에 의한 압축지수 $C_c$ 및 압축비 $C_r$의 통계적 해석 (A Multivariate Regression Analysis for Compression Index and Compressibility Ratio)

  • 홍병만
    • 한국관개배수논문집
    • /
    • 제5권1호
    • /
    • pp.75-82
    • /
    • 1998
  • A multivariate regression analysis for compression index and com- pressibility ratio of clayey soils in regard to some soil indices, i.e natural water content, Atterberg limits, and in-situ void ratio, was presented to estimate the primary consolidation s

  • PDF

로지스틱 회귀분석을 통한 청년 우울감의 다변량 분석 및 영향 요인 연구 (Multivariate Analysis and Determinants of Youth Depression through Logistic Regression)

  • Seong Eum LEE
    • Journal of Korea Artificial Intelligence Association
    • /
    • 제1권2호
    • /
    • pp.7-13
    • /
    • 2023
  • In this paper, Depression is a mental disorder characterized by a lack of enthusiasm and feelings of sadness, which significantly impairs daily functioning. In 2018, there was an increase in book sales in the essay genre, particularly the popularity of "healing essays." This trend is seen as challenging the negative image and prejudices associated with depression. In 2021, a significant rise in the proportion of 20-year-old patients with depression is attributed to factors like job-related stress, interpersonal issues, and financial burdens. Additionally, there is a strong correlation between depression and suicidal thoughts, particularly among individuals who have experienced feelings of depression. Despite the increasing prevalence of depression among young adults, research in this area is lacking. To address this gap, statistical tools such as logistic regression and chi-squared tests are employed. The analysis reveals various independent variables associated with feelings of depression, shedding light on the relationships between these factors.

경영정보의 인과구조 구축을 위한 다변량통계기법 적용에 관한 연구 (A study on applying multivariate statistical method for making casual structure in management information)

  • 조성훈;김태성
    • 한국경영과학회:학술대회논문집
    • /
    • 한국경영과학회 1996년도 추계학술대회발표논문집; 고려대학교, 서울; 26 Oct. 1996
    • /
    • pp.117-120
    • /
    • 1996
  • The objective of this study is to suggest modified Covariance Structure Analysis that combine with existing Multivariate Statistical Method which is used Casual Analysis Method in Management Information. For this purpose, we'll consider special feature and limitation about Correlation Analysis, Regression Analysis, Path Analysis and connect Covariance Structure Analysis with Statistical Factor Analysis so that theoretical casual model compare with variables structure in collecting data. A example is also presented to show the practical applicability of this approach.

  • PDF

다변량 분위수 회귀나무 모형에 대한 연구 (Multivariate quantile regression tree)

  • 김재오;조형준;방성완
    • Journal of the Korean Data and Information Science Society
    • /
    • 제28권3호
    • /
    • pp.533-545
    • /
    • 2017
  • 분위수 회귀모형은 반응변수의 조건부 분포에 대하여 포괄적이고 유용한 통계적 정보를 제공한다. 그러나 많은 실제 자료는 설명변수와 반응변수가 비선형의 관계를 갖고 있어 전통적인 선형 분위수 회귀모형은 왜곡되고 잘못된 결과를 초래할 수 있다. 또한 자료의 복잡성이 증가하여 반응변수가 여러개인 다변량 자료의 분석에 대한 보다 정확한 예측과 더불어 풍부한 해석에 대한 요구가 증가하고 있다. 이러한 이유로 본 연구에서는 다변량 분위수 회귀나무 모형을 제안하였다. 본 연구에서는 기존의 다변량 회귀나무 모형의 분할변수 선택 알고리즘의 문제점을 지적하고 향상된 분할변수 선택 알고리즘을 제안하였다. 제안한 알고리즘은 합리적인 계산시간으로 적용 가능하며 분할변수 선택에서 편향 발생의 문제를 갖지 않는 동시에 기존 방법보다 더 정확하게 분할변수를 선택할 수 있있다. 본 연구에서는 모의실험과 실증 예제를 통해 제안한 방법의 우수한 성능과 유용성을 확인하였다.

다변량 분석법을 이용한 소양강댐 상류 유역의 하천 수질 평가 (Evaluation of Water Quality on the Upstreams of the Soyanggang Dam by using Multivariate Analysis)

  • 최한규;백효선;허준영
    • 산업기술연구
    • /
    • 제22권A호
    • /
    • pp.201-210
    • /
    • 2002
  • The object of this study is to evaluate the factors affecting the water quality and to propose the influence of dominant factor quantitatively. The correlation analysis was performed to know the correlationship among the water quality items As a result of partial correlation analysis, it was shown that the water quality items are affected by the rainfall item directly. The factor analysis was performed to grasp some number of factors on each point for deducing the items of similar variable characteristics. The four points were divided into different factor groups. It was grasped that $NH_3-N$ and $NO_3-N$ Items have different variable characteristics after comparing the items. The Multiple regression analysis can decrease the number of observation. In the deduced multiple regression formula, it was shown that the rate of T-N, $NH_3-N$ and $NO_3-N$ in the independent variable took about 60% among all the regression formulas.

  • PDF