• 제목/요약/키워드: Multivariate Statistical Analysis

검색결과 632건 처리시간 0.025초

다변수통계방법을 이용한 산지분류에 관한 연구 (A Study on Forest Land Classification Using Multivariate Statistical Methods : A Case Study at Mt. Kwanak)

  • 정순오
    • 한국조경학회지
    • /
    • 제13권1호
    • /
    • pp.43-66
    • /
    • 1985
  • Korea needs proper and rational public policies on conservation and use of forest land and other natural resources because of the accelerating expansion of national land developments in recent years. Unfortunately, there is no systematic planning system to support the needs. Generally, forest land use planning needs suitability analysis based on efficient land classification system. The goal of this study was to classify a forest land using multivariate satistical methods. A case study was carried out in winter of 1983 on a mountainous area higher than 100m above sea level located at Mt. Kwanak in Anyang -city, Kyung-gi-do (province). The study area was 19.80 km$^2$wide and was divided into 1, 383 Operational Taxonomic Units (OTU's) by a 120m$\times$120m grid. Fourteen descriptors were identified and quantified for each OTU from existing national land data : elevation, slope, aspect, terrain form, geologic material, surface soil permeability, topsoil type, depth of the solum, soil acidity, forest cover type, stand size class, stand age class, stand density class, and simple forest soil capability class. For this study, a FORTRAN IV program was written for input and output map data, and the computer statistics packages, SPSS and BMD, were used to perform the multivariate statistical analysis. Fourteen variables were analyzed to investigate the characteristics of their fire quench distribution and to estimate the correlation coefficients among them. Principal component analysis was executed to find the dimensions of forest land characteristics, and factor scores were used for proper samples of OTU throughout the study area. In order to develop the classes of forest land classification based on 102 surrogates, cluster and discriminant analyses of principal descriptor variable matrix were undertaken. Results obtained through a series of multivariate statistical analyses were as follows ; 1) Principal component analysis was proved to be a useful tool for data selection and identification of principal descriptor variables which represented the characteristics of forest land and facilitated the selection of samples.

  • PDF

다변량 통계 기법을 이용한 물리검층 자료로부터의 암석물리학상 결정 (Automatic Electrofacies Classification from Well Logs Using Multivariate Statistical Techniques)

  • 임종세;김정환;강주명
    • 지구물리와물리탐사
    • /
    • 제1권3호
    • /
    • pp.170-175
    • /
    • 1998
  • 이 연구는 다변량 통계 기법을 이용한 물리검층 자료로부터의 암석물리학상 결정으로 암상을 예측하는 것이다. 기술 통계 분석으로 물리검층 자료의 특성을 파악하고 주성분 분석에 의한 다변량 검층 자료들의 상관도 분석을 통해 변수들을 변환시켜 새로운 변수인 주성분을 구하고 변수들의 차원을 축소한다. 통계적 방법에 의한 주성분 검층 자료의 구획에 의한 효율적 자료 축소와 계산의 효율성을 높여 양질의 해석결과를 얻을 수 있다. 구획된 주성분 검층 자료로부터 계보적 군집 분석에 의해 암석물리학상을 결정한다. 최적 암석물리학상의 수는 전체 변동과 군집내의 변동사이의 비와 코어자료 등에 의해 비교 결정된다. 이 연구에서 개발된 암석물리학상 결정법을 국내대륙붕 물리검층자료에 적용한 결과 결정된 암석물리학상은 시추 코어 및 시추 암편 분석에 의한 암상 구분화와 잘 일치하였다. 이러한 연구는 저류층 특성인자의 신뢰성 있고 정량적인 평가로 유전 개발 및 생산 계획 시 유용한 도구로 활용될 수 있을 것이다.

  • PDF

Diagnosis of Observations after Fit of Multivariate Skew t-Distribution: Identification of Outliers and Edge Observations from Asymmetric Data

  • Kim, Seung-Gu
    • 응용통계연구
    • /
    • 제25권6호
    • /
    • pp.1019-1026
    • /
    • 2012
  • This paper presents a method for the identification of "edge observations" located on a boundary area constructed by a truncation variable as well as for the identification of outliers and the after fit of multivariate skew $t$-distribution(MST) to asymmetric data. The detection of edge observation is important in data analysis because it provides information on a certain critical area in observation space. The proposed method is applied to an Australian Institute of Sport(AIS) dataset that is well known for asymmetry in data space.

다변량 통계기법을 활용한 실시간 수질이상 유무 판단 시스템 개발 (Development of Real-Time Water Quality Abnormality Warning System for Using Multivariate Statistical Method)

  • 허태영;전항배;박상민;이영주
    • 대한환경공학회지
    • /
    • 제37권3호
    • /
    • pp.137-144
    • /
    • 2015
  • 본 연구는 다변량 통계기법 중 하나인 주성분분석을 활용하여 실시간으로 수질이상 유무를 판단할 수 있는 경보시스템 개발을 목적으로 하였다. 본 연구에서는 다변량 분석 방법 중 수질항목 간의 상관성을 고려한 주성분 분석 방법을 실시간으로 수질이상 유무를 판단하는 알고리즘에 적용시켰다. K-water에서 제공하는 실제 자료를 이용하여 수질 이상에 대한 실시간 감시 알고리즘의 활용성을 검증하였으며, 집중호우 등과 같은 기후변화에 따른 수질이상에 대해서는 기상청 자료와의 비교를 통해 검증하였다.

다변량 확률분포함수의 추정을 위한 MKDE-ebd 개발 (Development of MKDE-ebd for Estimation of Multivariate Probabilistic Distribution Functions)

  • 강영진;노유정;임오강
    • 한국전산구조공학회논문집
    • /
    • 제32권1호
    • /
    • pp.55-63
    • /
    • 2019
  • 공학문제에서 많은 확률 변수들은 상관성을 가지고 있고, 입력변수의 상관성은 기계시스템의 통계적 성능 분석 결과에 큰 영향을 미친다. 하지만, 상관 변수들은 결합분포함수를 모델링하기 어렵다는 이유로 종종 독립변수로 취급되거나 특정한 모수적 모델로 표현되는 경우가 많으며, 특히 데이터가 적은 경우 결합분포함수를 정확히 모델링하는데 더 큰 어려움이 있다. 본 연구에서 개발된 경계데이터를 이용한 다변량 커널밀도추정은 비선형성을 갖는 다양한 형태의 다변량 확률 분포 추정을 위해 개발되었다. 다변량 커널밀도추정은 주어진 데이터와 균등분포함수의 파라미터의 신뢰구간으로부터 생성된 경계데이터를 결합하여 데이터의 질과 수에 덜 민감하다. 따라서 제안된 방법은 보수적인 통계모델링과 신뢰성 해석 결과를 도출할 수 있으며, 통계시뮬레이션과 공학예제를 통해 그 성능을 검증하였다.

Note on response dimension reduction for multivariate regression

  • Yoo, Jae Keun
    • Communications for Statistical Applications and Methods
    • /
    • 제26권5호
    • /
    • pp.519-526
    • /
    • 2019
  • Response dimension reduction in a sufficient dimension reduction (SDR) context has been widely ignored until Yoo and Cook (Computational Statistics and Data Analysis, 53, 334-343, 2008) founded theories for it and developed an estimation approach. Recent research in SDR shows that a semi-parametric approach can outperform conventional non-parametric SDR methods. Yoo (Statistics: A Journal of Theoretical and Applied Statistics, 52, 409-425, 2018) developed a semi-parametric approach for response reduction in Yoo and Cook (2008) context, and Yoo (Journal of the Korean Statistical Society, 2019) completes the semi-parametric approach by proposing an unstructured method. This paper theoretically discusses and provides insightful remarks on three versions of semi-parametric approaches that can be useful for statistical practitioners. It is also possible to avoid numerical instability by presenting the results for an orthogonal transformation of the response variables.

다변량 통계분석법을 이용한 PET 중합공정 중 직접 에스테르화 반응기의 거동 및 생산제품 예측 (Multivariate Statistical Analysis Approach to Predict the Reactor Properties and the Product Quality of a Direct Esterification Reactor for PET Synthesis)

  • 김성영;정창복;최수형;이범석;이범석
    • 제어로봇시스템학회논문지
    • /
    • 제11권6호
    • /
    • pp.550-557
    • /
    • 2005
  • The multivariate statistical analysis methods, using both multiple linear regression(MLR) and partial least square(PLS), have been applied to predict the reactor properties and the product quality of a direct esterification reactor for polyethylene terephthalate(PET) synthesis. On the basis of the set of data including the flow rate of water vapor, the flow rate of EG vapor, the concentration of acid end groups of a product and other operating conditions such as temperature, pressure, reaction times and feed monomer mole ratio, two multi-variable analysis methods have been applied. Their regression and prediction abilities also have been compared. The prediction results are critically compared with the actual plant data and the other mathematical model based results in reliability. This paper shows that PLS method approach can be used for the reasonably accurate prediction of a product quality of a direct esterification reactor in PET synthesis process.

다변량 통계기법을 이용한 K및 n의 산정에 관한 연구 (A Study on the Estimation of Coefficients K and n Using Multivariate Data Analysis)

  • 백용진;최재성;배동명;김경진
    • 한국소음진동공학회논문집
    • /
    • 제13권8호
    • /
    • pp.583-590
    • /
    • 2003
  • For the preestimate of the vibration level of the ground next to a dwelling, a multivariate statistical analysis on the experiment data acquired from a variety of construction sites was performed, and then a new estimate model for the value of K and n that can be applied in the diagnosis of the damage was offered. The results maybe summarized as follows : First, the $K_{95}$ and n showed high correlation at P$\leq$0.05. Specially the correlation coefficient about $W_{max}$, S were higher in $K_{95}$ than in n. indicating that $K_{95}$ is generally associated with source conditions. Second, the factor analysis permitted to identify two major sources in each fraction. These sources accounted for at least 73 % of valiance of $K_{95}$. Third, the multiple regression model for the estimate of $K_{95}$ was developed from Fac1 which depend upon the source conditions and Fac2 which depend upon the transmission conditions. The n value is able to determine from the correlation relationship associated with $K_{95}$./.

A Comparison Study of Multivariate Binary and Continuous Outcomes

  • Pak, Dae-Woo;Cho, Hyung-Jun
    • 응용통계연구
    • /
    • 제25권4호
    • /
    • pp.605-612
    • /
    • 2012
  • Multivariate data are often generated with multiple outcomes in various fields. Multiple outcomes could be mixed as continuous and discrete. Because of their complexity, the data are often dealt with by separately applying regression analysis to each outcome even though they are associated the each other. This univariate approach results in the low efficiency of estimates for parameters. We study the efficiency gains of the multivariate approaches relative to the univariate approach with the mixed data that include continuous and binary outcomes. All approaches yield consistent estimates for parameters with complete data. By jointly estimating parameters using multivariate methods, it is generally possible to obtain more accurate estimates for parameters than by a univariate approach. The association between continuous and binary outcomes creates a gap in efficiency between multivariate and univariate approaches. We provide a guidance to analyze the mixed data.