• 제목/요약/키워드: multivariate data analysis

검색결과 1,441건 처리시간 0.031초

FAULT DETECTION, MONITORING AND DIAGNOSIS OF SEQUENCING BATCH REACTOR FOR INTEGRATED WASTEWATER TREATMENT MANAGEMENT SYSTEM

  • Yoo, Chang-Kyoo;Vanrolleghem, Peter A.;Lee, In-Beum
    • Environmental Engineering Research
    • /
    • 제11권2호
    • /
    • pp.63-76
    • /
    • 2006
  • Multivariate analysis and batch monitoring on a pilot-scale sequencing batch reactor (SBR) are described for integrated wastewater treatment management system, where a batchwise multiway independent component analysis method (MICA) are used to extract meaningful hidden information from non-Gaussian wastewater treatment data. Three-way batch data of SBR are unfolded batch-wisely, and then a non-Gaussian multivariate monitoring method is used to capture the non-Gaussian characteristics of normal batches in biological wastewater treatment plant. It is successfully applied to an 80L SBR for biological wastewater treatment, which is characterized by a variety of error sources with non-Gaussian characteristics. The batchwise multivariate monitoring results of a pilot-scale SBR for integrated wastewater treatment management system showed more powerful monitoring performance on a WWTP application than the conventional method since it can extract non-Gaussian source signals which are independent and cross-correlation of variables.

월유량에 대한 일변량 및 다변량 AR모형의 비교 (A Comparison of Univariate and Multivariate AR Models for Monthly River Flow Series)

  • 이원환;심재현
    • 물과 미래
    • /
    • 제23권1호
    • /
    • pp.99-107
    • /
    • 1990
  • 수자원 개발계획 및 목공구조물의 합리적 설계를 위해서는 과거의 수문관측자료에 의거한 해석이 필요하며, 일반적인 수문현상은 무작위적인 인자가 포함되기 때문에 이를 고려한 통계적 기법, 즉 추계학적 해석기법이 필요하다고 하겠다. 본 연구에서는 남한강 상류의 동일유역 4개 지점(단양, 정선, 영월, 평창)의 월유량 자료를 일변량 AR(1), AR(2)모형과 다변량 AR(1), AR(2)모형에 적용하여 각 모형의 통계적 특성치를 분석하고, 월유량을 모의발생시켜, 일변량 모형과 다변량 모형을 비교하였다. 각각의 모형에 의한 모의발생 계열의 비교, 분석을 통하여 볼 때, 단일지점만을 고려하는 일변량 모형에 비해 지점간의 공선형성을 고려하는 다변량 모형이 동일유역의 월유량 해석에 있어서 더 적합함을 알 수 있었다.

  • PDF

식생이 무성한 지역에서의 Principal Component Analysis 에 의한 Landsat TM 자료의 광역지질도 작성 (Regional Geological Mapping by Principal Component Analysis of the Landsat TM Data in a Heavily Vegetated Area)

  • 朴鍾南;徐延熙
    • 대한원격탐사학회지
    • /
    • 제4권1호
    • /
    • pp.49-60
    • /
    • 1988
  • Principal Component Analysis (PCA) was applied for regional geological mapping to a multivariate data set of the Landsat TM data in the heavily vegetated and topographically rugged Chungju area. The multivariate data set selection was made by statistical analysis based on the magnitude of regression of squares in multiple regression, and it includes R1/2/R3/4, R2/3, R5/7/R4/3, R1/2, R3/4. R4/3. AND R4/5. As a result of application of PCA, some of later principal components (in this study PC 3 and PC 5) are geologically more significant than earlier major components, PC 1 and PC 2 herein. The earlier two major components which comprise 96% of the total information of the data set, mainly represent reflectance of vegetation and topographic effects, while though the rest represent 3% of the total information which statistically indicates the information unstable, geological significance of PC3 and PC5 in the study implies that application of the technique in more favorable areas should lead to much better results.

Assessment of Water Quality using Multivariate Statistical Techniques: A Case Study of the Nakdong River Basin, Korea

  • Park, Seongmook;Kazama, Futaba;Lee, Shunhwa
    • Environmental Engineering Research
    • /
    • 제19권3호
    • /
    • pp.197-203
    • /
    • 2014
  • This study estimated spatial and seasonal variation of water quality to understand characteristics of Nakdong river basin, Korea. All together 11 parameters (discharge, water temperature, dissolved oxygen, 5-day biochemical oxygen demand, chemical oxygen demand, pH, suspended solids, electrical conductivity, total nitrogen, total phosphorus, and total organic carbon) at 22 different sites for the period of 2003-2011 were analyzed using multivariate statistical techniques (cluster analysis, principal component analysis and factor analysis). Hierarchical cluster analysis grouped whole river basin into three zones, i.e., relatively less polluted (LP), medium polluted (MP) and highly polluted (HP) based on similarity of water quality characteristics. The results of factor analysis/principal component analysis explained up to 83.0%, 81.7% and 82.7% of total variance in water quality data of LP, MP, and HP zones, respectively. The rotated components of PCA obtained from factor analysis indicate that the parameters responsible for water quality variations were mainly related to discharge and total pollution loads (non-point pollution source) in LP, MP and HP areas; organic and nutrient pollution in LP and HP zones; and temperature, DO and TN in LP zone. This study demonstrates the usefulness of multivariate statistical techniques for analysis and interpretation of multi-parameter, multi-location and multi-year data sets.

Clustering Technique for Multivariate Data Analysis

  • Lee, Jin-Ki
    • 한국국방경영분석학회지
    • /
    • 제6권2호
    • /
    • pp.89-127
    • /
    • 1980
  • The multivariate analysis techniques of cluster analysis are examined in this article. The theory and applications of the techniques and computer software concerning these techniques are discussed and sample jobs are included. A hierarchical cluster analysis algorithm, available in the IMSL software package, is applied to a set of data extracted from a group of subjects for the purpose of partitioning a collection of 26 attributes of a weapon system into six clusters of superattributes. A nonhierarchical clustering procedure were applied to a collection of data of tanks considering of twenty-four observations of ten attributes of tanks. The cluster analysis shows that the tanks cluster somewhat naturally by nationality. The principal componant analysis and the discriminant analysis show that tank weight is the single most important discriminator among nationality although they are not shown in this article because of the space restriction. This is a part of thesis for master's degree in operations research.

  • PDF

A Resetting Scheme for Process Parameters using the Mahalanobis-Taguchi System

  • Park, Chang-Soon
    • 응용통계연구
    • /
    • 제25권4호
    • /
    • pp.589-603
    • /
    • 2012
  • Mahalanobis-Taguchi system(MTS) is a statistical tool for classifying the normal group and abnormal group in multivariate data structures. In addition to the classification itself, the MTS uses a method for selecting variables useful for the classification. This method can be used efficiently especially when the abnormal group data are scattered without a specific directionality. When the feedback adjustment procedure through the measurements of the process output for controlling process input variables is not practically possible, the reset procedure can be an alternative one. This article proposes a reset procedure using the MTS. Moreover, a method for identifying input variables to reset is also proposed by the use of the contribution. The identification of the root-cause parameters using the existing dimension-reduced contribution tends to be difficult due to the variety of correlation relationships of multivariate data structures. However, it became possible to provide an improved decision when used together with the location-centered contribution and the individual-parameter contribution.

다변량분석법을 이용한 충청북도 읍면단위 농촌계획 수립을 위한 지역유형구분 분석 (A Classification of Regional Pattern Analysis for the Planning in Chungbuk using Multivariate Analysis)

  • 윤성수;주호길
    • 농촌계획
    • /
    • 제11권2호
    • /
    • pp.35-41
    • /
    • 2005
  • It is necessary that the basic concept of rural planning update from economics based on the production and sale into experience of natural resources and traditional culture. For the purpose of set up development direction for rural district, it is requisite to the multivariate analysis. In this study, the methods of the classification of rural village with existing data are studied, the results looking for applying to the making of principal viewpoint of the development. The analysis methods of classification are used the PCA, CA and combination of these, and making the revised method for localization of the rural district. In this study, we implement classification of regional pattern analysis for the planning of rural district in Chungbuk province.

Artificial Neural Networks for Interest Rate Forecasting based on Structural Change : A Comparative Analysis of Data Mining Classifiers

  • Oh, Kyong-Joo
    • Journal of the Korean Data and Information Science Society
    • /
    • 제14권3호
    • /
    • pp.641-651
    • /
    • 2003
  • This study suggests the hybrid models for interest rate forecasting using structural changes (or change points). The basic concept of this proposed model is to obtain significant intervals caused by change points, to identify them as the change-point groups, and to reflect them in interest rate forecasting. The model is composed of three phases. The first phase is to detect successive structural changes in the U. S. Treasury bill rate dataset. The second phase is to forecast the change-point groups with data mining classifiers. The final phase is to forecast interest rates with backpropagation neural networks (BPN). Based on this structure, we propose three hybrid models in terms of data mining classifier: (1) multivariate discriminant analysis (MDA)-supported model, (2) case-based reasoning (CBR)-supported model, and (3) BPN-supported model. Subsequently, we compare these models with a neural network model alone and, in addition, determine which of three classifiers (MDA, CBR and BPN) can perform better. For interest rate forecasting, this study then examines the prediction ability of hybrid models to reflect the structural change.

  • PDF

단변량 및 다변량 함수 데이터에 대한 분산분석의 활용 (Application of functional ANOVA and functional MANOVA)

  • 김미정
    • 응용통계연구
    • /
    • 제35권5호
    • /
    • pp.579-591
    • /
    • 2022
  • 함수 데이터는 다양한 분야에서 수집되고 있으며, 집단 간의 함수 데이터를 비교해야하는 경우가 종종 발생한다. 이럴 경우 점별 분산분석 방법을 이용하여 설명하기에는 무리가 있으며, 통합된 결과를 제시할 필요가 있다. 이에 대한 다양한 연구가 제안되었으며, 최근에 R 패키지 fdANOVA로 구현되었다. 이 논문에서 우선 분산분석 및 다변량 분산분석을 설명하고, 최근에 제안된 다양한 단변량 및 다변량 함수 데이터 분산분석을 설명하고자 한다. 또한 R 패키지 fdANOVA의 사용 방법을 설명하고, 이 패키지를 이용하여 서울과 부산 지역의 주별 기온을 단변량 함수 데이터 분산분석을 통해 비교하고, 손글씨 이미지를 다변량 함수 데이터로 변환하여 다변량 함수 데이터 분산분석을 이용하여 비교하고자 한다.

이변량 지역빈도해석을 이용한 우리나라 극한 강우 분석 (Bivariate regional frequency analysis of extreme rainfalls in Korea)

  • 신주영;정창삼;안현준;허준행
    • 한국수자원학회논문집
    • /
    • 제51권9호
    • /
    • pp.747-759
    • /
    • 2018
  • 다변량 빈도해석과 지역빈도해석의 장점을 동시에 가지는 다변량 지역빈도해석은 다양한 변수를 고려함으로써 수문 현상에 대하여 많은 정보를 얻을 수 있고 많은 가용 자료 수로 인하여 높은 정확도의 분석결과를 도출할 수 있다. 현재까지는 우리나라의 강우 자료를 이용하여 다변량 지역빈도해석이 시도된 적이 없어 국내의 강우 자료를 대상으로 다변량 지역빈도해석의 적용성을 검토할 필요가 있다. 본 연구에서는 다변량 지역빈도해석의 매개변수 추정, 최적 분포형 선정, 확률수문량 성장곡선 추정 등에 집중하여 이변량 수문자료인 연 최대 강우량-지속기간 자료에 대하여 이변량 지역빈도해석의 적용성을 평가하였다. 기상청 71개 지점에 대하여 분석을 실시하였다. 본 연구를 통해 적용된 지역강우자료의 최적 copula 모형으로는 Frank와 Gumbel copula 모형이 선택되었고 주변분포형에 대해서는 지역별로 Gumbel과 대수정규분포와 같은 다양한 분포형이 최적 분포형으로 선택되었다. 상대제곱근오차(relative root mean square error)를 기준으로 지역빈도해석이 지점빈도해석보다 안정적이고 정확한 확률수문량 곡선 추정을 하였다. 이변량 강우분석에서 지역빈도해석을 적용하면 안정적인 수공구조물 설계기준 제시와 강우-지속기간 관계를 모형화 할 수 있을 것으로 기대된다.