• Title/Summary/Keyword: 다변량통계기법

Search Result 132, Processing Time 0.033 seconds

Design of a Wastewater Treatment Plant Upgrading to Advanced Nutrient Removal Treatment Using Modeling Methodology and Multivariate Statistical Analysis for Process Optimization (하수처리장의 고도처리 upgrading 설계와 공정 최적화를 위한 다변량 통계분석)

  • Kim, MinJeong;Kim, MinHan;Kim, YongSu;Yoo, ChangKyoo
    • Korean Chemical Engineering Research
    • /
    • v.48 no.5
    • /
    • pp.589-597
    • /
    • 2010
  • Strengthening the regulation standard of biological nutrient in wastewater treatment plant(WWTP), the necessity of repair of WWTP which is operated in conventional activated sludge process to advanced nutrient removal treatment is increased. However, in full-scale wastewater treatment system, it is not easy to fine the optimized operational condition of the advanced nutrient removal treatment through experiment due to the complex response of various influent conditions and operational conditions. Therefore, in this study, an upgrading design of conventional activated sludge process to advanced nutrient removal process using the modeling and simulation method based on activated sludge model(ASMs) is executed. And a design optimization of advanced treatment process using the response surface method(RSM) is carried out for statistical and systematic approach. In addition, for the operational optimization of full-scale WWTP, a correct analysis about kinetic variables of wastewater treatment is necessary. In this study, through partial least square(PLS) analysis which is one of the multivariable statistical analysis methods, a correlation between the kinetic variables of wastewater treatment system is comprehended, and the most effective variables to the advanced treatment operation result is deducted. Through this study, the methodology for upgrading design and operational optimization of advanced treatment process is provided, and an efficient repair of WWTP to advanced treatment can be expected reducing the design time and costs.

Assessment of Water Quality in the Miho Stream Using Multivariate Statistics (다변량 통계기법을 이용한 미호천 본류 수질특성 평가)

  • Yoon, Hyeyoung;Kim, Jeehyun;Chae, Minhee;Cho, Yoonhae;Cheon, Seuk
    • Journal of Environmental Impact Assessment
    • /
    • v.28 no.4
    • /
    • pp.373-386
    • /
    • 2019
  • In The study, is to investigate the spatial characteristics of the Miho stream, which is the main tributary of the Geum River system, and to identify the main factors influencing the water quality using water quality analysis and multivariate analysis. The survey subjects were selected as 7 main sites in the Miho stream water system, From 2012 to 2017, 16 items including weather temperature and weather data were used for multivariate analysis. As a result of the water quality analysis, the average concentration of BOD and COD for 6 years was 3grade (normal) compared with the water quality environmental standard (river) of conditions. The concentrations of nitrogen and phosphorus were highest at th upstream site, then decreased and then increased again by the hydrogeological and geomorphological effect. Cluster analysis of spatial and water quality characteristics, it was evaluated as three clusters and the pollution sources is the greatest impact. As a result of principal component analysis and factor analysis on each cluster and mainstream, three to four major components were extracted. Main stream and the Cluster 1, Cluster 3 first principal factor included nitrogen and seasonal factors,first factor of Cluster 2 included nitrogen and water temperature. Nitrogen is the principal factor which affects water quality in Miho stream.

Color Component Analysis For Image Retrieval (이미지 검색을 위한 색상 성분 분석)

  • Choi, Young-Kwan;Choi, Chul;Park, Jang-Chun
    • The KIPS Transactions:PartB
    • /
    • v.11B no.4
    • /
    • pp.403-410
    • /
    • 2004
  • Recently, studies of image analysis, as the preprocessing stage for medical image analysis or image retrieval, are actively carried out. This paper intends to propose a way of utilizing color components for image retrieval. For image retrieval, it is based on color components, and for analysis of color, CLCM (Color Level Co-occurrence Matrix) and statistical techniques are used. CLCM proposed in this paper is to project color components on 3D space through geometric rotate transform and then, to interpret distribution that is made from the spatial relationship. CLCM is 2D histogram that is made in color model, which is created through geometric rotate transform of a color model. In order to analyze it, a statistical technique is used. Like CLCM, GLCM (Gray Level Co-occurrence Matrix)[1] and Invariant Moment [2,3] use 2D distribution chart, which use basic statistical techniques in order to interpret 2D data. However, even though GLCM and Invariant Moment are optimized in each domain, it is impossible to perfectly interpret irregular data available on the spatial coordinates. That is, GLCM and Invariant Moment use only the basic statistical techniques so reliability of the extracted features is low. In order to interpret the spatial relationship and weight of data, this study has used Principal Component Analysis [4,5] that is used in multivariate statistics. In order to increase accuracy of data, it has proposed a way to project color components on 3D space, to rotate it and then, to extract features of data from all angles.

Characterization of Water Quality in Changnyeong-Haman Weir Section Using Statistical Analyses (통계분석을 이용한 낙동강 창녕함안보 구간의 수질특성 연구)

  • Gwak, Bo-ra;Kim, Il-kyu
    • Journal of Korean Society of Environmental Engineers
    • /
    • v.38 no.2
    • /
    • pp.71-78
    • /
    • 2016
  • The study of water environment system in Changnyeong-Haman weir section using a statistical analysis has been conducted. Statistical analyses used in this study were the correlation analysis, the principal components, and the factor analysis. The purpose of the study is to establish better understanding of relationships between water quality factors in the Changnyeong-Haman weir section which can provide useful information to manage Nakdong river. According to correlation analyses on COD and TOC, it revealed that the value of correlation coefficient was 0.844. Furthermore, the results from the principal component analysis categorized the water quality factors into three factor groups, the first principal factor group included COD, TOC, BOD, pH, water temperature (WT). And, it was observed that the concentration of cyanobacteria in the water body decreased, while the concentrations of the diatoms and the green algae increased after the events of rainfall.

Analysis of Regional Environment in the Nak-Dong River Watershed using Geographic Information System (지리정보시스템을 이용한 낙동강 유역권의 광역환경분석)

  • Jung, Sung-Kwan;Park, Kyung-Hun
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.3 no.1
    • /
    • pp.12-22
    • /
    • 2000
  • Recently urbanization and industrialization around the Nak-dong river watershed have lead to the regional environmental problems. In this viewpoint, we took up variables which were related to watershed environment, and found out spatial and environmental properties of the Nak-dong river using factor analysis, ANOVA test and geographic information system. The results may be summarized as follows; three common factors which were named as urban, agricultural and industrial pollutant factor extracted from statistical methods. Spatial distribution of watershed environment could be found by connection attributes of factor scores derived from factor analysis to digital map using GIS. According to the results, distribution of pollutant sources were concentrated in the main stream of the Nak-dong river and its tributaries, Kum-ho river. So it is necessary to manage the watershed environment in due consideration of environmental carrying capacity.

  • PDF

Early Vegetation Succession in Abandoned Field in Cheju Island (제주도 저지대 묵밭 식물군락의 2차 천이)

  • 유영한;이창석
    • The Korean Journal of Ecology
    • /
    • v.26 no.4
    • /
    • pp.209-214
    • /
    • 2003
  • In order to clarify the successional trend and its characteristics of plant community in abandoned field in Cheju Island, we investigated the seral changes of vegetation height, coverage, growth form, species richness, diversity and dominance index along with the time lapse, and then compared with those of mainland researches. And in order to seek the overall trend of vegetation changes, we used cluster, TWINSPAN and PCA ordination techniques. The succession was characterized by the lower species number, lower vegetation height, longer period of herb dominant and later invasion of tree species. These results may come from that ecological traits of Cheju Island itself, and ecosystem fragmented influences that block a seed (propagule) transport. Sere of the dominant species was shown as follows: Mazus japonicus(0.5∼1 years)→Conyza sumatrensis (2 years)→Artemisia princeps var. orientalis(4 years)→Artemisia princeps var. orientalis, indigofera pseudotinctoria(5 years)→Rosa multiflora, Miscanthus sinensis, etc.(8 years)→Miscanthus sinensis(12 years)→Boehmeria nivea, Pueraria thunbergiana etc.(15 years)→Mallotus japonicus(20 years)→Litsea japonica, Machilus thunbergii (20 years<). Abandoned fields were classified into three groups according to time lapse; earlier stage(0∼1 years), middle stage(2∼8 years) and later stage(8∼20 years).

Establishment of discrimination system using multivariate analysis of FT-IR spectroscopy data from different species of artichoke (Cynara cardunculus var. scolymus L.) (FT-IR 스펙트럼 데이터 기반 다변량통계분석기법을 이용한 아티초크의 대사체 수준 품종 분류)

  • Kim, Chun Hwan;Seong, Ki-Cheol;Jung, Young Bin;Lim, Chan Kyu;Moon, Doo Gyung;Song, Seung Yeob
    • Horticultural Science & Technology
    • /
    • v.34 no.2
    • /
    • pp.324-330
    • /
    • 2016
  • To determine whether FT-IR spectral analysis based on multivariate analysis for whole cell extracts can be used to discriminate between artichoke (Cynara cardunculus var. scolymus L.) plants at the metabolic level, leaves of ten artichoke plants were subjected to Fourier transform infrared(FT-IR) spectroscopy. FT-IR spectral data from leaves were analyzed by principal component analysis (PCA), partial least square discriminant analysis (PLS-DA) and hierarchical clustering analysis (HCA). FT-IR spectra confirmed typical spectral differences between the frequency regions of 1,700-1,500, 1,500-1,300 and $1,100-950cm^{-1}$, respectively. These spectral regions reflect the quantitative and qualitative variations of amide I, II from amino acids and proteins ($1,700-1,500cm^{-1}$), phosphodiester groups from nucleic acid and phospholipid ($1,500-1,300cm^{-1}$) and carbohydrate compounds ($1,100-950cm^{-1}$). PCA revealed separate clusters that corresponded to their species relationship. Thus, PCA could be used to distinguish between artichoke species with different metabolite contents. PLS-DA showed similar species classification of artichoke. Furthermore these metabolic discrimination systems could be used for the rapid selection and classification of useful artichoke cultivars.

Variable Selection for Multi-Purpose Multivariate Data Analysis (다목적 다변량 자료분석을 위한 변수선택)

  • Huh, Myung-Hoe;Lim, Yong-Bin;Lee, Yong-Goo
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.1
    • /
    • pp.141-149
    • /
    • 2008
  • Recently we frequently analyze multivariate data with quite large number of variables. In such data sets, virtually duplicated variables may exist simultaneously even though they are conceptually distinguishable. Duplicate variables may cause problems such as the distortion of principal axes in principal component analysis and factor analysis and the distortion of the distances between observations, i.e. the input for cluster analysis. Also in supervised learning or regression analysis, duplicated explanatory variables often cause the instability of fitted models. Since real data analyses are aimed often at multiple purposes, it is necessary to reduce the number of variables to a parsimonious level. The aim of this paper is to propose a practical algorithm for selection of a subset of variables from a given set of p input variables, by the criterion of minimum trace of partial variances of unselected variables unexplained by selected variables. The usefulness of proposed method is demonstrated in visualizing the relationship between selected and unselected variables, in building a predictive model with very large number of independent variables, and in reducing the number of variables and purging/merging categories in categorical data.

A Comparison of Cluster Analyses and Clustering of Sensory Data on Hanwoo Bulls (군집분석 비교 및 한우 관능평가데이터 군집화)

  • Kim, Jae-Hee;Ko, Yoon-Sil
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.4
    • /
    • pp.745-758
    • /
    • 2009
  • Cluster analysis is the automated search for groups of related observations in a data set. To group the observations into clusters many techniques has been proposed, and a variety measures aimed at validating the results of a cluster analysis have been suggested. In this paper, we compare complete linkage, Ward's method, K-means and model-based clustering and compute validity measures such as connectivity, Dunn Index and silhouette with simulated data from multivariate distributions. We also select a clustering algorithm and determine the number of clusters of Korean consumers based on Korean consumers' palatability scores for Hanwoo bull in BBQ cooking method.

Data-driven modeling of the anaerobic wastewater treatment plant using robust adaptive dynamic PLS method

  • Lee Hae Woo;Lee Min Woo;Joung Jea Youl;Park Jong Moon
    • 한국생물공학회:학술대회논문집
    • /
    • 2004.07a
    • /
    • pp.47-84
    • /
    • 2004
  • Principal Component Analysis나 Partial Least Squares와 같은 다변량 통계 기법은 변수간의 correlation structure로부터 공정의 variance를 설명할 수 있는 latent variable를 얻고 이를 이용하여 공정을 효과적으로 modeling할 수 있는 방법으로 최근 들어 많은 관심을 얻고 있다. 하지만 PLS는 공정이 stationary state에 있다고 가정하기 때문에, 생물학적 공정의 non-stationary and time-varying behavior를 설명하기에 부적절하다. 본 논문에서는 PLS 알고리즘의 혐기성 폐수처리 공정에의 적용에 있어, 이와 같은 문제를 해결하기 위해서 adaptive PLS 알고리즘을 사용함으로써 변화하는 공정의 특성에 대응하여 모델을 update하는 방법을 이용하였다. 하지만 실시간 데이터로부터 adaptive PLS 방법을 적용하는 데에는 많은 어려움이 존재하며, 특히 outlier나 abnormal disturbance에 모델이 부적절하게 adaptation하는 문제가 발생할 수 있다. 따라서 이의 해결을 위해 adaptive PLS를 적용하는데 있어 robustness를 향상시키기 위해 monitoring index를 이용하여 abnormal data에 weight를 주고 안정적인 모델의 update가 가능하게 하는 방법을 제안하였으며, 이를 적용하여 성공적으로 혐기성 폐수처리 공정의 Output을 예측하고 효과적으로 공정을 모니터링할 수 있었다. 만들어진 PLS 모델은 산업폐수를 처리하기 위한 industrial plan에서 측정된 실제 데이터에 적용하여 그 효용성을 입증하였으며, 그 결과는 mechanistic model을 적용하기 힘든 실공정에 비교적 쉽게 implementation할 수 있는 장점이 있다.

  • PDF