• Title/Summary/Keyword: Root clustering

Search Result 37, Processing Time 0.018 seconds

Regional Frequency Analysis for Rainfall using L-Moment (L-모멘트법에 의한 강우의 지역빈도분석)

  • Koh, Deuk-Koo;Choo, Tai-Ho;Maeng, Seung-Jin;Trivedi, Chanda
    • The Journal of the Korea Contents Association
    • /
    • v.8 no.3
    • /
    • pp.252-263
    • /
    • 2008
  • This study was conducted to derive the optimal regionalization of the precipitation data which can be classified on the basis of climatologically and geographically homogeneous regions all over the regions except Cheju and Ulreung islands in Korea. A total of 65 rain gauges were used to regional analysis of precipitation. Annual maximum series for the consecutive durations of 1, 3, 6, 12, 24, 36, 48 and 72hr were used for various statistical analyses. K-means clustering mettled is used to identify homogeneous regions all over the regions. Five homogeneous regions for the precipitation were classified by the K-means clustering. Using the L-moment ratios and Kolmogorov-Smirnov test, the underlying regional probability distribution was identified to be the generalized extreme value (GEV) distribution among applied distributions. The regional and at-site parameters of the generalized extreme value distribution were estimated by the linear combination of the probability weighted moments, L-moment. The regional and at-site analysis for the design rainfall were tested by Monte Carlo simulation. Relative root-mean-square error (RRMSE), relative bias (RBIAS) and relative reduction (RR) in RRMSE were computed and compared with those resulting from at-site Monte Carlo simulation. All show that the regional analysis procedure can substantially reduce the RRMSE, RBIAS and RR in RRMSE in the prediction of design rainfall. Consequently, optimal design rainfalls following the regions and consecutive durations were derived by the regional frequency analysis.

Estimation of Drought Rainfall by Regional Frequency Analysis Using L and LH-Moments (II) - On the method of LH-moments - (L 및 LH-모멘트법과 지역빈도분석에 의한 가뭄우량의 추정 (II)- LH-모멘트법을 중심으로 -)

  • Lee, Soon-Hyuk;Yoon , Seong-Soo;Maeng , Sung-Jin;Ryoo , Kyong-Sik;Joo , Ho-Kil;Park , Jin-Seon
    • Journal of The Korean Society of Agricultural Engineers
    • /
    • v.46 no.5
    • /
    • pp.27-39
    • /
    • 2004
  • In the first part of this study, five homogeneous regions in view of topographical and geographically homogeneous aspects except Jeju and Ulreung islands in Korea were accomplished by K-means clustering method. A total of 57 rain gauges were used for the regional frequency analysis with minimum rainfall series for the consecutive durations. Generalized Extreme Value distribution was confirmed as an optimal one among applied distributions. Drought rainfalls following the return periods were estimated by at-site and regional frequency analysis using L-moments method. It was confirmed that the design drought rainfalls estimated by the regional frequency analysis were shown to be more appropriate than those by the at-site frequency analysis. In the second part of this study, LH-moment ratio diagram and the Kolmogorov-Smirnov test on the Gumbel (GUM), Generalized Extreme Value (GEV), Generalized Logistic (GLO) and Generalized Pareto (GPA) distributions were accomplished to get optimal probability distribution. Design drought rainfalls were estimated by both at-site and regional frequency analysis using LH-moments and GEV distribution, which was confirmed as an optimal one among applied distributions. Design rainfalls were estimated by at-site and regional frequency analysis using LH-moments, the observed and simulated data resulted from Monte Carlotechniques. Design drought rainfalls derived by regional frequency analysis using L1, L2, L3 and L4-moments (LH-moments) method have shown higher reliability than those of at-site frequency analysis in view of RRMSE (Relative Root-Mean-Square Error), RBIAS (Relative Bias) and RR (Relative Reduction) for the estimated design drought rainfalls. Relative efficiency were calculated for the judgment of relative merits and demerits for the design drought rainfalls derived by regional frequency analysis using L-moments and L1, L2, L3 and L4-moments applied in the first report and second report of this study, respectively. Consequently, design drought rainfalls derived by regional frequency analysis using L-moments were shown as more reliable than those using LH-moments. Finally, design drought rainfalls for the classified five homogeneous regions following the various consecutive durations were derived by regional frequency analysis using L-moments, which was confirmed as a more reliable method through this study. Maps for the design drought rainfalls for the classified five homogeneous regions following the various consecutive durations were accomplished by the method of inverse distance weight and Arc-View, which is one of GIS techniques.

Estimation of Drought Rainfall by Regional Frequency Analysis using L and LH-Moments(I) - On the Method of L-Moments - (L 및 LH-모멘트법과 지역빈도분석에 의한 가뭄우량의 추정(I) - L-모멘트법을 중심으로 -)

  • 이순혁;윤성수;맹승진;류경식;주호길
    • Magazine of the Korean Society of Agricultural Engineers
    • /
    • v.45 no.5
    • /
    • pp.97-109
    • /
    • 2003
  • This study is mainly conducted to derive the design drought rainfall by the consecutive duration using probability weighted moments with rainfall in the regional drought frequency analysis. It is anticipated to suggest optimal design drought rainfall of hydraulic structures for the water requirement and drought frequency of occurrence for the safety of water utilization through this study. Preferentially, this study was conducted to derive the optimal regionalization of the precipitation data that can be classified by the climatologically and geographically homogeneous regions all over the regions except Cheju and Ulreung islands in Korea. Five homogeneous regions in view of topographical and climatological aspects were accomplished by K-means clustering method. Using the L-moment ratio diagram and Kolmogorov-Smirnov test, generalized extreme value distribution was confirmed as the best fitting one among applied distributions. At-site and regional parameters of the generalized extreme value distribution were estimated by the method of L-moments. Design drought rainfalls using L-moments following the consecutive duration were derived by the at-site and regional analysis using the observed and simulated data resulted from Monte Carlo techniques. Relative root-mean-square error (RRMSE), relative bias (RBIAS) and relative reduction (RR) in RRMSE for the design drought rainfall derived by at-site and regional analysis in the observed an simulated data were computed and compared. In has shown that the regional frequency analysis procedure can substantially more reduce the RRMSE. RBIAS and RR in RRMSE than those of at-site analysis in the prediction of design drought rainfall. Consequently, optimal design drought rainfalls following the regions and consecutive durations were derived by the regional frequency analysis.

Short-term Forecasting of Power Demand based on AREA (AREA 활용 전력수요 단기 예측)

  • Kwon, S.H.;Oh, H.S.
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.39 no.1
    • /
    • pp.25-30
    • /
    • 2016
  • It is critical to forecast the maximum daily and monthly demand for power with as little error as possible for our industry and national economy. In general, long-term forecasting of power demand has been studied from both the consumer's perspective and an econometrics model in the form of a generalized linear model with predictors. Time series techniques are used for short-term forecasting with no predictors as predictors must be predicted prior to forecasting response variables and containing estimation errors during this process is inevitable. In previous researches, seasonal exponential smoothing method, SARMA (Seasonal Auto Regressive Moving Average) with consideration to weekly pattern Neuron-Fuzzy model, SVR (Support Vector Regression) model with predictors explored through machine learning, and K-means clustering technique in the various approaches have been applied to short-term power supply forecasting. In this paper, SARMA and intervention model are fitted to forecast the maximum power load daily, weekly, and monthly by using the empirical data from 2011 through 2013. $ARMA(2,\;1,\;2)(1,\;1,\;1)_7$ and $ARMA(0,\;1,\;1)(1,\;1,\;0)_{12}$ are fitted respectively to the daily and monthly power demand, but the weekly power demand is not fitted by AREA because of unit root series. In our fitted intervention model, the factors of long holidays, summer and winter are significant in the form of indicator function. The SARMA with MAPE (Mean Absolute Percentage Error) of 2.45% and intervention model with MAPE of 2.44% are more efficient than the present seasonal exponential smoothing with MAPE of about 4%. Although the dynamic repression model with the predictors of humidity, temperature, and seasonal dummies was applied to foretaste the daily power demand, it lead to a high MAPE of 3.5% even though it has estimation error of predictors.

Development of ensemble machine learning model considering the characteristics of input variables and the interpretation of model performance using explainable artificial intelligence (수질자료의 특성을 고려한 앙상블 머신러닝 모형 구축 및 설명가능한 인공지능을 이용한 모형결과 해석에 대한 연구)

  • Park, Jungsu
    • Journal of Korean Society of Water and Wastewater
    • /
    • v.36 no.4
    • /
    • pp.239-248
    • /
    • 2022
  • The prediction of algal bloom is an important field of study in algal bloom management, and chlorophyll-a concentration(Chl-a) is commonly used to represent the status of algal bloom. In, recent years advanced machine learning algorithms are increasingly used for the prediction of algal bloom. In this study, XGBoost(XGB), an ensemble machine learning algorithm, was used to develop a model to predict Chl-a in a reservoir. The daily observation of water quality data and climate data was used for the training and testing of the model. In the first step of the study, the input variables were clustered into two groups(low and high value groups) based on the observed value of water temperature(TEMP), total organic carbon concentration(TOC), total nitrogen concentration(TN) and total phosphorus concentration(TP). For each of the four water quality items, two XGB models were developed using only the data in each clustered group(Model 1). The results were compared to the prediction of an XGB model developed by using the entire data before clustering(Model 2). The model performance was evaluated using three indices including root mean squared error-observation standard deviation ratio(RSR). The model performance was improved using Model 1 for TEMP, TN, TP as the RSR of each model was 0.503, 0.477 and 0.493, respectively, while the RSR of Model 2 was 0.521. On the other hand, Model 2 shows better performance than Model 1 for TOC, where the RSR was 0.532. Explainable artificial intelligence(XAI) is an ongoing field of research in machine learning study. Shapley value analysis, a novel XAI algorithm, was also used for the quantitative interpretation of the XGB model performance developed in this study.

Development of a Simple and Reproducible Method for Removal of Contaminants from Ginseng Protein Samples Prior to Proteomics Analysis (활성탄을 이용한 불순물제거에 의한 효과적인 인삼 조직 단백질체 분석 방법 개선 연구)

  • Gupta, Ravi;Kim, So Wun;Min, Chul Woo;Sung, Gi-Ho;Agrawal, Ganesh Kumar;Rakwal, Randeep;Jo, Ick Hyun;Bang, Kyong Hwan;Kim, Young-Chang;Kim, Kee-Hong;Kim, Sun Tae
    • Journal of Life Science
    • /
    • v.25 no.7
    • /
    • pp.826-832
    • /
    • 2015
  • This study describes the effects of activated charcoal on the removal of salts, detergents, and pigments from protein extracts of ginseng leaves and roots. Incubation of protein extracts with 5% (w/v) activated charcoal (100-400 mesh) for 30 min at 4℃ almost removed the salts and detergents including NP-40 as can be observed on SDS-PAGE. In addition, analysis of chlorophyll content showed significant depletion of chlorophyll (~33%) after activated charcoal treatment, suggesting potential effect of activated charcoal on removal of pigments too along with the salts and detergents. 2-DE analysis of activated charcoal treated protein samples showed better resolution of proteins, further indicating the efficacy of activated charcoal in clearing of protein samples. In case of root proteins, although not major differences were observed on SDS-PAGE, 2-DE gels showed better resolution of spots after charcoal treatment. In addition, both Hierarchical clustering (HCL) and Principle component analysis (PCA) clearly separated acetone sample from rest of the samples. Phenol and AC-phenol samples almost overlapped each other suggesting no major differences between these samples. Overall, these results showed that activated charcoal can be used in a simple manner to remove the salts, detergents and pigments from the protein extracts of various plant tissues.

Price Volatility, Seasonality and Day-of-the Week Effect for Aquacultural Fishes in Korean Fishery Markets (수산물 시장에서의 양식 어류 가격변동성.계절성.요일효과에 관한 연구 - 노량진수산시장의 넙치와 조피볼락을 중심으로 -)

  • Ko, Bong-Hyun
    • The Journal of Fisheries Business Administration
    • /
    • v.40 no.2
    • /
    • pp.49-70
    • /
    • 2009
  • This study proviedes GARCH model(Bollerslev, 1986) to analyze the structural characteristics of price volatility in domestic aquacultural fish market of Korea. As a case study, flatfish and rock-fish are analyzed as major species with relatively high portion in an aspect of production volume among fish captured in Korea. For analyzing, this study uses daily market data (dating from Jan 1 2000 to June 30, 2008) published by the Noryangjin Fisheries Wholesale Market which is located in Seoul of Korea. This study performs normality test on trading volume and price volatility of flatfish and rock-fish as an advanced empirical approach. The normality test adopted is Jarque-Bera test statistic. As a result, first, a null hypothesis that "an empirical distribution follows normal distribution" was rejected in both fishes. The distribution of daily market data of them were not only biased toward positive(+) direction in terms of kurtosis and skewness, but also characterized by leptokurtic distribution with long right tail. Secondly, serial correlations were found in data on market trading volume and price volatility of two species during very long period. Thirdly, the results of unit root test and ARCH-LM test showed that all data of time series were very stationary and demonstrated effects of ARCH. These statistical characteristics can be explained as a reasonable ground for supporting the fitness of GARCH model in order to estimate conditional variances that reveal price volatility in empirical analysis. From empirical data analysis above, this study drew the following conclusions. First of all, from an empirical analysis on potential effects of seasonality and the day of week on price volatility of aquacultural fish, Monday effects were found in both species and Thursday and Friday effects were also found in flatfish. This indicates that Monday is effective in expanding price volatility of aquacultural fish market and also Monday has higher effects upon the price volatility of fish than other days of week have since it has more new information for weekend. Secondly, the empirical analysis led to a common conclusion that there was very high price volatility of flatfish and rock-fish. This points out that the persistency parameter($\lambda$), an index of possibility for current volatility to sustain similarly in the future, was higher than 0.8-equivalently nearly to 1-in both flatfish and rock-fish, which presents volatility clustering. Also, this study estimated and compared and model that hypothesized normal distributions in order to determine fitness of respective models. As a result, the fitness of GARCH(1, 1)-t model was better than model where the distribution of error term was hypothesized through-distribution due to characteristics of fat-tailed distribution, was also better than model, as described in the results of basic statistic analysis. In conclusion, this study has an important mean in that it was introduced firstly in Korea to investigate in price volatility of Korean aquacultural fishery products, although there was partially a limited of official statistic data. Therefore, it is expected that the results of this study will be useful as a reference material for making and assessing governmental policies. Also, it is looked forward that the results will be helpful to build a fishery business plan as and aspect of producer, and also to take timely measures to potential price fluctuations of fishery products in market. Hence, it is advisable that further studies related to such price volatility in fishery market will extend and evolve into a wider variety of articles and issues in near future.

  • PDF