• Title/Summary/Keyword: Combination 예측 모델

Search Result 171, Processing Time 0.025 seconds

WQI Class Prediction of Sihwa Lake Using Machine Learning-Based Models (기계학습 기반 모델을 활용한 시화호의 수질평가지수 등급 예측)

  • KIM, SOO BIN;LEE, JAE SEONG;KIM, KYUNG TAE
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY
    • /
    • v.27 no.2
    • /
    • pp.71-86
    • /
    • 2022
  • The water quality index (WQI) has been widely used to evaluate marine water quality. The WQI in Korea is categorized into five classes by marine environmental standards. But, the WQI calculation on huge datasets is a very complex and time-consuming process. In this regard, the current study proposed machine learning (ML) based models to predict WQI class by using water quality datasets. Sihwa Lake, one of specially-managed coastal zone, was selected as a modeling site. In this study, adaptive boosting (AdaBoost) and tree-based pipeline optimization (TPOT) algorithms were used to train models and each model performance was evaluated by metrics (accuracy, precision, F1, and Log loss) on classification. Before training, the feature importance and sensitivity analysis were conducted to find out the best input combination for each algorithm. The results proved that the bottom dissolved oxygen (DOBot) was the most important variable affecting model performance. Conversely, surface dissolved inorganic nitrogen (DINSur) and dissolved inorganic phosphorus (DIPSur) had weaker effects on the prediction of WQI class. In addition, the performance varied over features including stations, seasons, and WQI classes by comparing spatio-temporal and class sensitivities of each best model. In conclusion, the modeling results showed that the TPOT algorithm has better performance rather than the AdaBoost algorithm without considering feature selection. Moreover, the WQI class for unknown water quality datasets could be surely predicted using the TPOT model trained with satisfactory training datasets.

Performance Improvement of Speaker Recognition by MCE-based Score Combination of Multiple Feature Parameters (MCE기반의 다중 특징 파라미터 스코어의 결합을 통한 화자인식 성능 향상)

  • Kang, Ji Hoon;Kim, Bo Ram;Kim, Kyu Young;Lee, Sang Hoon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.6
    • /
    • pp.679-686
    • /
    • 2020
  • In this thesis, an enhanced method for the feature extraction of vocal source signals and score combination using an MCE-Based weight estimation of the score of multiple feature vectors are proposed for the performance improvement of speaker recognition systems. The proposed feature vector is composed of perceptual linear predictive cepstral coefficients, skewness, and kurtosis extracted with lowpass filtered glottal flow signals to eliminate the flat spectrum region, which is a meaningless information section. The proposed feature was used to improve the conventional speaker recognition system utilizing the mel-frequency cepstral coefficients and the perceptual linear predictive cepstral coefficients extracted with the speech signals and Gaussian mixture models. In addition, to increase the reliability of the estimated scores, instead of estimating the weight using the probability distribution of the convectional score, the scores evaluated by the conventional vocal tract, and the proposed feature are fused by the MCE-Based score combination method to find the optimal speaker. The experimental results showed that the proposed feature vectors contained valid information to recognize the speaker. In addition, when speaker recognition is performed by combining the MCE-based multiple feature parameter scores, the recognition system outperformed the conventional one, particularly in low Gaussian mixture cases.

Preliminary Uncertainty Analysis to Build a Data-Driven Prediction Model for Water Quality in Paldang Dam (팔당댐 유역의 데이터 기반 수질 예측 모형 구성을 위한 사전 불확실성 분석)

  • Lee, Eun Jeong;Keum, Ho Jun
    • Ecology and Resilient Infrastructure
    • /
    • v.9 no.1
    • /
    • pp.24-35
    • /
    • 2022
  • For water quality management, it is necessary to continuously improve the forecasting by analyzing the past water quality, and a Data-driven model is emerging as an alternative. Because the Data-driven model is built based on a wide range of data, it is essential to apply the correlation analysis method for the combination of input variables to obtain more reliable results. In this study, the Gamma Test was applied as a preceding step to build a faster and more accurate data-driven water quality prediction model. First, a physical-based model (HSPF, EFDC) was operated to produce daily water quality reflecting the complexity of the watershed according to various hydrological conditions for Paldang Dam. The Gamma Test was performed on the water quality at the water quality prediction site (Paldangdam2) and major rivers flowing into the Paldang Dam, and the method of selecting the optimal input data combination was presented through the analysis results (Gamma, Gradient, Standar Error, V-Ratio). As a result of the study, the selection criteria for a more efficient combination of input data that can save time by omitting trial and error when building a data-driven model are presented.

A Effect of Heavy Metal to Toxicity of Triclosan Focused on Vibrio fischeri Assay (Triclosan의 독성에 중금속이 미치는 영향 - V. fischeri Assay 관련 내용 중심으로 -)

  • Kim, Ji-Sung;Kim, Il-Ho;Lee, Woo-Mi;Lee, Hye-In;Kim, Seok-Gu
    • Journal of Korean Society of Environmental Engineers
    • /
    • v.36 no.3
    • /
    • pp.153-161
    • /
    • 2014
  • The purpose of this study is to evaluate effect of heavy metals (i.e., $Cu^{2+}$, $Zn^{2+}$, $Cr^{6+}$, $Cd^{2+}$, $Hg^{2+}$, and $Pb^{2+}$) to toxicity of Triclosan as binary mixture. The individual toxicity and combined toxic effects of Triclosan with heavy metals were evaluated by Vibrio fischeri assay. In individual toxicity, the $Hg^{2+}$ was found to be most toxic followed by Triclosan, $Pb^{2+}$, $Cr^{6+}$, $Cu^{2+}$, $Zn^{2+}$, and $Cd^{2+}$, respectively. To evaluate combined toxic effect, correlation analysis of 'predicted value' calculated by Concentration addition (CA) model and Independent action (IA) model with 'experimental value' were performed based on the toxicity of individual compound. As a result, all of the combinations showed that IA model were more correlated with experimental value than CA model. On the basis of the median effect concentration of combination ($EC_{50mix}$) predicted by IA model, experimental $EC_{50mix}$ of Triclosan + Cu, Triclosan + Zn, Triclosan + Pb, Triclosan + Hg, Triclosan + Cd, and Triclosan + Cr were 191%, 226%, 138%, 137%, 209%, and 138% of $EC_{50mix}$ predicted by IA model, respectively, indicating that all of the combinations produced antagonistic effect.

Development of Prediction Model for Improvement of Safety Facilities in Frequent Traffic Accidents (교통사고 잦은 곳 안전시설 개선 방안 예측 모델 개발)

  • Jaekyung Kwon;Siwon Kim;Jae seong Hwang;Jaehyung Lee;Choul ki Lee
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.22 no.1
    • /
    • pp.16-24
    • /
    • 2023
  • Accidents are greatly reduced through projects to improve frequent traffic accidents. These results show that safety facilities play a big role. Traffic accidents are caused by various causes and various environmental factors, and it is difficult to achieve improvement effects by installing one safety facility or facilities without standards. Therefore, this study analyzed the improvement effect of each accident type by combining the two safety facilities, and suggested a method of predicting the combination of safety facilities suitable for a specific point, including environmental factors such as road type, road type, and traffic. The prediction was carried out by selecting an XGBoost technique that creates one strong prediction model by combining prediction models that can be simple classification. Through this, safety facilities that have had positive effects through improvement projects and safety facilities to be installed at points in need of improvement were derived, and safety facilities effect analysis and prediction methods for future installation points were presented.

Comparison of Seismic Data Interpolation Performance using U-Net and cWGAN (U-Net과 cWGAN을 이용한 탄성파 탐사 자료 보간 성능 평가)

  • Yu, Jiyun;Yoon, Daeung
    • Geophysics and Geophysical Exploration
    • /
    • v.25 no.3
    • /
    • pp.140-161
    • /
    • 2022
  • Seismic data with missing traces are often obtained regularly or irregularly due to environmental and economic constraints in their acquisition. Accordingly, seismic data interpolation is an essential step in seismic data processing. Recently, research activity on machine learning-based seismic data interpolation has been flourishing. In particular, convolutional neural network (CNN) and generative adversarial network (GAN), which are widely used algorithms for super-resolution problem solving in the image processing field, are also used for seismic data interpolation. In this study, CNN-based algorithm, U-Net and GAN-based algorithm, and conditional Wasserstein GAN (cWGAN) were used as seismic data interpolation methods. The results and performances of the methods were evaluated thoroughly to find an optimal interpolation method, which reconstructs with high accuracy missing seismic data. The work process for model training and performance evaluation was divided into two cases (i.e., Cases I and II). In Case I, we trained the model using only the regularly sampled data with 50% missing traces. We evaluated the model performance by applying the trained model to a total of six different test datasets, which consisted of a combination of regular, irregular, and sampling ratios. In Case II, six different models were generated using the training datasets sampled in the same way as the six test datasets. The models were applied to the same test datasets used in Case I to compare the results. We found that cWGAN showed better prediction performance than U-Net with higher PSNR and SSIM. However, cWGAN generated additional noise to the prediction results; thus, an ensemble technique was performed to remove the noise and improve the accuracy. The cWGAN ensemble model removed successfully the noise and showed improved PSNR and SSIM compared with existing individual models.

Performance Prediction of Geothermal Heat Pump System by Line-Source and Modified DST(TRNVDSTP) Models (선형열원 모델과 수정 DST(TRNVDSTP) 모델에 의한 지열 히트펌프 시스템 성능 예측)

  • Sohn, Byong-Hu
    • Journal of the Korean Society for Geothermal and Hydrothermal Energy
    • /
    • v.8 no.2
    • /
    • pp.61-69
    • /
    • 2012
  • Geothermal heat pump(GHP) systems have been shown to be an environmentally-friendly, efficient alternative to traditional cooling and heating systems in both residential and commercial applications. Although some experimental work related to performance evaluation of GHP systems with vertical borehole ground heat exchangers for commercial buildings has been done, relatively little has been reported on the performance simulation of these systems. The aim of this study is to evaluate the cooling and heating performance of the GHP system with 30 borehole ground heat exchangers applied to an commercial building($1,210m^2$) in Seoul. For this purpose, a typical design procedure was involved with a combination of design parameters such as building loads, heat pump capacity, circulating pump, borehole diameter, and ground effective thermal properties, etc. The cooling and heating performance prediction of the system was conducted with different prediction methods and then each result is compared.

Application of Artificial Neural Networks for Prediction of the Flow and Strength of Controlled Low Strength Material (CLSM의 플로우 및 일축압축강도 예측을 위한 인공신경망 적용)

  • Lim, Jong-Goo;Kim, Yeon-Joong;Chun, Byung-Sik
    • Journal of the Korean Geotechnical Society
    • /
    • v.27 no.1
    • /
    • pp.17-24
    • /
    • 2011
  • The characteristics of flow and strength of CLSM depend on the combination ratio including the fly ash, pond ash, cement, water quantity and etc. However, it is very difficult to draw the mechanism about the flow, strength and the mixing ratio of each components. Therefore, the method of calculation drawing the flow about the component ratio of CLSM and compression strength value is needed for the valid practical use of CLSM. To verify the efficiency of artificial neural network, new data which were not used for establishing the model were predicted and compared with the results of laboratory tests. In this research, it was used to evaluate the learning efficiency of the artificial neural network model and the prediction ability by changing the node number of hidden layer, learning rate, momentum, target system error and hidden layer. By using the results, the optimized artificial neural network model which is suitable for a flow and compressive strength estimate of CLSM was determined.

A Study on Optimized Artificial Neural Network Model for the Prediction of Bearing Capacity of Driven Piles (항타말뚝의 지지력 예측을 위한 최적의 인공신경망모델에 관한 연구)

  • Park Hyun-Il;Seok Jeong-Woo;Hwang Dae-Jin;Cho Chun-Whan
    • Journal of the Korean Geotechnical Society
    • /
    • v.22 no.6
    • /
    • pp.15-26
    • /
    • 2006
  • Although numerous investigations have been performed over the years to predict the behavior and bearing capacity of piles, the mechanisms are not yet entirely understood. The prediction of bearing capacity is a difficult task, because large numbers of factors affect the capacity and also have complex relationship one another. Therefore, it is extremely difficult to search the essential factors among many factors, which are related with ground condition, pile type, driving condition and others, and then appropriately consider complicated relationship among the searched factors. The present paper describes the application of Artificial Neural Network (ANN) in predicting the capacity including its components at the tip and along the shaft from dynamic load test of the driven piles. Firstly, the effect of each factor on the value of bearing capacity is investigated on the basis of sensitivity analysis using ANN modeling. Secondly, the authors use the design methodology composed of ANN and genetic algorithm (GA) to find optimal neural network model to predict the bearing capacity. The authors allow this methodology to find the appropriate combination of input parameters, the number of hidden units and the transfer structure among the input, the hidden and the out layers. The results of this study indicate that the neural network model serves as a reliable and simple predictive tool for the bearing capacity of driven piles.

Assessment of the Prediction Performance of Ensemble Size-Related in GloSea5 Hindcast Data (기상청 기후예측시스템(GloSea5)의 과거기후장 앙상블 확대에 따른 예측성능 평가)

  • Park, Yeon-Hee;Hyun, Yu-Kyung;Heo, Sol-Ip;Ji, Hee-Sook
    • Atmosphere
    • /
    • v.31 no.5
    • /
    • pp.511-523
    • /
    • 2021
  • This study explores the optimal ensemble size to improve the prediction performance of the Korea Meteorological Administration's operational climate prediction system, global seasonal forecast system version 5 (GloSea5). The GloSea5 produces an ensemble of hindcast data using the stochastic kinetic energy backscattering version2 (SKEB2) and timelagged ensemble. An experiment to increase the hindcast ensemble from 3 to 14 members for four initial dates was performed and the improvement and effect of the prediction performance considering Root Mean Square Error (RMSE), Anomaly Correlation Coefficient (ACC), ensemble spread, and Ratio of Predictable Components (RPC) were evaluated. As the ensemble size increased, the RMSE and ACC prediction performance improved and more significantly in the high variability area. In spread and RPC analysis, the prediction accuracy of the system improved as the ensemble size increased. The closer the initial date, the better the predictive performance. Results show that increasing the ensemble to an appropriate number considering the combination of initial times is efficient.