• Title/Summary/Keyword: model predictions

Search Result 2,075, Processing Time 0.03 seconds

Applicability of QSAR Models for Acute Aquatic Toxicity under the Act on Registration, Evaluation, etc. of Chemicals in the Republic of Korea (화평법에 따른 급성 수생독성 예측을 위한 QSAR 모델의 활용 가능성 연구)

  • Kang, Dongjin;Jang, Seok-Won;Lee, Si-Won;Lee, Jae-Hyun;Lee, Sang Hee;Kim, Pilje;Chung, Hyen-Mi;Seong, Chang-Ho
    • Journal of Environmental Health Sciences
    • /
    • v.48 no.3
    • /
    • pp.159-166
    • /
    • 2022
  • Background: A quantitative structure-activity relationship (QSAR) model was adopted in the Registration, Evaluation, Authorization, and Restriction of Chemicals (REACH, EU) regulations as well as the Act on Registration, Evaluation, etc. of Chemicals (AREC, Republic of Korea). It has been previously used in the registration of chemicals. Objectives: In this study, we investigated the correlation between the predicted data provided by three prediction programs using a QSAR model and actual experimental results (acute fish, daphnia magna toxicity). Through this approach, we aimed to effectively conjecture on the performance and determine the most applicable programs when designating toxic substances through the AREC. Methods: Chemicals that had been registered and evaluated in the Toxic Chemicals Control Act (TCCA, Republic of Korea) were selected for this study. Two prediction programs developed and operated by the U.S. EPA - the Ecological Structure-Activity Relationship (ECOSAR) and Toxicity Estimation Software Tool (T.E.S.T.) models - were utilized along with the TOPKAT (Toxicity Prediction by Komputer Assisted Technology) commercial program. The applicability of these three programs was evaluated according to three parameters: accuracy, sensitivity, and specificity. Results: The prediction analysis on fish and daphnia magna in the three programs showed that the TOPKAT program had better sensitivity than the others. Conclusions: Although the predictive performance of the TOPKAT program when using a single predictive program was found to perform well in toxic substance designation, using a single program involves many restrictions. It is necessary to validate the reliability of predictions by utilizing multiple methods when applying the prediction program to the regulation of chemicals.

Predicting the Number of Confirmed COVID-19 Cases Using Deep Learning Models with Search Term Frequency Data (검색어 빈도 데이터를 반영한 코로나 19 확진자수 예측 딥러닝 모델)

  • Sungwook Jung
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.9
    • /
    • pp.387-398
    • /
    • 2023
  • The COVID-19 outbreak has significantly impacted human lifestyles and patterns. It was recommended to avoid face-to-face contact and over-crowded indoor places as much as possible as COVID-19 spreads through air, as well as through droplets or aerosols. Therefore, if a person who has contacted a COVID-19 patient or was at the place where the COVID-19 patient occurred is concerned that he/she may have been infected with COVID-19, it can be fully expected that he/she will search for COVID-19 symptoms on Google. In this study, an exploratory data analysis using deep learning models(DNN & LSTM) was conducted to see if we could predict the number of confirmed COVID-19 cases by summoning Google Trends, which played a major role in surveillance and management of influenza, again and combining it with data on the number of confirmed COVID-19 cases. In particular, search term frequency data used in this study are available publicly and do not invade privacy. When the deep neural network model was applied, Seoul (9.6 million) with the largest population in South Korea and Busan (3.4 million) with the second largest population recorded lower error rates when forecasting including search term frequency data. These analysis results demonstrate that search term frequency data plays an important role in cities with a population above a certain size. We also hope that these predictions can be used as evidentiary materials to decide policies, such as the deregulation or implementation of stronger preventive measures.

Evaluation Model for Lateral Flow on Soft Ground Using Commitee and Probabilistic Neural Network Theory (군집신경망과 확률신경망 이론을 이용한 연약지반의 측방유동 평가 모델)

  • Kim, Young-Sang;Joo, No-Ah;Lee, Jeong-Jae
    • Journal of the Korean Geotechnical Society
    • /
    • v.23 no.7
    • /
    • pp.65-76
    • /
    • 2007
  • Recently, there have been many construction projects on soft ground with growth of industry and various construction problems concerning soft soil behavior also have been reported. Especially, foundation piles of abutments and (or) buildings which were constructed on the soft ground have been suffering from a lot of stability problems of inordinary displacement due to lateral flow of soft ground. Although many researches for this phenomena have been carried out, it is still difficult to assess the mechanism of lateral flow on soft ground quantitatively. And reliable design method for judgement of lateral flow occurrence is not established yet. In this study, PNN (probabilistic neural network) and CNN (committee neural network) theories were applied for judgment of lateral flow occurrence based on eat data compiled from Korea and Japan. Predictions of PNN and CNN models for new data which were not used during model development are compared with those predicted by conventional empirical methods. It was found that the developed PNN and CNN models can predict more precise and reliable judgment of lateral flow occurrence than conventional empirical methods.

Generative Adversarial Network Model for Generating Yard Stowage Situation in Container Terminal (컨테이너 터미널의 야드 장치 상태 생성을 위한 생성적 적대 신경망 모형)

  • Jae-Young Shin;Yeong-Il Kim;Hyun-Jun Cho
    • Proceedings of the Korean Institute of Navigation and Port Research Conference
    • /
    • 2022.06a
    • /
    • pp.383-384
    • /
    • 2022
  • Following the development of technologies such as digital twin, IoT, and AI after the 4th industrial revolution, decision-making problems are being solved based on high-dimensional data analysis. This has recently been applied to the port logistics sector, and a number of studies on big data analysis, deep learning predictions, and simulations have been conducted on container terminals to improve port productivity. These high-dimensional data analysis techniques generally require a large number of data. However, the global port environment has changed due to the COVID-19 pandemic in 2020. It is not appropriate to apply data before the COVID-19 outbreak to the current port environment, and the data after the outbreak was not sufficiently collected to apply it to data analysis such as deep learning. Therefore, this study intends to present a port data augmentation method for data analysis as one of these problem-solving methods. To this end, we generate the container stowage situation of the yard through a generative adversarial neural network model in terms of container terminal operation, and verify similarity through statistical distribution verification between real and augmented data.

  • PDF

Development and Application of Statistical Programs Based on Data and Artificial Intelligence Prediction Model to Improve Statistical Literacy of Elementary School Students (초등학생의 통계적 소양 신장을 위한 데이터와 인공지능 예측모델 기반의 통계프로그램 개발 및 적용)

  • Kim, Yunha;Chang, Hyewon
    • Communications of Mathematical Education
    • /
    • v.37 no.4
    • /
    • pp.717-736
    • /
    • 2023
  • The purpose of this study is to develop a statistical program using data and artificial intelligence prediction models and apply it to one class in the sixth grade of elementary school to see if it is effective in improving students' statistical literacy. Based on the analysis of problems in today's elementary school statistical education, a total of 15 sessions of the program was developed to encourage elementary students to experience the entire process of statistical problem solving and to make correct predictions by incorporating data, the core in the era of the Fourth Industrial Revolution into AI education. The biggest features of this program are the recognition of the importance of data, which are the key elements of artificial intelligence education, and the collection and analysis activities that take into account context using real-life data provided by public data platforms. In addition, since it consists of activities to predict the future based on data by using engineering tools such as entry and easy statistics, and creating an artificial intelligence prediction model, it is composed of a program focused on the ability to develop communication skills, information processing capabilities, and critical thinking skills. As a result of applying this program, not only did the program positively affect the statistical literacy of elementary school students, but we also observed students' interest, critical inquiry, and mathematical communication in the entire process of statistical problem solving.

Applying deep learning based super-resolution technique for high-resolution urban flood analysis (고해상도 도시 침수 해석을 위한 딥러닝 기반 초해상화 기술 적용)

  • Choi, Hyeonjin;Lee, Songhee;Woo, Hyuna;Kim, Minyoung;Noh, Seong Jin
    • Journal of Korea Water Resources Association
    • /
    • v.56 no.10
    • /
    • pp.641-653
    • /
    • 2023
  • As climate change and urbanization are causing unprecedented natural disasters in urban areas, it is crucial to have urban flood predictions with high fidelity and accuracy. However, conventional physically- and deep learning-based urban flood modeling methods have limitations that require a lot of computer resources or data for high-resolution flooding analysis. In this study, we propose and implement a method for improving the spatial resolution of urban flood analysis using a deep learning based super-resolution technique. The proposed approach converts low-resolution flood maps by physically based modeling into the high-resolution using a super-resolution deep learning model trained by high-resolution modeling data. When applied to two cases of retrospective flood analysis at part of City of Portland, Oregon, U.S., the results of the 4-m resolution physical simulation were successfully converted into 1-m resolution flood maps through super-resolution. High structural similarity between the super-solution image and the high-resolution original was found. The results show promising image quality loss within an acceptable limit of 22.80 dB (PSNR) and 0.73 (SSIM). The proposed super-resolution method can provide efficient model training with a limited number of flood scenarios, significantly reducing data acquisition efforts and computational costs.

Statistical Method and Deep Learning Model for Sea Surface Temperature Prediction (수온 데이터 예측 연구를 위한 통계적 방법과 딥러닝 모델 적용 연구)

  • Moon-Won Cho;Heung-Bae Choi;Myeong-Soo Han;Eun-Song Jung;Tae-Soon Kang
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.29 no.6
    • /
    • pp.543-551
    • /
    • 2023
  • As climate change continues to prompt an increasing demand for advancements in disaster and safety management technologies to address abnormal high water temperatures, typhoons, floods, and droughts, sea surface temperature has emerged as a pivotal factor for swiftly assessing the impacts of summer harmful algal blooms in the seas surrounding Korean Peninsula and the formation and dissipation of cold water along the East Coast of Korea. Therefore, this study sought to gauge predictive performance by leveraging statistical methods and deep learning algorithms to harness sea surface temperature data effectively for marine anomaly research. The sea surface temperature data employed in the predictions spans from 2018 to 2022 and originates from the Heuksando Tidal Observatory. Both traditional statistical ARIMA methods and advanced deep learning models, including long short-term memory (LSTM) and gated recurrent unit (GRU), were employed. Furthermore, prediction performance was evaluated using the attention LSTM technique. The technique integrated an attention mechanism into the sequence-to-sequence (s2s), further augmenting the performance of LSTM. The results showed that the attention LSTM model outperformed the other models, signifying its superior predictive performance. Additionally, fine-tuning hyperparameters can improve sea surface temperature performance.

Analyzing the Impact of Multivariate Inputs on Deep Learning-Based Reservoir Level Prediction and Approaches for Mid to Long-Term Forecasting (다변량 입력이 딥러닝 기반 저수율 예측에 미치는 영향 분석과 중장기 예측 방안)

  • Hyeseung Park;Jongwook Yoon;Hojun Lee;Hyunho Yang
    • The Transactions of the Korea Information Processing Society
    • /
    • v.13 no.4
    • /
    • pp.199-207
    • /
    • 2024
  • Local reservoirs are crucial sources for agricultural water supply, necessitating stable water level management to prepare for extreme climate conditions such as droughts. Water level prediction is significantly influenced by local climate characteristics, such as localized rainfall, as well as seasonal factors including cropping times, making it essential to understand the correlation between input and output data as much as selecting an appropriate prediction model. In this study, extensive multivariate data from over 400 reservoirs in Jeollabuk-do from 1991 to 2022 was utilized to train and validate a water level prediction model that comprehensively reflects the complex hydrological and climatological environmental factors of each reservoir, and to analyze the impact of each input feature on the prediction performance of water levels. Instead of focusing on improvements in water level performance through neural network structures, the study adopts a basic Feedforward Neural Network composed of fully connected layers, batch normalization, dropout, and activation functions, focusing on the correlation between multivariate input data and prediction performance. Additionally, most existing studies only present short-term prediction performance on a daily basis, which is not suitable for practical environments that require medium to long-term predictions, such as 10 days or a month. Therefore, this study measured the water level prediction performance up to one month ahead through a recursive method that uses daily prediction values as the next input. The experiment identified performance changes according to the prediction period and analyzed the impact of each input feature on the overall performance based on an Ablation study.

Matching prediction on Korean professional volleyball league (한국 프로배구 연맹의 경기 예측 및 영향요인 분석)

  • Heesook Kim;Nakyung Lee;Jiyoon Lee;Jongwoo Song
    • The Korean Journal of Applied Statistics
    • /
    • v.37 no.3
    • /
    • pp.323-338
    • /
    • 2024
  • This study analyzes the Korean professional volleyball league and predict match outcomes using popular machine learning classification methods. Match data from the 2012/2013 to 2022/2023 seasons for both male and female leagues were collected, including match details. Two different data structures were applied to the models: Separating matches results into two teams and performance differentials between the home and away teams. These two data structures were applied to construct a total of four predictive models, encompassing both male and female leagues. As specific variable values used in the models are unavailable before the end of matches, the results of the most recent 3 to 4 matches, up until just before today's match, were preprocessed and utilized as variables. Logistc Regrssion, Decision Tree, Bagging, Random Forest, Xgboost, Adaboost, and Light GBM, were employed for classification, and the model employing Random Forest showed the highest predictive performance. The results indicated that while significant variables varied by gender and data structure, set success rate, blocking points scored, and the number of faults were consistently crucial. Notably, our win-loss prediction model's distinctiveness lies in its ability to provide pre-match forecasts rather than post-event predictions.

Estimation of genetic correlations and genomic prediction accuracy for reproductive and carcass traits in Hanwoo cows

  • Md Azizul Haque;Asif Iqbal;Mohammad Zahangir Alam;Yun-Mi Lee;Jae-Jung Ha;Jong-Joo Kim
    • Journal of Animal Science and Technology
    • /
    • v.66 no.4
    • /
    • pp.682-701
    • /
    • 2024
  • This study estimated the heritabilities (h2) and genetic and phenotypic correlations between reproductive traits, including calving interval (CI), age at first calving (AFC), gestation length (GL), number of artificial inseminations per conception (NAIPC), and carcass traits, including carcass weight (CWT), eye muscle area (EMA), backfat thickness (BF), and marbling score (MS) in Korean Hanwoo cows. In addition, the accuracy of genomic predictions of breeding values was evaluated by applying the genomic best linear unbiased prediction (GBLUP) and the weighted GBLUP (WGBLUP) method. The phenotypic data for reproductive and carcass traits were collected from 1,544 Hanwoo cows, and all animals were genotyped using Illumina Bovine 50K single nucleotide polymorphism (SNP) chip. The genetic parameters were estimated using a multi-trait animal model using the MTG2 program. The estimated h2 for CI, AFC, GL, NAIPC, CWT, EMA, BF, and MS were 0.10, 0.13, 0.17, 0.11, 0.37, 0.35, 0.27, and 0.45, respectively, according to the GBLUP model. The GBLUP accuracy estimates ranged from 0.51 to 0.74, while the WGBLUP accuracy estimates for the traits under study ranged from 0.51 to 0.79. Strong and favorable genetic correlations were observed between GL and NAIPC (0.61), CWT and EMA (0.60), NAIPC and CWT (0.49), AFC and CWT (0.48), CI and GL (0.36), BF and MS (0.35), NAIPC and EMA (0.35), CI and BF (0.30), EMA and MS (0.28), CI and AFC (0.26), AFC and EMA (0.24), and AFC and BF (0.21). The present study identified low to moderate positive genetic correlations between reproductive and CWT traits, suggesting that a heavier body weight may lead to a longer CI, AFC, GL, and NAIPC. The moderately positive genetic correlation between CWT and AFC, and NAIPC, with a phenotypic correlation of nearly zero, suggesting that the genotype-environment interactions are more likely to be responsible for the phenotypic manifestation of these traits. As a result, the inclusion of these traits by breeders as selection criteria may present a good opportunity for developing a selection index to increase the response to the selection and identification of candidate animals, which can result in significantly increased profitability of production systems.