• Title/Summary/Keyword: Memory-Based Learning

Search Result 574, Processing Time 0.029 seconds

Prediction in Dissolved Oxygen Concentration and Occurrence of Hypoxia Water Mass in Jinhae Bay Based on Machine Learning Model (기계학습 모형 기반 진해만 용존산소농도 및 빈산소수괴 발생 예측)

  • Park, Seongsik;Kim, Byeong Kuk;Kim, Kyunghoi
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.34 no.3
    • /
    • pp.47-57
    • /
    • 2022
  • We carried out studies on prediction in concentration of dissolved oxygen (DO) with LSTM model and prediction in occurrence of hypoxia water mass (HWM) with decision tree. As results of study on prediction in DO concentration, a large number of Hidden node caused high complexity of model and required enough Epoch. And it was high accuracy in long Sequence length as prediction time step increased. The results of prediction in occurrence of HWM showed that the accuracy of nonHWM case was 66.1% in 30 day prediction, it was higher than 37.5% of HWM case. The reason is that the decision tree might overestimate DO concentration.

Tunnel wall convergence prediction using optimized LSTM deep neural network

  • Arsalan, Mahmoodzadeh;Mohammadreza, Taghizadeh;Adil Hussein, Mohammed;Hawkar Hashim, Ibrahim;Hanan, Samadi;Mokhtar, Mohammadi;Shima, Rashidi
    • Geomechanics and Engineering
    • /
    • v.31 no.6
    • /
    • pp.545-556
    • /
    • 2022
  • Evaluation and optimization of tunnel wall convergence (TWC) plays a vital role in preventing potential problems during tunnel construction and utilization stage. When convergence occurs at a high rate, it can lead to significant problems such as reducing the advance rate and safety, which in turn increases operating costs. In order to design an effective solution, it is important to accurately predict the degree of TWC; this can reduce the level of concern and have a positive effect on the design. With the development of soft computing methods, the use of deep learning algorithms and neural networks in tunnel construction has expanded in recent years. The current study aims to employ the long-short-term memory (LSTM) deep neural network predictor model to predict the TWC, based on 550 data points of observed parameters developed by collecting required data from different tunnelling projects. Among the data collected during the pre-construction and construction phases of the project, 80% is randomly used to train the model and the rest is used to test the model. Several loss functions including root mean square error (RMSE) and coefficient of determination (R2) were used to assess the performance and precision of the applied method. The results of the proposed models indicate an acceptable and reliable accuracy. In fact, the results show that the predicted values are in good agreement with the observed actual data. The proposed model can be considered for use in similar ground and tunneling conditions. It is important to note that this work has the potential to reduce the tunneling uncertainties significantly and make deep learning a valuable tool for planning tunnels.

Predicting water temperature and water quality in a reservoir using a hybrid of mechanistic model and deep learning model (역학적 모델과 딥러닝 모델을 결합한 저수지 수온 및 수질 예측)

  • Sung Jin Kim;Se Woong Chung
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.150-150
    • /
    • 2023
  • 기작기반의 역학적 모델과 자료기반의 딥러닝 모델은 수질예측에 다양하게 적용되고 있으나, 각각의 모델은 고유한 구조와 가정으로 인해 장·단점을 가지고 있다. 특히, 딥러닝 모델은 우수한 예측 성능에도 불구하고 훈련자료가 부족한 경우 오차와 과적합에 따른 분산(variance) 문제를 야기하며, 기작기반 모델과 달리 물리법칙이 결여된 예측 결과를 생산할 수 있다. 본 연구의 목적은 주요 상수원인 댐 저수지를 대상으로 수심별 수온과 탁도를 예측하기 위해 기작기반과 자료기반 모델의 장점을 융합한 PGDL(Process-Guided Deep Learninig) 모델을 개발하고, 물리적 법칙 만족도와 예측 성능을 평가하는데 있다. PGDL 모델 개발에 사용된 기작기반 및 자료기반 모델은 각각 CE-QUAL-W2와 순환 신경망 딥러닝 모델인 LSTM(Long Short-Term Memory) 모델이다. 각 모델은 2020년 1월부터 12월까지 소양강댐 댐 앞의 K-water 자동측정망 지점에서 실측한 수온과 탁도 자료를 이용하여 각각 보정하고 훈련하였다. 수온 및 탁도 예측을 위한 PGDL 모델의 주요 알고리즘은 LSTM 모델의 목적함수(또는 손실함수)에 실측값과 예측값의 오차항 이외에 역학적 모델의 에너지 및 질량 수지 항을 제약 조건에 추가하여 예측결과가 물리적 보존법칙을 만족하지 않는 경우 penalty를 부가하여 매개변수를 최적화시켰다. 또한, 자료 부족에 따른 LSTM 모델의 예측성능 저하 문제를 극복하기 위해 보정되지 않은 역학적 모델의 모의 결과를 모델의 훈련자료로 사용하는 pre-training 기법을 활용하여 실측자료 비율에 따른 모델의 예측성능을 평가하였다. 연구결과, PGDL 모델은 저수지 수온과 탁도 예측에 있어서 경계조건을 통한 에너지와 질량 변화와 저수지 내 수온 및 탁도 증감에 따른 공간적 에너지와 질량 변화의 일치도에 있어서 LSTM보다 우수하였다. 또한 역학적 모델 결과를 LSTM 모델의 훈련자료의 일부로 사용한 PGDL 모델은 적은 양의 실측자료를 사용하여도 CE-QUAL-W2와 LSTM 보다 우수한 예측 성능을 보였다. 연구결과는 다차원의 역학적 수리수질 모델과 자료기반 딥러닝 모델의 장점을 결합한 새로운 모델링 기술의 적용 가능성을 보여주며, 자료기반 모델의 훈련자료 부족에 따른 예측 성능 저하 문제를 극복하기 위해 역학적 모델이 유용하게 활용될 수 있음을 시사한다.

  • PDF

Malware Detection Using Deep Recurrent Neural Networks with no Random Initialization

  • Amir Namavar Jahromi;Sattar Hashemi
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.8
    • /
    • pp.177-189
    • /
    • 2023
  • Malware detection is an increasingly important operational focus in cyber security, particularly given the fast pace of such threats (e.g., new malware variants introduced every day). There has been great interest in exploring the use of machine learning techniques in automating and enhancing the effectiveness of malware detection and analysis. In this paper, we present a deep recurrent neural network solution as a stacked Long Short-Term Memory (LSTM) with a pre-training as a regularization method to avoid random network initialization. In our proposal, we use global and short dependencies of the inputs. With pre-training, we avoid random initialization and are able to improve the accuracy and robustness of malware threat hunting. The proposed method speeds up the convergence (in comparison to stacked LSTM) by reducing the length of malware OpCode or bytecode sequences. Hence, the complexity of our final method is reduced. This leads to better accuracy, higher Mattews Correlation Coefficients (MCC), and Area Under the Curve (AUC) in comparison to a standard LSTM with similar detection time. Our proposed method can be applied in real-time malware threat hunting, particularly for safety critical systems such as eHealth or Internet of Military of Things where poor convergence of the model could lead to catastrophic consequences. We evaluate the effectiveness of our proposed method on Windows, Ransomware, Internet of Things (IoT), and Android malware datasets using both static and dynamic analysis. For the IoT malware detection, we also present a comparative summary of the performance on an IoT-specific dataset of our proposed method and the standard stacked LSTM method. More specifically, of our proposed method achieves an accuracy of 99.1% in detecting IoT malware samples, with AUC of 0.985, and MCC of 0.95; thus, outperforming standard LSTM based methods in these key metrics.

Quality Control Plan of Water Level in Agricultural Reservoirs using a Deep-Learning Based LSTM Model (딥러닝 기반 LSTM 모형을 이용한 농업용 저수지 수위자료 품질관리 방안)

  • Yang, Mi-Hye;Nam, Won-Ho;Shin, An-Kook;Kang, Mun-Sung;Kim, Taegon
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2020.06a
    • /
    • pp.128-128
    • /
    • 2020
  • 최근 농업환경의 변화와 기후변화에 대응하기 위해 농업용수 관리 정보화 및 과학화의 필요성이 증대되어 실시간으로 저수지 저수량과 농업용수 공급량을 파악하기 위해 자동 수위계측시설이 도입되었다. 농림축산식품부의 저수지 자동수위측정기 설치 및 운영지침에 따라 현재 농어촌공사 관리 저수지 1,734개소 및 수로부 1,880개소에 자동수위계가 설치되어 있으며, 저수지와 수로에서 10분 간격으로 수위자료가 생성되고 있다. 농업용 저수지 수문자료의 공인지점은 2016년 6개소에서 2019년 49개소로 증대되고 있으며, 데이터 품질 저하의 최소화 및 신뢰성 있는 수문자료 생성의 필요성이 증가함에 따라 농업용 저수지의 특성을 반영한 저수지 수위 오결측 데이터 보정 방안 및 수문 자료 품질관리 방안이 요구된다. 농업용 저수지의 수위 변화 및 강우-유출 현상은 물리적 모형을 구축하여 기상, 지형 등 영향 인자와 수위(또는 유출)와의 상관관계를 분석하는 것은 무적으로 불가능하였지만, 최근 인공신경망 (Artificial Neural Network, ANN) 등과 같이 black-box 형태의 모형을 이용하여 비선형적인 수문해석이 가능해졌다. 본 연구에서는 빅데이터와 인공신경망을 결합시킨 알고리즘인 딥러닝 (Deep Learning) 기반의 LSTM (Long Short-Term Memory) 모형을 활용하여 농업용 저수지 수위자료를 검토하여 자동계측기에서 발생하는 오류 보정을 위해 품질관리 방안을 제시하고자 한다.

  • PDF

Prediction of rainfall abstraction based on deep learning considering watershed and rainfall characteristic factors (유역 및 강우 특성인자를 고려한 딥러닝 기반의 강우손실 예측)

  • Jeong, Minyeob;Kim, Dae-Hong;Kim, Seokgyun
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2022.05a
    • /
    • pp.37-37
    • /
    • 2022
  • 유효우량 산정을 위하여 국내에서 주로 사용되는 모형은 NRCS-CN(Natural Resources Conservation Service - curve number) 모형으로, 유역의 유출 능력을 나타내는 유출곡선지수(runoff curve number, CN)와 같은 NRCS-CN 모형의 매개변수들은 관측 강우-유출자료 또는 토양도, 토지피복지도 등을 이용하여 유역마다 결정된 값이 사용되고 있다. 그러나 유역의 CN값은 유역의 토양 상태와 같은 환경적 조건에 따라 달라질 수 있으며, 이를 반영하기 위하여 선행토양함수조건(antecedent moisture condition, AMC)을 이용하여 CN값을 조정하는 방법이 사용되고 있으나, AMC 조건에 따른 CN 값의 갑작스런 변화는 유출량의 극단적인 변화를 가져올 수 있다. NRCS-CN 모형과 더불어 강우 손실량 산정에 많이 사용되는 모형으로 Green-Ampt 모형이 있다. Green-Ampt 모형은 유역에서 발생하는 침투현상의 물리적 과정을 고려하는 모형이라는 장점이 있으나, 모형에 활용되는 다양한 물리적인 매개변수들을 산정하기 위해서는 유역에 대한 많은 조사가 선행되어야 한다. 또한 이렇게 산정된 매개변수들은 유역 내 토양이나 식생 조건 등에 따른 여러 불확실성을 내포하고 있어 실무적용에 어려움이 있다. 따라서 본 연구에서는, 현재 사용되고 있는 강우손실 모형들의 매개변수를 추정하기 위한 방법을 제시하고자 하였다. 본 연구에서 제시하는 방법은 인공지능(AI) 기술 중 하나인 딥러닝(deep-learning) 기법을 기반으로 하고 있으며, 딥러닝 모형으로는 장단기 메모리(Long Short-Term Memory, LSTM) 모형이 활용되었다. 딥러닝 모형의 입력 데이터는 유역에서의 강우특성이나 토양수분, 증발산, 식생 특성들을 나타내는 인자이며, 모의 결과는 유역에서 발생한 총 유출량으로 강우손실 모형들의 매개변수 값들은 이들을 활용하여 도출될 수 있다. 산정된 매개변수 값들을 강우손실 모형에 적용하여 실제 유역들에서의 유효우량 산정에 활용해보았으며, 동역학파 기반의 강우-유출 모형을 사용하여 유출을 예측해보았다. 예측된 유출수문곡선을 관측 자료와 비교 시 NSE=0.5 이상으로 산정되어 유출이 적절히 예측되었음을 확인했다.

  • PDF

Change in Cognitive Function after Antipsychotics Treatment : A Pilot Study of Long-Acting Injectable versus Oral Form (항정신병약물 치료 후 인지기능 변화 차이 연구 : 장기 지속형 주사제와 경구제 비교의 예비 연구)

  • Sung, Kiyoung;Kim, Seoyoung;Kim, Euitae
    • Korean Journal of Schizophrenia Research
    • /
    • v.21 no.2
    • /
    • pp.74-80
    • /
    • 2018
  • Objectives : This study investigated whether long-acting injectable (LAI) paliperidone is different from its oral form in terms of the effect on cognitive function in schizophrenia spectrum and other psychotic disorders. Methods : We reviewed the medical records of patients in Seoul National University Bundang Hospital who were diagnosed as having schizophrenia and/or other psychotic disorders based on DSM-5 from 2016 to 2017. Seven patients were treated with oral paliperidone and 11 were treated with paliperidone palmitate. All patients underwent clinical and neuropsychological assessment, including the Korean version of the MATRICS Consensus Cognitive Battery (MCCB) at their first visit or within one month of their initial treatment. MCCB was repeated within three to 12 months after the initial assessment. Results : There was no significant difference between the two groups in most cognitive domains including speed of processing, attention and vigilance, working memory, verbal learning, visual learning and reasoning and problem solving domain. However, patients treated with paliperidone palmitate showed better improvement in social cognition domain than those taking oral paliperidone. The standardized values of social cognition domain scores had significantly improved over time in patients under paliperidone palmitate, demonstrating a significant time-by-group interaction. Conclusion : Our results show that long-acting injectable paliperidone could be helpful in some aspects of improving cognitive function in schizophrenia spectrum and other psychotic disorders. Further studies with other antipsychotics are necessary to generalize the results.

MAGICal Synthesis: Memory-Efficient Approach for Generative Semiconductor Package Image Construction (MAGICal Synthesis: 반도체 패키지 이미지 생성을 위한 메모리 효율적 접근법)

  • Yunbin Chang;Wonyong Choi;Keejun Han
    • Journal of the Microelectronics and Packaging Society
    • /
    • v.30 no.4
    • /
    • pp.69-78
    • /
    • 2023
  • With the rapid growth of artificial intelligence, the demand for semiconductors is enormously increasing everywhere. To ensure the manufacturing quality and quantity simultaneously, the importance of automatic defect detection during the packaging process has been re-visited by adapting various deep learning-based methodologies into automatic packaging defect inspection. Deep learning (DL) models require a large amount of data for training, but due to the nature of the semiconductor industry where security is important, sharing and labeling of relevant data is challenging, making it difficult for model training. In this study, we propose a new framework for securing sufficient data for DL models with fewer computing resources through a divide-and-conquer approach. The proposed method divides high-resolution images into pre-defined sub-regions and assigns conditional labels to each region, then trains individual sub-regions and boundaries with boundary loss inducing the globally coherent and seamless images. Afterwards, full-size image is reconstructed by combining divided sub-regions. The experimental results show that the images obtained through this research have high efficiency, consistency, quality, and generality.

Life prediction of IGBT module for nuclear power plant rod position indicating and rod control system based on SDAE-LSTM

  • Zhi Chen;Miaoxin Dai;Jie Liu;Wei Jiang;Yuan Min
    • Nuclear Engineering and Technology
    • /
    • v.56 no.9
    • /
    • pp.3740-3749
    • /
    • 2024
  • To reduce the losses caused by aging failure of insulation gate bipolar transistor (IGBT), which is the core components of nuclear power plant rod position indicating and rod control (RPC) system. It is necessary to conduct studies on its life prediction. The selection of IGBT failure characteristic parameters in existing research relies heavily on failure principles and expert experience. Moreover, the analysis and learning of time-domain degradation data have not been fully conducted, resulting in low prediction efficiency as the monotonicity, time correlation, and poor anti-interference ability of extracted degradation features. This paper utilizes the advantages of the stacked denoising autoencoder(SDAE) network in adaptive feature extraction and denoising capabilities to perform adaptive feature extraction on IGBT time-domain degradation data; establishes a long-short-term memory (LSTM) prediction model, and optimizes the learning rate, number of nodes in the hidden layer, and number of hidden layers using the Gray Wolf Optimization (GWO) algorithm; conducts verification experiments on the IGBT accelerated aging dataset provided by NASA PCoE Research Center, and selects performance evaluation indicators to compare and analyze the prediction results of the SDAE-LSTM model, PSOLSTM model, and BP model. The results show that the SDAE-LSTM model can achieve more accurate and stable IGBT life prediction.

Lexical Discovery and Consolidation Strategies of Proficient and Less Proficient EFL Vocational High School Learners

  • Chon, Yuah Vicky;Kim, You-Hee
    • English Language & Literature Teaching
    • /
    • v.17 no.3
    • /
    • pp.27-56
    • /
    • 2011
  • The analysis on the use of lexical discovery and consolidation strategies that have been researched within the area of vocabulary learning strategies (VLS) have not sufficiently drawn the interest of EFL practitioners with regard to vocational high school learners. The results, however, are expected to have implications for the design of vocabulary tasks and instructional materials for EFL learners. The present study investigates EFL vocational high school learners' use of lexical discovery and consolidation strategies with questionnaires, where the use of the learners' lexical discovery strategies were further validated with the think-aloud methodology by asking samples of proficient and less proficient learners to report on their reading process while reading L2 texts that had not been exposed to the learners. The results indicated that there were significant differences between the two groups of learners in the employment of 11 of the strategies which were in the categories of determination, social, memory, and metacognitive strategies, but not for cognitive strategies. The pattern of strategies indicated that different lexical discovery and consolidation strategies were employed relatively more by one proficiency group than another. The study suggests some implications for how strategy-based instruction can be implemented in EFL classrooms.

  • PDF