• Title/Summary/Keyword: 예보 매개변수들

Search Result 63, Processing Time 0.018 seconds

Prediction of groundwater level in the middle mountainous area of Pyoseon Watershed in Jeju Island using deep learning algorithm, LSTM (딥러닝 알고리즘 LSTM을 활용한 제주도 표선유역 중산간지역의 지하수위 예측)

  • Shin, Mun-Ju;Moon, Soo-Hyoung;Moon, Duk Chul
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2020.06a
    • /
    • pp.291-291
    • /
    • 2020
  • 제주도는 강수의 지표침투성이 좋은 화산섬의 지질특성상 지표수의 개발이용여건이 취약한 관계로 용수의 대부분을 지하수에 의존하고 있다. 따라서 제주도는 정책 및 연구적으로 오랜 기간동안 지하수의 보전관리에 많은 노력을 기울여 오고 있다. 하지만 최근 기후변화로 인한 강수의 변동성 증가로 인해 지하수위의 변동성 또한 증가할 가능성이 있으며 따라서 지하수위의 급격한 하강에 대비하여 지하수위의 예측 및 지하수 취수량 관리의 필요성이 요구되고 있다. 지하수에 절대적으로 의존하고 있는 제주도의 수자원 이용 여건을 고려할 때, 지하수의 취수량 관리를 위한 지하수위의 실시간 예측이 필요한 실정이다. 하지만 기존의 예측방법에 의한 제주도 지하수위 예측기간은 충분히 길지 않으며 예측기간이 길어지면 예측성능이 낮아지는 문제점이 있었다. 본 연구에서는 이러한 단점을 보완하기 위해 딥러닝 알고리즘인 Long Short Term Memory(LSTM)를 활용하여 제주도 남동쪽 표선유역 중산간지역의 1개 지하수위 관측정에 대해 지하수위를 예측하고 분석하였다. R 기반의 Keras 패키지에 있는 LSTM 알고리즘을 사용하였고, 입력자료는 인근의 성판악 및 교래 강우관측소의 일단위 강수량자료와 인근 취수정의 지하수 취수량자료 및 연구대상 관측정의 지하수위 자료를 사용하였으며, 사용된 자료의 기간은 2001년 2월 11일부터 2019년 10월 31일까지 이다. 2001년부터 13년의 보정 및 3년의 검증용 시계열자료를 사용하여 매개변수의 보정 및 과적합을 방지하였고, 3년의 예측용 시계열자료를 사용하여 LSTM 알고리즘의 예측성능을 평가하였다. 목표 예측일수는 1일, 10일, 20일, 30일로 설정하였으며 보정, 검증 및 예측기간에 대한 모의결과의 평가지수로는 Nash-Sutcliffe Efficiency(NSE)를 활용하였다. 모의결과, 보정, 검증 및 예측기간에 대한 1일 예측의 NSE는 각각 0.997, 0.997, 0.993 이었고, 10일 예측의 NSE는 각각 0.993, 0.912, 0.930 이었다. 20일 예측의 경우 NSE는 각각 0.809, 0.781, 0.809 이었으며 30일 예측의 경우 각각 0.677, 0.622, 0.633 이었다. 이것은 LSTM 알고리즘에 의한 10일 예측까지는 관측 지하수위 시계열자료를 매우 적절히 모의할 수 있다는 것을 의미하며, 20일 예측 또한 적절히 모의할 수 있다는 것을 의미한다. 따라서 LSTM 알고리즘을 활용하면 본 연구대상지점에 대한 2주일 또는 3주일의 안정적인 지하수위 예보가 가능하다고 판단된다. 또한 LSTM 알고리즘을 통한 실시간 지하수위 예측은 지하수 취수량 관리에 활용할 수 있을 것이다.

  • PDF

Flood Runoff Analysis using Radar Rainfall and Vflo Model for Namgang Dam Watershed (레이더강우와 Vflo모형을 이용한 남강댐유역 홍수유출해석)

  • Park, Jin-Hyeog;Kang, Boo-Sik;Lee, Geun-Sang;Lee, Eul-Rae
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.10 no.3
    • /
    • pp.13-21
    • /
    • 2007
  • Recently, very short-term rainfall forecast using radar is required for regional flash flood according to climate change. This research is to evaluate the feasibility of GIS based distributed model using radar rainfall which can express temporal and spatial distribution in actual dam watershed during flood runoff period. Vflo model which was developed Oklahoma university was used as physical based distributed model, and Namgang dam watershed ($2,293km^2$) was applied as study site. Distributed rainfall according to grid resolution was generated by using K-RainVieux, preprocess program of radar rainfall, from JIN radar. Also, GIS hydrological parameters were extracted from basic GIS data such as DEM, land cover and soil map, and used as input data of distributed model(Vflo). Results of this research can provide a base for building of real-time short-term rainfall runoff forecast system according to flash flood in near future.

  • PDF

Sentiment Analysis of Movie Review Using Integrated CNN-LSTM Mode (CNN-LSTM 조합모델을 이용한 영화리뷰 감성분석)

  • Park, Ho-yeon;Kim, Kyoung-jae
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.4
    • /
    • pp.141-154
    • /
    • 2019
  • Rapid growth of internet technology and social media is progressing. Data mining technology has evolved to enable unstructured document representations in a variety of applications. Sentiment analysis is an important technology that can distinguish poor or high-quality content through text data of products, and it has proliferated during text mining. Sentiment analysis mainly analyzes people's opinions in text data by assigning predefined data categories as positive and negative. This has been studied in various directions in terms of accuracy from simple rule-based to dictionary-based approaches using predefined labels. In fact, sentiment analysis is one of the most active researches in natural language processing and is widely studied in text mining. When real online reviews aren't available for others, it's not only easy to openly collect information, but it also affects your business. In marketing, real-world information from customers is gathered on websites, not surveys. Depending on whether the website's posts are positive or negative, the customer response is reflected in the sales and tries to identify the information. However, many reviews on a website are not always good, and difficult to identify. The earlier studies in this research area used the reviews data of the Amazon.com shopping mal, but the research data used in the recent studies uses the data for stock market trends, blogs, news articles, weather forecasts, IMDB, and facebook etc. However, the lack of accuracy is recognized because sentiment calculations are changed according to the subject, paragraph, sentiment lexicon direction, and sentence strength. This study aims to classify the polarity analysis of sentiment analysis into positive and negative categories and increase the prediction accuracy of the polarity analysis using the pretrained IMDB review data set. First, the text classification algorithm related to sentiment analysis adopts the popular machine learning algorithms such as NB (naive bayes), SVM (support vector machines), XGboost, RF (random forests), and Gradient Boost as comparative models. Second, deep learning has demonstrated discriminative features that can extract complex features of data. Representative algorithms are CNN (convolution neural networks), RNN (recurrent neural networks), LSTM (long-short term memory). CNN can be used similarly to BoW when processing a sentence in vector format, but does not consider sequential data attributes. RNN can handle well in order because it takes into account the time information of the data, but there is a long-term dependency on memory. To solve the problem of long-term dependence, LSTM is used. For the comparison, CNN and LSTM were chosen as simple deep learning models. In addition to classical machine learning algorithms, CNN, LSTM, and the integrated models were analyzed. Although there are many parameters for the algorithms, we examined the relationship between numerical value and precision to find the optimal combination. And, we tried to figure out how the models work well for sentiment analysis and how these models work. This study proposes integrated CNN and LSTM algorithms to extract the positive and negative features of text analysis. The reasons for mixing these two algorithms are as follows. CNN can extract features for the classification automatically by applying convolution layer and massively parallel processing. LSTM is not capable of highly parallel processing. Like faucets, the LSTM has input, output, and forget gates that can be moved and controlled at a desired time. These gates have the advantage of placing memory blocks on hidden nodes. The memory block of the LSTM may not store all the data, but it can solve the CNN's long-term dependency problem. Furthermore, when LSTM is used in CNN's pooling layer, it has an end-to-end structure, so that spatial and temporal features can be designed simultaneously. In combination with CNN-LSTM, 90.33% accuracy was measured. This is slower than CNN, but faster than LSTM. The presented model was more accurate than other models. In addition, each word embedding layer can be improved when training the kernel step by step. CNN-LSTM can improve the weakness of each model, and there is an advantage of improving the learning by layer using the end-to-end structure of LSTM. Based on these reasons, this study tries to enhance the classification accuracy of movie reviews using the integrated CNN-LSTM model.