Basin-Wide Multi-Reservoir Operation Using Reinforcement Learning

강화학습법을 이용한 유역통합 저수지군 운영

  • 이진희 (한국건설기술연구원 수자원연구부) ;
  • 심명필 (인하대학교 환경토목공학부 토목공학과)
  • Published : 2006.05.18

Abstract

The analysis of large-scale water resources systems is often complicated by the presence of multiple reservoirs and diversions, the uncertainty of unregulated inflows and demands, and conflicting objectives. Reinforcement learning is presented herein as a new approach to solving the challenging problem of stochastic optimization of multi-reservoir systems. The Q-Learning method, one of the reinforcement learning algorithms, is used for generating integrated monthly operation rules for the Keum River basin in Korea. The Q-Learning model is evaluated by comparing with implicit stochastic dynamic programming and sampling stochastic dynamic programming approaches. Evaluation of the stochastic basin-wide operational models considered several options relating to the choice of hydrologic state and discount factors as well as various stochastic dynamic programming models. The performance of Q-Learning model outperforms the other models in handling of uncertainty of inflows.

Keywords