DOI QR코드

DOI QR Code

Study of Selection of Regression Equation for Flow-conditions using Machine-learning Method: Focusing on Nakdonggang Waterbody

머신러닝 기법을 활용한 유황별 LOADEST 모형의 적정 회귀식 선정 연구: 낙동강 수계를 중심으로

  • Kim, Jonggun (Department of Regional Infrastructures Engineering, Kangwon National University) ;
  • Park, Youn Shik (Department of Rural Construction Engineering, Kongju National University) ;
  • Lee, Seoro (Department of Regional Infrastructures Engineering, Kangwon National University) ;
  • Shin, Yongchul (School of Agricultural Civil and Bio-Industrial Engineering, Kyungpook National University) ;
  • Lim, Kyoung Jae (Department of Regional Infrastructures Engineering, Kangwon National University) ;
  • Kim, Ki-sung (Department of Regional Infrastructures Engineering, Kangwon National University)
  • Received : 2017.04.24
  • Accepted : 2017.07.20
  • Published : 2017.07.31

Abstract

This study is to determine the coefficients of regression equations and to select the optimal regression equation in the LOADEST model after classifying the whole study period into 5 flow conditions for 16 watersheds located in the Nakdonggang waterbody. The optimized coefficients of regression equations were derived using the gradient descent method as a learning method in Tensorflow which is the engine of machine-learning method. In South Korea, the variability of streamflow is relatively high, and rainfall is concentrated in summer that can significantly affect the characteristic analysis of pollutant loads. Thus, unlike the previous application of the LOADEST model (adjusting whole study period), the study period was classified into 5 flow conditions to estimate the optimized coefficients and regression equations in the LOADEST model. As shown in the results, the equation #9 which has 7 coefficients related to flow and seasonal characteristics was selected for each flow condition in the study watersheds. When compared the simulated load (SS) to observed load, the simulation showed a similar pattern to the observation for the high flow condition due to the flow parameters related to precipitation directly. On the other hand, although the simulated load showed a similar pattern to observation in several watersheds, most of study watersheds showed large differences for the low flow conditions. This is because the pollutant load during low flow conditions might be significantly affected by baseflow or point-source pollutant load. Thus, based on the results of this study, it can be found that to estimate the continuous pollutant load properly the regression equations need to be determined with proper coefficients based on various flow conditions in watersheds. Furthermore, the machine-learning method can be useful to estimate the coefficients of regression equations in the LOADEST model.

Keywords

References

  1. Arnold, J. G., R. Srinivasan, R. S. Muttiah, and J. R. Williams, 1998. Large area hydrologic modeling and assessment - Part 1: Model development. Journal of the American Water Resources Association 34(1): 73-89. https://doi.org/10.1111/j.1752-1688.1998.tb05961.x
  2. Dean, J., G. S. Corrado, R. Monga, K. Chen, M. Devin, Q. V. Le, M. Z. Mao, M. Ranzato, A. Senior, P. Tucker, K. Yang, and A. Y. Ng, 2012. Large scale distributed deep networks. In Proceedings of NIPS, pp. 1232-1240.
  3. Haith, D. A., R. Mandel, and R. S. Wu, 1992. GWLF, generalized watershed loading functions, version 2.0, user's manual; Dept. of Agricultural & Biological Engineering, Cornell University: Ithaca, NY, USA.
  4. Jha, B. and M. K. Jha, 2013. Rating Curve Estimation of Surface Water Quality Data Using LOADEST. Journal of Environmental Protection 4: 849-856. https://doi.org/10.4236/jep.2013.48099
  5. Kang, H. W., Y. S. Park, J. Kim, W. S. Jang, J. C. Ryu, N. W. Kim, D. S. Kong, and K. J. Lim, 2010. Enhancement of SWAT Auto-calibration using K-means Clustering Algorithm. 2010 Autumn Conference of the Korean Society on Water Quality & Korean Society of Water and Wastewater, pp. 197-198. [Korean Literature]
  6. Kim, S. D., D. K. Kang, M. S. Kim, and H. S. Shin, 2007. The Possibility of Daily Flow Data Generation from 8-Day Intervals Measured Flow Data for Calibrating Watershed Model, Journal of Korean Society on Water Environment 23(1): 64-71. [Korean Literature]
  7. Lee, J., H. Kwon, and H. Choi, 2014. Evaluation of pollution level attributed to nonpoint sources in Nakdonggan Basin, Korea, Journal of Environmental Impact Assessment 23(5): 393-405. [Korean Literature] https://doi.org/10.14249/eia.2014.23.5.393
  8. Marcin Andrychowicz, Misha Denil, Sergio Gomez Colmenarejo, Matthew W. Hoffman, David Pfau, Tom Schaul, Brendan Shillingford, Nando de Freitas, 2016. Learning to learn by gradient descent. 30th Conference on Neural Information Processing Systems (NIPS 2016), Barcelona, Spain.
  9. Ministry of Environment (ME), 2004. Total Maximum Daily Loads Handbook. Ministry of Environment, Ministry of Environment (ME), pp. 1-69. [Korean Literature]
  10. National Institute of Environment Research (NIER), 2013. The Study on the Optimum Assessment Methods for Achievement of Target Water Quality and Estimation of Allocation Loads Using a Dynamic Model, NIER-RP2013-274, National Institute of Environment Research, pp. 1-31. [Korean Literature]
  11. Oh, J., T. Sinha, and A. Sankarasubramanian, 2013. The role of retrospective weather forecasts in developing daily forecasts of nutrient loadings over the Southeast US. Hydrology and Earth System Sciences Discussions 10: 15625-15657. https://doi.org/10.5194/hessd-10-15625-2013
  12. Park, Y. S. and B. A. Engel, 2014. Use of pollutant load regression models with various sampling frequencies for annual load estimation. Water 6: 1685-1697. https://doi.org/10.3390/w6061685
  13. Park, Y. S., J. M. Lee, Y. Jung, M. H. Shin, J. H. Park, H. Hwang, J. Ryu, J. Park, and K. Kim, 2015. Evaluation of regression models in LOADEST to estimate suspended solid load in Hangang waterbody. Journal of the Korean Society of Agricultural Engineers 57(2): 37-45. https://doi.org/10.5389/KSAE.2015.57.2.037
  14. Shin, M. H., C. H. Won, Y. H. Choi, K. C. Kim, J. Y. Seo, K. J. Lim, and J. D. Choi, 2008. Estimation of LOADEST Model application for NPS pollutants from agricultural watershed. the 2008 KSAE Annual Conference 24 October, poster No. 6.
  15. USEPA, 2001. Better Assessment Science Integrating Point and Nonpoint Sources (BASINS) v. 3.0 User's Manual; U. S. Environment Protection Agency: Washington, D.C., USA.
  16. Ramanarayanan, T. S., J. R. Williams, W. A. Dugas, L. M. Hauck, and A. M. S. McFarland, 1997. Using APEX to identify alternative practices for animal waste management, Minneapolis, MN. Paper No. 97-2209.
  17. Runkel, R. L., C. G. Crawford, and T. A. Cohn, 2004. Load estimator (LOADEST): A FORTRAN program for estimating constituent loads in streams and rivers, U.S. Geological Survey Techniques and Methods Book 4, Chapter A5, pp. 69.