Acknowledgement
This work is financially supported by Korea Ministry of Environment(MOE) Graduate School specialized in Integrated Pollution Prevention and Control Project.
References
- K. Al-Gumaei, A. Muller, J. N. Weskamp, C. S. Longo, F. Pethig and S. Windmann, "Scalable Analytics Platform for Machine Learning in Smart Production Systems",2019 24th IEEE International Conference on Emerging Technologies and Factory Automation (ETFA), 2019, pp. 1155-1162, DOI: https://doi.org/10.1109/ETFA.2019.8869075
- Byung-Hyeon Yoo, Debrani Devi, Hyeon-Woo Kim, Hwa-Jeon Song, Kyung-Moon Park, and Seong-Won Lee, "A Survey on Recent Advances in Multi-Agent Reinforcement Learning," Electronics and Telecommunications Trends, vol. 35, no. 6, pp. 137-149, Dec. 2020. DOI: https://doi.org/10.22648/ETRI.2020.J.350614
- Kim, Bum-Kyu, Hong-Joo Yoon, and Jun Ho Lee. "A Study on the Distribution of Cold Water Occurrence using K-Means Clustering." The Journal of the Korea institute of electronic communication sciences 16. 2, pp.371-378, 2021. DOI: https://doi.org/10.13067/JKIECS.2021.16.2.371
- M. Tan, "Multi-agent reinforcement learning: Independent vs. cooperative agents." in Proc. of the Tenth International Conference on Machine Learning (ICML), pp.330-337, 1993. DOI: https://doi.org/10.1016/B978-1-55860-307-3.50049-6
- C. Watkins, "Learning from delayed rewards," Ph.D. Thesis, University of Cambridge England, 1989
- V. Mnih, et al., "Human-level control through deep reinforcement learning," Nature, pp.529-533, 201 DOI: https://doi.org/10.1038/nature14236
- J. N. Foerster, et al., "Stabilising experience replay for deep multi-agent reinforcement learning" in Proceedings of The 34th International Conference on Machine Learning (ICML), pp.1146-1155, 2017 DOI: https://doi.org/10.48550/arXiv.1702.08887
- Fei Chen and Wei Ren, "On the Control of Multi-Agent Systems: A Survey", Foundations and Trends® in Systems and Control: Vol. 6: No. 4, pp 339-499, 2019 DOI: http://dx.doi.org/10.1561/2600000019