DOI QR코드

DOI QR Code

Reinforcement learning multi-agent using unsupervised learning in a distributed cloud environment

  • Gu, Seo-Yeon (Department of Computer Science, Kwangwoon University) ;
  • Moon, Seok-Jae (Department of Artificial Intelligence Institute of Information Technology, KwangWoon University) ;
  • Park, Byung-Joon (Department of Computer Science, Kwangwoon University)
  • Received : 2022.04.18
  • Accepted : 2022.04.22
  • Published : 2022.05.31

Abstract

Companies are building and utilizing their own data analysis systems according to business characteristics in the distributed cloud. However, as businesses and data types become more complex and diverse, the demand for more efficient analytics has increased. In response to these demands, in this paper, we propose an unsupervised learning-based data analysis agent to which reinforcement learning is applied for effective data analysis. The proposal agent consists of reinforcement learning processing manager and unsupervised learning manager modules. These two modules configure an agent with k-means clustering on multiple nodes and then perform distributed training on multiple data sets. This enables data analysis in a relatively short time compared to conventional systems that perform analysis of large-scale data in one batch.

Keywords

Acknowledgement

This work is financially supported by Korea Ministry of Environment(MOE) Graduate School specialized in Integrated Pollution Prevention and Control Project.

References

  1. K. Al-Gumaei, A. Muller, J. N. Weskamp, C. S. Longo, F. Pethig and S. Windmann, "Scalable Analytics Platform for Machine Learning in Smart Production Systems",2019 24th IEEE International Conference on Emerging Technologies and Factory Automation (ETFA), 2019, pp. 1155-1162, DOI: https://doi.org/10.1109/ETFA.2019.8869075
  2. Byung-Hyeon Yoo, Debrani Devi, Hyeon-Woo Kim, Hwa-Jeon Song, Kyung-Moon Park, and Seong-Won Lee, "A Survey on Recent Advances in Multi-Agent Reinforcement Learning," Electronics and Telecommunications Trends, vol. 35, no. 6, pp. 137-149, Dec. 2020. DOI: https://doi.org/10.22648/ETRI.2020.J.350614
  3. Kim, Bum-Kyu, Hong-Joo Yoon, and Jun Ho Lee. "A Study on the Distribution of Cold Water Occurrence using K-Means Clustering." The Journal of the Korea institute of electronic communication sciences 16. 2, pp.371-378, 2021. DOI: https://doi.org/10.13067/JKIECS.2021.16.2.371
  4. M. Tan, "Multi-agent reinforcement learning: Independent vs. cooperative agents." in Proc. of the Tenth International Conference on Machine Learning (ICML), pp.330-337, 1993. DOI: https://doi.org/10.1016/B978-1-55860-307-3.50049-6
  5. C. Watkins, "Learning from delayed rewards," Ph.D. Thesis, University of Cambridge England, 1989
  6. V. Mnih, et al., "Human-level control through deep reinforcement learning," Nature, pp.529-533, 201 DOI: https://doi.org/10.1038/nature14236
  7. J. N. Foerster, et al., "Stabilising experience replay for deep multi-agent reinforcement learning" in Proceedings of The 34th International Conference on Machine Learning (ICML), pp.1146-1155, 2017 DOI: https://doi.org/10.48550/arXiv.1702.08887
  8. Fei Chen and Wei Ren, "On the Control of Multi-Agent Systems: A Survey", Foundations and Trends® in Systems and Control: Vol. 6: No. 4, pp 339-499, 2019 DOI: http://dx.doi.org/10.1561/2600000019