Research Trends on Deep Reinforcement Learning

Jang, S.Y.; Yoon, H.J.; Park, N.S.; Yun, J.K.; Son, Y.S.

doi:10.22648/ETRI.2019.J.340401

Electronics and Telecommunications Trends (전자통신동향분석)

Volume 34, Issue 4 / Pages 1-14 / 2019 / pISSN 1225-6455

Electronics and Telecommunications Research Institute (한국전자통신연구원)

Korean title: 심층 강화학습 기술 동향 (Deep Reinforcement Learning Technology Trends)

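Jang, S.Y. (장수영), Urban and Spatial ICT Research Laboratory;
Yoon, H.J. (윤현진), Urban and Spatial ICT Research Laboratory;
Park, N.S. (박노삼), Urban and Spatial ICT Research Laboratory;
Yun, J.K. (윤재관), Urban and Spatial ICT Research Laboratory;
Son, Y.S. (손영성), Autonomous IoT Research Laboratory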

Published : 2019.08.01

https://doi.org/10.22648/ETRI.2019.J.340401

Abstract

Recent research on deep reinforcement learning (DRL) has considerably improved DRL algorithms in terms of performance, learning stability, and computational efficiency. It has also vastly expanded the scenarios that DRL can cover, including partial observability; cooperation, competition, coexistence, and communication among multiple agents; multi-task learning; and decentralized intelligence. These advances have stimulated research on multi-agent reinforcement learning. DRL applications are likewise expanding from robotics, natural language processing, and computer vision into a wide array of fields such as finance, healthcare, chemistry, and even art. In this report, we briefly summarize various DRL techniques and research directions.

Acknowledgement

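Grant: Core Technologies for Distributed Intelligence in Hyper-Connected Spaces (초연결 공간의 분산 지능 핵심원천 기술)

Supported by: Electronics and Telecommunications Research Institute (ETRI)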

