Research Trends on Inverse Reinforcement Learning

  • 이상광 (Intelligent Knowledge Content Research Section)
  • 김대욱 (Intelligent Knowledge Content Research Section)
  • 장시환 (Intelligent Knowledge Content Research Section)
  • 양성일 (Intelligent Knowledge Content Research Section)
  • Published: 2019.12.01

Abstract

Recently, reinforcement learning (RL) has expanded from research in virtual simulation environments to a wide range of applications, such as autonomous driving, natural language processing, recommendation systems, and disease diagnosis. However, RL remains difficult to apply in complex real-world environments, largely because a suitable reward function is hard to specify by hand. In contrast, inverse reinforcement learning (IRL) infers the reward function from expert demonstration data, allowing optimal policies to be obtained in a variety of situations where no explicit reward is available. In particular, IRL is expected to be a key technology for artificial general intelligence research aimed at successfully performing human intellectual tasks. In this report, we briefly summarize various IRL techniques and research directions.
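To make the idea concrete, the sketch below is a minimal, self-contained Python implementation of tabular maximum-entropy IRL in the spirit of Ziebart et al. [6], not code from this report. It alternates between computing a soft-optimal policy under the current reward estimate and nudging the reward weights so that the policy's expected feature counts match the expert's. All names (`maxent_irl`, `P`, `features`) and the toy finite-horizon, linear-reward setting are illustrative assumptions.

```python
import numpy as np

def maxent_irl(P, features, expert_trajs, gamma=0.9, epochs=100, lr=0.1, horizon=20):
    """Tabular maximum-entropy IRL (after Ziebart et al. [6]).

    P:            (A, S, S) transition probabilities, P[a, s, s'].
    features:     (S, F) state feature matrix; reward is features @ theta.
    expert_trajs: list of state-index sequences of length `horizon`.
    """
    A, S, _ = P.shape
    theta = np.zeros(features.shape[1])

    # Empirical expert feature expectations (mean feature sum per trajectory).
    mu_expert = np.mean([features[traj].sum(axis=0) for traj in expert_trajs], axis=0)

    # Empirical initial-state distribution.
    p0 = np.zeros(S)
    for traj in expert_trajs:
        p0[traj[0]] += 1.0 / len(expert_trajs)

    for _ in range(epochs):
        r = features @ theta

        # Soft value iteration: v(s) = logsumexp_a q(s, a), yielding the
        # stochastic MaxEnt-optimal policy pi(a|s) = exp(q - v).
        v = np.zeros(S)
        for _ in range(100):
            q = r[:, None] + gamma * np.einsum('asn,n->sa', P, v)
            q_max = q.max(axis=1)
            v = q_max + np.log(np.exp(q - q_max[:, None]).sum(axis=1))
        pi = np.exp(q - v[:, None])

        # Expected state visitation frequencies under the current policy.
        d, svf = p0.copy(), p0.copy()
        for _ in range(horizon - 1):
            d = np.einsum('s,sa,asn->n', d, pi, P)
            svf += d

        # MaxEnt log-likelihood gradient: expert minus policy feature counts.
        theta += lr * (mu_expert - features.T @ svf)

    return features @ theta  # recovered per-state reward
```

The update direction, expert feature expectations minus those of the current policy, is the defining gradient of the MaxEnt formulation [6]; later methods in the reference list (deep MaxEnt IRL [8], guided cost learning [9], GAIL [2]) can be read as replacing the linear reward and exact soft value iteration with neural networks and sampled rollouts.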


Funding Information

Research Project: Development of an Intelligent Game Service Platform Based on Meta-Play Recognition

Funded by: Korea Creative Content Agency (KOCCA)

References

  1. A. Attia et al., "Global overview of imitation learning," arXiv:1801.06503, 2018.
  2. J. Ho et al., "Generative adversarial imitation learning," Advances in Neural Information Processing Systems, 2016.
  3. S. Ross et al., "Efficient reductions for imitation learning," Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010.
  4. S. Ross et al., "A reduction of imitation learning and structured prediction to no-regret online learning," Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011.
  5. P. Abbeel et al., "Apprenticeship learning via inverse reinforcement learning," Proceedings of the Twenty-First International Conference on Machine Learning, 2004.
  6. B. Ziebart et al., "Maximum entropy inverse reinforcement learning," Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008.
  7. S. Levine et al., "Nonlinear inverse reinforcement learning with Gaussian processes," Advances in Neural Information Processing Systems, 2011.
  8. M. Wulfmeier et al., "Maximum entropy deep inverse reinforcement learning," arXiv:1507.04888, 2015.
  9. C. Finn et al., "Guided cost learning: Deep inverse optimal control via policy optimization," Proceedings of the Thirty-Third International Conference on Machine Learning, 2016.
  10. http://rll.berkeley.edu/gcl
  11. I. Goodfellow et al., "Generative adversarial nets," Advances in Neural Information Processing Systems, 2014.
  12. J. Schulman et al., "Trust region policy optimization," Proceedings of the Thirty-Second International Conference on Machine Learning, 2015.
  13. X. Peng et al., "Variational discriminator bottleneck: Improving imitation learning, inverse RL, and GANs by constraining information flow," Proceedings of the International Conference on Learning Representations, 2019.
  14. Y. Li et al., "InfoGAIL: Interpretable imitation learning from visual demonstrations," Advances in Neural Information Processing Systems, 2017.
  15. X. Chen et al., "InfoGAN: Interpretable representation learning by information maximizing generative adversarial nets," Advances in Neural Information Processing Systems, 2016.
  16. M. Arjovsky et al., "Wasserstein generative adversarial networks," Proceedings of the Thirty-Fourth International Conference on Machine Learning, 2017.
  17. http://torcs.sourceforge.net/