A Reinforcement Learning with CMAC

Kwon, Sung-Gyu;

doi:10.5391/IJFIS.2006.6.4.271

International Journal of Fuzzy Logic and Intelligent Systems

Volume 6 Issue 4
/
Pages.271-276
/
2006
/
1598-2645(pISSN)
/
2093-744X(eISSN)

Korean Institute of Intelligent Systems (한국지능시스템학회)

DOI QR Code

A Reinforcement Learning with CMAC

Kwon, Sung-Gyu (Faculty of Mechanical and Automotive Engineering Keimyung University)

Published : 2006.12.01

https://doi.org/10.5391/IJFIS.2006.6.4.271 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

To implement a generalization of value functions in Adaptive Search Element (ASE)-reinforcement learning, CMAC (Cerebellar Model Articulation Controller) is integrated into ASE controller. ASE-reinforcement learning scheme is briefly studied to discuss how CMAC is integrated into ASE controller. Neighbourhood Sequential Training for CMAC is utilized to establish the look-up table and to produce discrete control outputs. In computer simulation, an ASE controller and a couple of ASE-CMAC neural network are trained to balance the inverted pendulum on a cart. The number of trials until the controllers are established and the learning performance of the controllers are evaluated to find that generalization ability of the CMAC improves the speed of the ASE-reinforcement learning enough to realize the cartpole control system.

Keywords

References

L. P. Kaelbling, M. L. Littman, and A. W. Moore, 'Reinforcement Learning: A Survey,' Journal of Artificial Intelligence Research, Vol. 4, pp. 237-285, 1996
L. Zhao and Z. Liu, 'A Genetic Algorithm for Reinforcement Learning,' Proceedings of the IEEE International Conference on Neural Networks, Vol. 2, pp. 1056-1060,3-6 June 1996
A. G. Barto, R. S. Sutton, and C. W. Anderson, 'Neuronlike Adaptive Elements That Can Solve Difficult Learning Control Problems,' IEEE Transactions on Systems, Man, and Cybernetics, Vol. SMC-13, No.5, pp. 834-846, September/ October 1983 https://doi.org/10.1109/TSMC.1983.6313077
V.J. Gullapalli, A. Franklin, and H. Benbrahim, 'Acquiring Robot Skills via Reinforcement Learning,' IEEE Control Systems, pp. 13-24, February 1994
T. Kondo and K. Ito, 'A Reinforcement Learning using Adaptive State Space Construction Strategy for Real Autonomous Mobile Robots,' Proceedings of the 41st SICE Annual Conference, Vol. 5, pp. 3139-3144, 5-7 August, 2002
M. Ricordeau, 'Q-Concept-Learning: Generalization with Concept Lattice Representation in Reinforcement Learning,' Proceedings of the 15th IEEE International Conference on Tools with Artificial Intelligence, pp. 316-323, 3- 5 November 2003
D. Michie and R. A. Chambers, ''BOXES' as a Model of Pattern-Formation,' Towards a Theoretical Biology, pp. 206-215, 1968
Y. Hu and R. Fellman, 'A Hardware Efficient Implementation of a Boxes Reinforcement Learning System,' Proceedings of IEEE International Conference on Neural Networks, Vol. 4, pp. 2297-2302, 7 June - 2 July 1994
M. Han and B. Zhang, 'Control of Robotic Manipulators using a CMAC-Based Reinforcement Learning System,' Proceedings of the IEEE/RSJ/GI international Conference on Intelligent Robots and Systems 1994, Volume 3, 2-16, pp. 2117-2122, September 1994
M. T. Rosenstein and A. G. Barto, 'Reinforcement Learning with Supervision by a Stable Controller,' Proceedings of the 2004 American Control Conference, Boston, Massachusetts, pp. 4517-4522, June 30 - July 2, 2004
R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, The MIT Press, Cambridge, Massachusetts, 1998
C. Lin and H. Kim, 'CMAC-Based Adaptive Critic Self-Learning Control,' IEEE Transactions on Neural Networks, Vol. 2, No.5, pp. 530-533, September 1991 https://doi.org/10.1109/72.134290
R. S. Sutton, 'Generalization in Reinforcement Learning: Successful Examples using Sparse Coarse Coding,' Advances in Neural information Processing Systems 8, pp. 1038-1-44, 1996
X. Xu, D. Hu, and H. He, 'Accelerated Reinforcement Learning Control using Modified CMAC Neural Networks,' Proceedings of the 9th international Conference on Neural Information Processing, Vol. 5, pp. 2575-2578, 2002
Y. Wei and M. Zhao, 'Effective Strategies for Complex Skill Real-time Learning using Reinforcement Learning,' Proceedings of the 2003 IEEE international Conference on Robotics, Intelligent Systems and Signal Processing, pp. 388-392, October 2003
D. E. Thompson and S. Kwon, 'Neighborhood Sequential and Random Training Techniques for CMAC,' IEEE Transactions on Neural Networks, Vol. 6, No. 1, pp. 196-202, January 1995 https://doi.org/10.1109/72.363437
J. S. Albus, 'Data Storage in the Cerebellar Model Articulation Controller (CMAC),' Journal of Dynamic Systems, Measurement and Control, Transactions ASME, Series G, Vol. 97, No.3, pp. 228-233, September 1975 https://doi.org/10.1115/1.3426923
S. Geva and J. Sitte, 'A Cartpole Experiment Benchmark for Trainable Controllers,' IEEE Control Systems, pp. 40-51, October 1993

International Journal of Fuzzy Logic and Intelligent Systems

A Reinforcement Learning with CMAC

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)