Proceedings of the IEEK Conference (대한전자공학회 학술대회논문집)
- 2001.06c, pp. 13-16
Reinforcement learning Speedup method using Q-value Initialization
Q-value Initialization을 이용한 Reinforcement Learning Speedup Method
Abstract
In reinforcement learning, Q-learning converges quite slowly to a good policy, because searching for the goal state takes a very long time in a large stochastic domain. I therefore propose a speedup method for model-free reinforcement learning that uses Q-value initialization. The method learns a naive model of the domain and constructs boundaries around the goal state. Using these boundaries, it assigns initial Q-values to the state-action pairs and then runs Q-learning from those initial values. The initial Q-values guide the agent toward the goal state in the early stages of learning, so that Q-learning updates Q-values efficiently. The method therefore saves the exploration time spent searching for the goal state and performs better than plain Q-learning. I present the Speedup Q-learning algorithm to implement this method. The algorithm is evaluated in a grid-world domain and compared to Q-learning.
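The abstract does not give the exact boundary construction, so the following is only a minimal sketch of the idea on a deterministic grid world: the "naive model" is assumed to be a breadth-first distance map from the goal, and each state-action pair is initialized to a value that decays with the successor's distance from the goal (a hypothetical scheme; the constants, grid size, and initialization formula are all illustrative, not the paper's).

```python
import random
from collections import deque

random.seed(0)  # reproducible runs for this sketch

SIZE = 5
GOAL = (SIZE - 1, SIZE - 1)
ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # right, left, down, up
GAMMA, ALPHA, EPS = 0.9, 0.5, 0.1

def step(state, action):
    """Deterministic grid transition; reward 1 only on reaching the goal."""
    nx = min(max(state[0] + action[0], 0), SIZE - 1)
    ny = min(max(state[1] + action[1], 0), SIZE - 1)
    nxt = (nx, ny)
    return nxt, (1.0 if nxt == GOAL else 0.0), nxt == GOAL

def distance_map():
    """Hypothetical 'naive model': BFS distances from the goal state."""
    dist, frontier = {GOAL: 0}, deque([GOAL])
    while frontier:
        s = frontier.popleft()
        for a in ACTIONS:
            n, _, _ = step(s, a)
            if n not in dist:
                dist[n] = dist[s] + 1
                frontier.append(n)
    return dist

def init_q(dist):
    """Initial Q-values that grow as the action's successor nears the goal."""
    return {(s, a): GAMMA ** dist[step(s, a)[0]]
            for s in dist for a in ACTIONS}

def q_learning(q, episodes=200, max_steps=100):
    """Standard epsilon-greedy Q-learning started from the given Q-table."""
    for _ in range(episodes):
        s = (0, 0)
        for _ in range(max_steps):
            a = (random.choice(ACTIONS) if random.random() < EPS
                 else max(ACTIONS, key=lambda x: q[(s, x)]))
            nxt, r, done = step(s, a)
            target = r + (0.0 if done else
                          GAMMA * max(q[(nxt, b)] for b in ACTIONS))
            q[(s, a)] += ALPHA * (target - q[(s, a)])
            s = nxt
            if done:
                break
    return q

q_table = q_learning(init_q(distance_map()))
```

Because the initial values already point roughly toward the goal, the agent reaches it from the very first episodes instead of wandering randomly, which is the claimed source of the speedup over Q-learning started from zero values.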