Institute of Control, Robotics and Systems: Conference Proceedings
- ICCAS 2001 (Institute of Control, Robotics and Systems)
- Pages 22.2-22
- 2001
Flexible Labeling Mechanism in LQ-learning for Maze Problems
- Lee, Haeyeon (Tohoku Univ.) ;
- Hiroyuki Kamaya (Tohoku Univ.) ;
- Kenichi Abe (Tohoku Univ.)
- Published: 2001.10.01
Abstract
Recently, Reinforcement Learning (RL) methods for MDPs have been extended and applied to POMDP problems. Currently, hierarchical RL methods are widely studied. However, they have the drawback that learning time and memory are consumed merely to maintain the hierarchical structure, even when it is not necessary. On the other hand, our "Labeling Q-learning" (LQ-learning), proposed previously, has no hierarchical structure but adopts a characteristic internal memory mechanism. Namely, the LQ-learning agent perceives the state as a pair of an observation and its label, which lets the agent distinguish more exactly between states that look the same but are in fact different. So to speak, at each step t, we define a new type of perception of the environment õ_t = (o_t, l_t), where l_t is the label attached to the observation o_t.
Keywords
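The labeled-perception idea described in the abstract can be sketched as follows: a tabular Q-learner whose state is the pair (observation, label) rather than the raw observation, so that two aliased observations with different labels get distinct Q-values. This is only an illustrative sketch; the class name, parameters, and in particular the label-update rule are assumptions for illustration, since the abstract does not specify the paper's actual labeling mechanism.

```python
import random
from collections import defaultdict

class LQAgent:
    """Sketch of a Q-learning agent whose perceived state is the
    pair (observation, label), in the spirit of LQ-learning.
    The label-update rule is a placeholder, not the paper's method."""

    def __init__(self, actions, alpha=0.1, gamma=0.9, epsilon=0.1):
        self.q = defaultdict(float)   # Q[((obs, label), action)]
        self.actions = actions
        self.alpha = alpha
        self.gamma = gamma
        self.epsilon = epsilon
        self.label = 0                # internal memory: the current label

    def perceive(self, obs):
        # New type of perception: pair of observation and its label.
        return (obs, self.label)

    def act(self, state):
        # Epsilon-greedy action selection over the augmented state.
        if random.random() < self.epsilon:
            return random.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def update(self, state, action, reward, next_obs):
        # Standard one-step Q-learning update on (obs, label) states.
        next_state = (next_obs, self.label)
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        td_error = reward + self.gamma * best_next - self.q[(state, action)]
        self.q[(state, action)] += self.alpha * td_error
        return next_state
```

Because the Q-table is keyed by (observation, label), two maze cells that produce the same local observation but carry different labels are treated as distinct states, which is how the labeling mechanism resolves perceptual aliasing.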