人工知能学会論文誌
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
論文
POMDPs環境のための報酬獲得効率に基づく強化学習法
河合 宏和上野 敦志辰巳 昭治
著者情報
ジャーナル フリー

2008 年 23 巻 1 号 p. 1-12

詳細
抄録

Reinforcement Learning (RL) methods are very hopeful because they can learn useful behavior based on rewards from environment by trial and error. This paper tackles more difficult problems than the ones tackled by many ordinary RL methods: RL in POMDP (Partially Observable Markov Decision Process) environments with multiple rewards.

著者関連情報
© 2008 JSAI (The Japanese Society for Artificial Intelligence)
次の記事
feedback
Top