合理的政策形成アルゴリズムの連続値入力への拡張

書誌事項

タイトル別名
  • An Extension of the Rational Policy Making algorithm to Continuous State Spaces
  • ゴウリテキ セイサク ケイセイ アルゴリズム ノ レンゾクチ ニュウリョク エノ カクチョウ

この論文をさがす

抄録

Reinforcement Learning is a kind of machine learning. We know Profit Sharing, the Rational Policy Making algorithm (RPM), the Penalty Avoiding Rational Policy Making algorithm and PS-r* to guarantee the rationality in a typical class of the Partially Observable Markov Decision Processes. However they cannot treat continuous state spaces. In this paper, we present a solution to adapt them in continuous state spaces. We give RPM a mechanism to treat continuous state spaces in the environment that has the same type of a reward. We show the effectiveness of the proposed method in numerical examples.

収録刊行物

被引用文献 (1)*注記

もっと見る

参考文献 (23)*注記

もっと見る

関連プロジェクト

もっと見る

詳細情報

問題の指摘

ページトップへ