A Study of the Reinforcement Function in the Profit Sharing Method

Transactions of the Japanese Society for Artificial Intelligence 19:197-203 (2004)

Abstract

In this paper, we consider profit sharing, one of the reinforcement learning methods. An agent learns a candidate solution to a problem from the reward that it receives from the environment if and only if it reaches the destination state. The function that distributes the received reward among the actions of the candidate solution is called the reinforcement function. In this learning system, the agent reinforces the set of selected actions when it receives the reward, but it should not reinforce detour actions. First, we propose a new constraint equation on reinforcement functions so that the reinforcement values are distributed to the non-detour actions. If the reinforcement function satisfies this constraint equation, the agent can select the non-detour actions that lead to the destination state. Next, we show that, to suppress the selection of detour actions, the reinforcement function can be constant after the learning process. Lastly, in computer simulations of maze problems, we show that the learning performance of the agents does not depend on the size of the environment.
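The credit assignment described in the abstract can be illustrated with a minimal sketch. The abstract does not state the proposed constraint equation, so the geometrically decreasing reinforcement function below is only an assumed stand-in, and the names `geometric_reinforcement`, `reinforce_episode`, `decay`, and the toy episode are hypothetical.

```python
from collections import defaultdict


def geometric_reinforcement(reward, steps_before_goal, decay):
    """Credit for the action taken `steps_before_goal` steps before the reward.

    Assumption: a geometrically decreasing function, not the paper's own
    constraint equation (which the abstract does not give).
    """
    return reward * decay ** steps_before_goal


def reinforce_episode(weights, episode, reward, decay=0.25):
    """Distribute `reward` backwards over the (state, action) pairs of one episode."""
    last = len(episode) - 1
    for t, rule in enumerate(episode):
        weights[rule] += geometric_reinforcement(reward, last - t, decay)
    return weights


# Hypothetical episode: from state "A" the agent first takes a detour
# ("left", then "back" from "B" to "A") and only then moves "right" to the goal.
weights = defaultdict(float)
episode = [("A", "left"), ("B", "back"), ("A", "right")]
reinforce_episode(weights, episode, reward=1.0, decay=0.25)

for rule, value in weights.items():
    print(rule, value)
```

In this toy episode the detour action at state "A" receives far less credit than the action that leads directly to the goal, which is the qualitative behaviour the abstract asks of a reinforcement function. The decay value 0.25 is an arbitrary choice for illustration.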


Similar books and articles

A Profit Sharing Method That Does Not Cling to Experience. Ueno Atsushi Uemura Wataru - 2006 - Transactions of the Japanese Society for Artificial Intelligence 21:81-93.
How to profit from profit sharing. J. Bell & D. Wray - 1989 - Business and Society Review 68:57-60.
Profit-sharing and industrial peace. Arthur O. Lovejoy - 1921 - International Journal of Ethics 31 (3):241-263.
Profit Sharing with a Method for Detecting Imperfect Perception. Masuda Shiro Saito Ken - 2004 - Transactions of the Japanese Society for Artificial Intelligence 19:379-388.
Extending Profit Sharing to Environments with Imperfect Perception: Proposal and Evaluation of PS-r^*. Kobayashi Shigenobu Miyazaki Kazuteru - 2003 - Transactions of the Japanese Society for Artificial Intelligence 18:286-296.
Profit: Some moral reflections. Paul F. Camenisch - 1987 - Journal of Business Ethics 6 (3):225-231.
The self-dual serial cost-sharing rule. M. J. Albizuri - 2010 - Theory and Decision 69 (4):555-567.
Insights from ifaluk: Food sharing among cooperative fishers. Richard Sosis - 2004 - Behavioral and Brain Sciences 27 (4):568-569.
What Should Be the Data Sharing Policy of Cognitive Science? Mark A. Pitt & Yun Tang - 2013 - Topics in Cognitive Science 5 (1):214-221.

Analytics

Added to PP
2014-03-21
