$Q(s_t, a_t) := (1 - \alpha)\,Q(s_t, a_t) + \alpha\,(r(s_{t+1}) \dots$

Abstract

Straightforward reinforcement learning in multi-agent co-learning settings often results in poor outcomes. Meta-learning processes beyond straightforward reinforcement learning may be necessary to achieve good (or optimal) outcomes. We describe algorithmic processes of meta-learning, or "manipulation", which are a cognitively realistic and effective means of learning cooperation, and we discuss various "manipulation" routines that improve multi-agent co-learning. We hope to develop better adaptive means of multi-agent cooperation that require no a priori knowledge, and to advance multi-agent co-learning beyond existing theories and techniques.
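
As an illustration of what "straightforward reinforcement learning" could look like for a single agent, the following is a minimal sketch of one tabular Q-learning step. It is not taken from the work itself: the function name `q_update`, the learning rate `alpha`, the discount factor `gamma`, the state labels, and the action names are all hypothetical, and the discounted best next-state value in the target is the textbook Q-learning form, assumed here rather than confirmed by the source.

```python
from collections import defaultdict


def q_update(q, state, action, reward, next_state, actions, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step: Q := (1 - alpha) * Q + alpha * target."""
    # Assumption: the target is the reward plus the discounted value of the
    # best action in the next state (standard Q-learning, not from the source).
    best_next = max(q[(next_state, a)] for a in actions)
    target = reward + gamma * best_next
    # Blend the old estimate with the new target using the learning rate.
    q[(state, action)] = (1 - alpha) * q[(state, action)] + alpha * target
    return q[(state, action)]


# Hypothetical usage: unseen state-action pairs default to a value of 0.0.
q = defaultdict(float)
actions = ["cooperate", "defect"]
first = q_update(q, "s0", "cooperate", reward=1.0, next_state="s1", actions=actions)
second = q_update(q, "s0", "cooperate", reward=1.0, next_state="s1", actions=actions)
```

In a co-learning setting each agent would keep its own table and apply this step independently, which is the baseline the abstract argues may be insufficient on its own.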

Similar books and articles

A model for updates in a multi-agent setting. John Cantwell - 2007 - Journal of Applied Non-Classical Logics 17 (2):183-196.
Learning to cooperate: Reciprocity and self-control. Peter Danielson - 2002 - Behavioral and Brain Sciences 25 (2):256-257.

Analytics

Added to PP
2012-09-05

