Q(st at):= (I Ã¢â‚¬â€ o')Q(st at) + o'(r(st+1)

Ron Sun

Q(st at):= (I Ã¢â¬â o')Q(st at) + o'(r(st+1)

Abstract

Straightforward reinforcement learning for multi-agent co-learning settings often results in poor outcomes. Meta-learning processes beyond straightforward reinforcement learning may be necessary to achieve good (or optimal) outcomes. Algorithmic processes of meta-learning, or "manipulation", will be described, which is a cognitively realistic and effective means for learning cooperation. We will discuss various "manipulation" routines that address the issue of improving multi-agent co-learning. We hope to develop better adaptive means of multi-agent cooperation, without requiring a priori knowledge, and advance multi-agent co-learning beyond existing theories and techniques

Cite

Plain text

BibTeX

Formatted text

Zotero

EndNote

Reference Manager

RefWorks

Options

Mark as duplicate

Find it on Scholar

Request removal from index

Revision history

Edit

Keywords

Add keywords

Reprint years

My notes

Similar books and articles

Integrating reinforcement learning, bidding and genetic algorithms.Ron Sun - unknown

Automatic Partitioning for Multi-Agent Reinforcement Learning.Ron Sun - unknown

An evolutionary game theoretic perspective on learning in multi-agent systems.Karl Tuyls, Ann Nowe, Tom Lenaerts & Bernard Manderick - 2004 - Synthese 139 (2):297 - 330.

Bidding in Reinforcement Learning: A Paradigm for Multi-Agent Systems.Chad Sessions - unknown

Individual action and collective function: From sociology to multi-agent learning.Ron Sun - manuscript

Multi-Agent Reinforcement Learning: Weighting and Partitioning.Ron Sun & Todd Peterson - unknown

Spontaneous coordination and evolutionary learning processes in an agent-based model.Pierre Barbaroux & Gilles Enée - 2005 - Mind and Society 4 (2):179-195.

A model for updates in a multi-agent setting.John Cantwell - 2007 - Journal of Applied Non-Classical Logics 17 (2):183-196.

Beyond simple rule extraction: The extraction of planning knowledge from reinforcement learners.Ron Sun - unknown

Bottom-up skill learning in reactive sequential decision tasks.Ron Sun, Todd Peterson & Edward Merrill - unknown

Learning to cooperate: Reciprocity and self-control.Peter Danielson - 2002 - Behavioral and Brain Sciences 25 (2):256-257.

Knowledge extraction from reinforcement learning.Ron Sun - unknown

Learning to plan probabilistically from neural networks.R. Sun - unknown

Learning with neighbours: Emergence of convention in a society of learning agents.Roland Mühlenbernd - 2011 - Synthese 183 (S1):87-109.

Analytics

Added to PP
2012-09-05

Downloads
6 (#1,425,536)

6 months
1 (#1,533,009)

Historical graph of downloads

How can I increase my downloads?

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...

Q(st at):= (I Ã¢â¬â o')Q(st at) + o'(r(st+1)

Abstract

Categories

Keywords

Reprint years

Links

PhilArchive

External links

Through your library

My notes

Similar books and articles

Analytics

Citations of this work

References found in this work

Q(st at):= (I Ã¢â¬â o')Q(st at) + o'(r(st+1)

Abstract

Categories

Keywords

Reprint years

Links

PhilArchive

External links

Through your library

My notes

Similar books and articles

Analytics

Citations of this work

References found in this work

Q(st at):= (I Ã¢â¬â o')Q(st at) + o'(r(st+1)