Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance

Artificial Intelligence 225 (C):24-50 (2015)
  Copy   BIBTEX

Abstract

This article has no associated abstract. (fix it)

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 93,867

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Model-based average reward reinforcement learning.Prasad Tadepalli & DoKyeong Ok - 1998 - Artificial Intelligence 100 (1-2):177-224.
Profit Sharing 法における強化関数に関する一考察.Tatsumi Shoji Uemura Wataru - 2004 - Transactions of the Japanese Society for Artificial Intelligence 19:197-203.
The relation of secondary reward to gradients of reinforcement.Charles C. Perkins Jr - 1947 - Journal of Experimental Psychology 37 (5):377.
罰を回避する合理的政策の学習.坪井 創吾 宮崎 和光 - 2001 - Transactions of the Japanese Society for Artificial Intelligence 16 (2):185-192.

Analytics

Added to PP
2020-12-22

Downloads
11 (#1,146,652)

6 months
9 (#437,808)

Historical graph of downloads
How can I increase my downloads?