Minds and Machines 30 (2):219-246 (2020)

Authors
Julia Haas
Rhodes College
Abstract
I describe a suite of reinforcement learning environments in which artificial agents learn to value and respond to moral content and contexts. I illustrate the core principles of the framework by characterizing one such environment, or “gridworld,” in which an agent learns to trade off monetary profit against fair dealing, as applied in a standard behavioral economic paradigm. I then highlight the core technical and philosophical advantages of the learning approach for modeling moral cognition and for addressing the so-called value alignment problem in AI.
DOI 10.1007/s11023-020-09524-9
Added to PP index
2020-04-25
