Enforcing ethical goals over reinforcement-learning policies

Ethics and Information Technology 24 (4):1-19 (2022)

Abstract

Recent years have yielded many discussions on how to endow autonomous agents with the ability to make ethical decisions, and the need for explicit ethical reasoning and transparency is a persistent theme in this literature. We present a modular and transparent approach to equip autonomous agents with the ability to comply with ethical prescriptions while still enacting pre-learned optimal behaviour. Our approach relies on a normative supervisor module that integrates a theorem prover for defeasible deontic logic within the control loop of a reinforcement learning agent. The supervisor operates as both an event recorder and an on-the-fly compliance checker with respect to an external norm base. We successfully evaluated our approach with several tests using variations of the game Pac-Man, subject to a variety of “ethical” constraints.
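
The control-loop idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the theorem prover for defeasible deontic logic is replaced here by plain Python predicates, and every name (`NormativeSupervisor`, `compliant_actions`, `select_action`, the grid-position state) is hypothetical. The fallback used when no action is compliant is likewise an assumption made only to keep the example runnable.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical types: a state is a grid position, an action is a label.
State = Tuple[int, int]
Action = str

# Assumption: a "norm" is a predicate that returns True when an action
# is forbidden in a state. The paper uses a theorem prover instead.
Norm = Callable[[State, Action], bool]


class NormativeSupervisor:
    """Stand-in for the supervisor: records events and filters actions."""

    def __init__(self, norm_base: List[Norm]) -> None:
        self.norm_base = norm_base
        self.event_log: List[Tuple[State, Tuple[Action, ...]]] = []

    def compliant_actions(self, state: State, actions: List[Action]) -> List[Action]:
        # Keep only the actions that no norm in the norm base forbids.
        allowed = [
            a for a in actions
            if not any(norm(state, a) for norm in self.norm_base)
        ]
        # Event-recorder role: log the state and the compliant action set.
        self.event_log.append((state, tuple(allowed)))
        return allowed


def select_action(
    q_values: Dict[Action, float],
    state: State,
    supervisor: NormativeSupervisor,
) -> Action:
    """Pick the highest-valued action among those the supervisor allows."""
    actions = list(q_values)
    allowed = supervisor.compliant_actions(state, actions)
    # Assumption: if nothing is compliant, fall back to the full action set.
    candidates = allowed or actions
    return max(candidates, key=lambda a: q_values[a])


# Usage: the pre-learned policy prefers "north", but a norm forbids it.
q = {"north": 1.0, "south": 0.5}
supervisor = NormativeSupervisor([lambda s, a: a == "north"])
chosen = select_action(q, (0, 0), supervisor)
```

The design point of the sketch is that the learned values in `q` are never modified: the supervisor only narrows the set of actions the policy may choose from, which keeps the norm base external and swappable.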

Similar books and articles

Just consequentialism and computing. James H. Moor - 1999 - Ethics and Information Technology 1 (1):61-65.
Editorial: Ethical reflections on the virtual frontier. [REVIEW] Lucas D. Introna - 2000 - Ethics and Information Technology 2 (1):1-2.
Why a treaty on autonomous weapons is necessary and feasible. Daan Kayser - 2023 - Ethics and Information Technology 25 (2):1-5.

Analytics

Added to PP
2022-09-30

Downloads
16 (#906,830)

6 months
7 (#592,070)
