Toward a Psychology of Deep Reinforcement Learning Agents Using a Cognitive Architecture

Topics in Cognitive Science 14 (4):756-779 (2022)
  Copy   BIBTEX

Abstract

We argue that cognitive models can provide a common ground between human users and deep reinforcement learning (Deep RL) algorithms for purposes of explainable artificial intelligence (AI). Casting both the human and learner as cognitive models provides common mechanisms to compare and understand their underlying decision-making processes. This common grounding allows us to identify divergences and explain the learner's behavior in human understandable terms. We present novel salience techniques that highlight the most relevant features in each model's decision-making, as well as examples of this technique in common training environments such as Starcraft II and an OpenAI gridworld.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 76,419

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Analytics

Added to PP
2021-09-02

Downloads
2 (#1,402,744)

6 months
1 (#452,962)

Historical graph of downloads

Sorry, there are not enough data points to plot this chart.
How can I increase my downloads?