Toward a Psychology of Deep Reinforcement Learning Agents Using a Cognitive Architecture

Topics in Cognitive Science 14 (4):756-779 (2022)


We argue that cognitive models can provide a common ground between human users and deep reinforcement learning (Deep RL) algorithms for purposes of explainable artificial intelligence (AI). Casting both the human and the learner as cognitive models provides common mechanisms for comparing and understanding their underlying decision-making processes. This common grounding allows us to identify divergences and explain the learner's behavior in human-understandable terms. We present novel salience techniques that highlight the most relevant features in each model's decision-making, along with examples of these techniques in common training environments such as StarCraft II and an OpenAI gridworld.


