Toward a Psychology of Deep Reinforcement Learning Agents Using a Cognitive Architecture
Konstantinos Mitsopoulos, Sterling Somers, Joel Schooler, Christian Lebiere, Peter Pirolli & Robert Thomson
Topics in Cognitive Science 14 (4):756-779 (2022)
Abstract
We argue that cognitive models can provide a common ground between human users and deep reinforcement learning (Deep RL) algorithms for purposes of explainable artificial intelligence (AI). Casting both the human and learner as cognitive models provides common mechanisms to compare and understand their underlying decision-making processes. This common grounding allows us to identify divergences and explain the learner's behavior in human-understandable terms. We present novel salience techniques that highlight the most relevant features in each model's decision-making, as well as examples of this technique in common training environments such as StarCraft II and an OpenAI gridworld.
DOI: 10.1111/tops.12573
Similar books and articles
Counterfactual state explanations for reinforcement learning agents via generative deep learning. Matthew L. Olson, Roli Khanna, Lawrence Neal, Fuxin Li & Weng-Keen Wong - 2021 - Artificial Intelligence 295 (C):103455.
What Is the Model in Model‐Based Planning? Thomas Pouncy, Pedro Tsividis & Samuel J. Gershman - 2021 - Cognitive Science 45 (1):e12928.
SAwSu: An Integrated Model of Associative and Reinforcement Learning. Vladislav D. Veksler, Christopher W. Myers & Kevin A. Gluck - 2014 - Cognitive Science 38 (3):580-598.
The Archimedean trap: Why traditional reinforcement learning will probably not yield AGI. Samuel Allen Alexander - 2020 - Journal of Artificial General Intelligence 11 (1):70-85.
Interestingness elements for explainable reinforcement learning: Understanding agents' capabilities and limitations. Pedro Sequeira & Melinda Gervasio - 2020 - Artificial Intelligence 288:103367.
Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs. Finale Doshi-Velez, Joelle Pineau & Nicholas Roy - 2012 - Artificial Intelligence 187-188 (C):115-132.
The evolution of a cognitive architecture for emotional learning from a modulon structured genome. Stevo Bozinovski & Liljana Bozinovska - 2008 - Journal of Mind and Behavior 29 (1-2):195-216.
Predictive Movements and Human Reinforcement Learning of Sequential Action. Roy de Kleijn, George Kachergis & Bernhard Hommel - 2018 - Cognitive Science 42 (S3):783-808.
When, What, and How Much to Reward in Reinforcement Learning-Based Models of Cognition. Christian P. Janssen & Wayne D. Gray - 2012 - Cognitive Science 36 (2):333-358.
The Outcome‐Representation Learning Model: A Novel Reinforcement Learning Model of the Iowa Gambling Task. Nathaniel Haines, Jasmin Vassileva & Woo‐Young Ahn - 2018 - Cognitive Science 42 (8):2534-2561.
A real‐world rational agent: unifying old and new AI. Paul F. M. J. Verschure & Philipp Althaus - 2003 - Cognitive Science 27 (4):561-590.
Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning. Rui Wang, Xianghua Gan, Qing Li & Xiao Yan - 2021 - Complexity 2021:1-17.
The Role of Basal Ganglia Reinforcement Learning in Lexical Ambiguity Resolution. Jose M. Ceballos, Andrea Stocco & Chantel S. Prat - 2020 - Topics in Cognitive Science 12 (1):402-416.
Analytics
Added to PP: 2021-09-02
Downloads: 2 (#1,402,744)
Downloads, last 6 months: 1 (#452,962)
References found in this work
An Integrated Theory of the Mind. John R. Anderson, Daniel Bothell, Michael D. Byrne, Scott Douglass, Christian Lebiere & Yulin Qin - 2004 - Psychological Review 111 (4):1036-1060.
Instance-based learning: Integrating sampling and repeated decisions from experience. Cleotilde Gonzalez & Varun Dutt - 2011 - Psychological Review 118 (4):523-551.
Instance‐based learning in dynamic decision making. Cleotilde Gonzalez, Javier F. Lerch & Christian Lebiere - 2003 - Cognitive Science 27 (4):591-635.
Toward Personalized Deceptive Signaling for Cyber Defense Using Cognitive Models. Edward A. Cranford, Cleotilde Gonzalez, Palvi Aggarwal, Sarah Cooney, Milind Tambe & Christian Lebiere - 2020 - Topics in Cognitive Science 12 (3):992-1011.