Explaining Machine Learning Decisions

Philosophy of Science 89 (1):1-19 (2022)
  Copy   BIBTEX

Abstract

The operations of deep networks are widely acknowledged to be inscrutable. The growing field of Explainable AI has emerged in direct response to this problem. However, owing to the nature of the opacity in question, XAI has been forced to prioritise interpretability at the expense of completeness, and even realism, so that its explanations are frequently interpretable without being underpinned by more comprehensive explanations faithful to the way a network computes its predictions. While this has been taken to be a shortcoming of the field of XAI, I argue that it is broadly the right approach to the problem.

Other Versions

No versions found

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 96,395

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Allure of Simplicity.Thomas Grote - 2023 - Philosophy of Medicine 4 (1).
What is Interpretability?Adrian Erasmus, Tyler D. P. Brunet & Eyal Fisher - 2021 - Philosophy and Technology 34:833–862.
SIDEs: Separating Idealization from Deceptive ‘Explanations’ in xAI.Emily Sullivan - forthcoming - Proceedings of the 2024 Acm Conference on Fairness, Accountability, and Transparency.

Analytics

Added to PP
2022-04-07

Downloads
196 (#112,113)

6 months
57 (#101,722)

Historical graph of downloads
How can I increase my downloads?

Author's Profile

John Zerilli
University of Edinburgh

Citations of this work

Cultural Bias in Explainable AI Research.Uwe Peters & Mary Carman - forthcoming - Journal of Artificial Intelligence Research.
ML interpretability: Simple isn't easy.Tim Räz - 2024 - Studies in History and Philosophy of Science Part A 103 (C):159-167.
Explainability, Public Reason, and Medical Artificial Intelligence.Michael Da Silva - 2023 - Ethical Theory and Moral Practice 26 (5):743-762.

View all 10 citations / Add more citations

References found in this work

Real patterns.Daniel C. Dennett - 1991 - Journal of Philosophy 88 (1):27-51.
Intentional systems.Daniel C. Dennett - 1971 - Journal of Philosophy 68 (February):87-106.
Principles of categorization [Електронний ресурс]/Eleonora Rosch.E. Rosch - 1978 - In Eleanor Rosch & Barbara Bloom Lloyd (eds.), Cognition and Categorization. Lawrence Elbaum Associates.
Artificial intelligence—A personal view.David Marr - 1977 - Artificial Intelligence 9 (September):37-48.
Intentional Systems Theory.Daniel Dennett - 2007 - In Brian P. McLaughlin, Ansgar Beckermann & Sven Walter (eds.), The Oxford handbook of philosophy of mind. New York: Oxford University Press.

View all 9 references / Add more references