Justifying our Credences in the Trustworthiness of AI Systems: A Reliabilistic Approach

Abstract

We address an open problem in the epistemology of artificial intelligence (AI), namely, the justification of the epistemic attitudes we have towards the trustworthiness of AI systems. We start from a key consideration: the trustworthiness of an AI is a time-relative property of the system, with two distinct facets. One is the actual trustworthiness of the AI, and the other is the perceived trustworthiness of the system as assessed by its users while interacting with it. We show that credences, namely, beliefs we hold with a degree of confidence, are the appropriate attitude for capturing the facets of trustworthiness of an AI over time. Then, we introduce a reliabilistic account providing justification to the credence in the trustworthiness of AI, which we derive from Tang’s probabilistic theory of justified credence. Our account stipulates that a credence in the trustworthiness of an AI system is justified if and only if it is caused by an assessment process that tends to result in a high proportion of credences for which the actual and perceived trustworthiness of the AI are calibrated. Our approach informs research on human-AI interactions and trustworthy AI by providing actionable recommendations on how to measure the reliability of the process through which users perceive the trustworthiness of the system and its calibration to the actual levels of trustworthiness of the AI. It also allows investigating the relation between reliability and the appropriate reliance on the system.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 91,928

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

  • Only published works are available at libraries.

Similar books and articles

What is trustworthiness?Christoph Kelp & Mona Simion - 2023 - Noûs 57 (3):667-683.
Xin: Being Trustworthy.Winnie Sung - 2020 - International Philosophical Quarterly 60 (3):271-286.
Trustworthy artificial intelligence.Mona Simion & Christoph Kelp - 2020 - Asian Journal of Philosophy 2 (1):1-12.
e-Trust and reputation.Thomas W. Simpson - 2011 - Ethics and Information Technology 13 (1):29-38.
On the Roles of Trustworthiness and Acceptance.Marian David - 1991 - Grazer Philosophische Studien 40 (1):93-107.
On the Roles of Trustworthiness and Acceptance.Marian David - 1991 - Grazer Philosophische Studien 40 (1):93-107.
Trust and Trustworthiness.Stephen Wright - 2010 - Philosophia 38 (3):615-627.
Trustworthiness of autonomous systems.S. Kate Devitt - 2018 - In Hussein A. Abbass, Jason Scholz & Darryn Reid (eds.), Foundations of Trusted Autonomous Systems. Springer. pp. 161-184.
Stand up for trustworthiness.Frank Murphy - 2019 - Ann Arbor: Cherry Lake Publishing.

Analytics

Added to PP
2023-08-11

Downloads
31 (#516,123)

6 months
11 (#237,740)

Historical graph of downloads
How can I increase my downloads?

Author's Profile

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references