Abstract
When discussing AI alignment, we usually refer to the problem of teaching or training advanced autonomous AI systems to make decisions that are aligned with human values or preferences. Proponents of this approach believe it can be employed as a means to retain control over sophisticated intelligent systems, thus avoiding certain existential risks. We identify three general obstacles on the path to implementing value alignment: a technological/technical obstacle, a normative obstacle, and a calibration problem. Presupposing, for the purposes of this discussion, that the technical and normative problems are solved, we focus on the problem of how to calibrate a system, for a specific value, to sit at a specific point on a spectrum stretching between righteous behavior and normal or average human behavior. Calibration, or more precisely miscalibration, also raises the issue of trustworthiness: if we cannot trust AI systems to perform tasks the way we intend, we will not use them on our roads or in our homes. In an era in which we strive to construct autonomous machines endowed with common sense, reasoning abilities, and a connection to the world, so that they can act in alignment with human values, such miscalibrations can make the difference between trustworthy and untrustworthy systems.