The Ghost in the Machine has an American accent: value conflict in GPT-3

Rebecca Johnson; Giada Pistilli; Natalia Menedez-Gonzalez; Leslye Denisse Dias Duran; Enrico Panai; Julija Kalpokiene; Donald Jay Bertulfo

Abstract

Work on the alignment problem in the context of large language models (LLMs) must consider the plurality of human values in our world. Whilst there are many resonant and overlapping values amongst the world’s cultures, there are also many conflicting, yet equally valid, values. It is important to observe which cultural values a model exhibits, particularly when there is a value conflict between input prompts and generated outputs. We discuss how the co-creation of language and cultural value impacts LLMs. We explore the constitution of the training data for GPT-3 and compare it to the world’s language and internet access demographics, as well as to reported statistical profiles of dominant values in some nation-states. We stress-tested GPT-3 with a range of value-rich texts representing several languages and nations, including some with values orthogonal to dominant US public opinion as reported by the World Values Survey. We observed when values embedded in the input text were mutated in the generated outputs and noted when these conflicting values were more aligned with reported dominant US values. Our discussion of these results uses a moral value pluralism (MVP) lens to better understand these value mutations. Finally, we provide recommendations for how our work may contribute to other current work in the field.

Author Profiles

Natalia Gonzalez

Giada Pistilli

Sorbonne Université

Leslye Denisse Dias Duran

Universidad Pontificia Bolivariana

Keywords

Artificial Intelligence - Value Pluralism - AI Ethics - Natural Language Processing

Analytics

Added to PP
2022-05-18

Downloads
191 (#105,291)

6 months
102 (#52,796)

Citations of this work

No citations found.

References found in this work

Women, Fire, and Dangerous Things: What Categories Reveal about the Mind. George Lakoff - 1987 - Philosophy and Rhetoric 22 (4):299-302.

Artificial Intelligence, Values, and Alignment. Iason Gabriel - 2020 - Minds and Machines 30 (3):411-437.

Mortal Questions. Thomas Nagel - 1980 - Critica 12 (34):125-133.

Word vector embeddings hold social ontological relations capable of reflecting meaningful fairness assessments. Ahmed Izzidien - 2022 - AI and Society 37 (1):299-318.

The logic of judgments of practise. John Dewey - 1915 - Journal of Philosophy, Psychology and Scientific Methods 12 (19):505-523.
