Turning biases into hypotheses through method: A logic of scientific discovery for machine learning

Big Data and Society 8 (1) (2021)
  Copy   BIBTEX

Abstract

Machine learning systems have shown great potential for performing or supporting inferential reasoning through analyzing large data sets, thereby potentially facilitating more informed decision-making. However, a hindrance to such use of ML systems is that the predictive models created through ML are often complex, opaque, and poorly understood, even if the programs “learning” the models are simple, transparent, and well understood. ML models become difficult to trust, since lay-people, specialists, and even researchers have difficulties gauging the reasonableness, correctness, and reliability of the inferences performed. In this article, we argue that bridging this gap in the understanding of ML models and their reasonableness requires a focus on developing an improved methodology for their creation. This process has been likened to “alchemy” and criticized for involving a large degree of “black art,” owing to its reliance on poorly understood “best practices”. We soften this critique and argue that the seeming arbitrariness often is the result of a lack of explicit hypothesizing stemming from an empiricist and myopic focus on optimizing for predictive performance rather than from an occult or mystical process. We present some of the problems resulting from the excessive focus on optimizing generalization performance at the cost of hypothesizing about the selection of data and biases. We suggest embedding ML in a general logic of scientific discovery similar to the one presented by Charles Sanders Peirce, and present a recontextualized version of Peirce’s scientific hypothesis adjusted to ML.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 93,612

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Predicting and explaining with machine learning models: Social science as a touchstone.Oliver Buchholz & Thomas Grote - 2023 - Studies in History and Philosophy of Science Part A 102 (C):60-69.
Do ML models represent their targets?Emily Sullivan - forthcoming - Philosophy of Science.
What is it for a Machine Learning Model to Have a Capability?Jacqueline Harding & Nathaniel Sharadin - forthcoming - British Journal for the Philosophy of Science.
What kind of novelties can machine learning possibly generate? The case of genomics.Emanuele Ratti - 2020 - Studies in History and Philosophy of Science Part A 83:86-96.

Analytics

Added to PP
2021-07-01

Downloads
14 (#996,581)

6 months
9 (#439,104)

Historical graph of downloads
How can I increase my downloads?

Citations of this work

No citations found.

Add more citations

References found in this work

The inference to the best explanation.Gilbert H. Harman - 1965 - Philosophical Review 74 (1):88-95.
Peirce's theory of abduction.K. T. Fann - 1970 - The Hague,: Martinus Nijhoff.
Patterns of abduction.Gerhard Schurz - 2008 - Synthese 164 (2):201-234.

View all 15 references / Add more references