Data mining: a reconsideration

Journal of Economic Methodology 7 (2):183-194 (2000)

Abstract

Data mining occurs because most economic hypotheses do not have a unique empirical interpretation but allow the econometrician much leeway in selecting conditioning variables, lags, functional forms, and sometimes the sample. The resulting problems are of interest not only to methodologists and philosophers concerned with how hypotheses are validated in the presence of some inevitable ad hocery but, also to readers of economics journals who have no interest in methodology but need to know whether to believe what they read. Since I focus on such mundane problems I make no claim of contributing to the deeper epistemological problems of relating empirical evidence to theory, and to the meaning of confirmation and disconfirmation when, say five of the eight specifications tested are consistent with the hypothesis and, three are not. Instead, I deal with a practical problem confronting a researcher who wants to persuade his readers but does not want to deceive them. He has fitted many regressions with varying results. How should he decide how many and which ones to report? This paper is therefore more about a problem in communicating results than about a problem in the philosophy of science. Hence, I use some common-sense notions, even though I cannot provide rigorous definitions for them

Download options

PhilArchive



    Upload a copy of this work     Papers currently archived: 72,766

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Analytics

Added to PP
2012-02-20

Downloads
20 (#563,403)

6 months
1 (#386,989)

Historical graph of downloads
How can I increase my downloads?

References found in this work

Error and the Growth of Experimental Knowledge.Deborah Mayo - 1996 - International Studies in the Philosophy of Science 15 (1):455-459.

Add more references

Citations of this work

Revisiting Data Mining: 'Hunting' with or Without a License.Aris Spanos - 2000 - Journal of Economic Methodology 7 (2):231-264.
Introduction: Is Data Mining a Methodological Problem?Roger E. Backhouse & Mary S. Morgan - 2000 - Journal of Economic Methodology 7 (2):171-181.

Add more citations

Similar books and articles

Introduction: Is Data Mining a Methodological Problem?Roger E. Backhouse & Mary S. Morgan - 2000 - Journal of Economic Methodology 7 (2):171-181.
Three Attitudes Towards Data Mining.Kevin D. Hoover & Stephen J. Perez - 2000 - Journal of Economic Methodology 7 (2):195-210.
Data Mining: Proprietary Rights, People and Proposals.Dinah Payne & Cherie Courseault Trumbach - 2009 - Business Ethics, the Environment and Responsibility 18 (3):241-252.
Revisiting Data Mining: 'Hunting' with or Without a License.Aris Spanos - 2000 - Journal of Economic Methodology 7 (2):231-264.
Informational Privacy, Data Mining, and the Internet.Herman T. Tavani - 1999 - Ethics and Information Technology 1 (2):137-145.
KDD, Data Mining, and the Challenge for Normative Privacy.Herman T. Tavani - 1999 - Ethics and Information Technology 1 (4):265-273.
Ethical Issues in Web Data Mining.Lita van Wel & Lambèr Royakkers - 2004 - Ethics and Information Technology 6 (2):129-140.
“Explain” in Scientific Discourse.James A. Overton - 2013 - Synthese 190 (8):1383-1405.