David Bourget (Western Ontario)
David Chalmers (ANU, NYU)
Rafael De Clercq
Jack Alan Reynolds
Learn more about PhilPapers
Journal of Economic Methodology 7 (2):195-210 (2000)
'Data mining' refers to a broad class of activities that have in common, a search over different ways to process or package data statistically or econometrically with the purpose of making the final presentation meet certain design criteria. We characterize three attitudes toward data mining: first, that it is to be avoided and, if it is engaged in, that statistical inferences must be adjusted to account for it; second, that it is inevitable and that the only results of any interest are those that transcend the variety of alternative data mined specifications (a view associated with Leamer's extreme-bounds analysis); and third, that it is essential and that the only hope we have of using econometrics to uncover true economic relationships is to be found in the intelligent mining of data. The first approach confuses considerations of sampling distribution and considerations of epistemic warrant and, reaches an unnecessarily hostile attitude toward data mining. The second approach relies on a notion of robustness that has little relationship to truth: there is no good reason to expect a true specification to be robust alternative specifications. Robustness is not, in general, a carrier of epistemic warrant. The third approach is operationalized in the general-to-specific search methodology of the LSE school of econometrics. Its success demonstrates that intelligent data mining is an important element in empirical investigation in economics.
|Keywords||No keywords specified (fix it)|
|Categories||categorize this paper)|
Setup an account with your affiliations in order to access resources via your University's proxy server
Configure custom proxy (use this if your affiliation does not provide a proxy)
|Through your library|
References found in this work BETA
No references found.
Citations of this work BETA
J. Kuorikoski, A. Lehtinen & C. Marchionni (2010). Economic Modelling as Robustness Analysis. British Journal for the Philosophy of Science 61 (3):541-567.
Similar books and articles
Dinah Payne & Cherie Courseault Trumbach (2009). Data Mining: Proprietary Rights, People and Proposals. Business Ethics 18 (3):241-252.
Clinton A. Greene (2000). I Am Not, nor Have I Ever Been a Member of a Data-Mining Discipline. Journal of Economic Methodology 7 (2):217-230.
Aris Spanos (2000). Revisiting Data Mining: 'Hunting' with or Without a License. Journal of Economic Methodology 7 (2):231-264.
Roger E. Backhouse & Mary S. Morgan (2000). Introduction: Is Data Mining a Methodological Problem? Journal of Economic Methodology 7 (2):171-181.
Adrian R. Pagan & Michael R. Veall (2000). Data Mining and the Econometrics Industry: Comments on the Papers of Mayer and of Hoover and Perez. Journal of Economic Methodology 7 (2):211-216.
Herman T. Tavani (1999). Informational Privacy, Data Mining, and the Internet. Ethics and Information Technology 1 (2):137-145.
Steven Cook (2001). Observations on the Practice of Data-Mining: Comments on the JEM Symposium. Journal of Economic Methodology 8 (3):415-419.
Herman T. Tavani (1999). KDD, Data Mining, and the Challenge for Normative Privacy. Ethics and Information Technology 1 (4):265-273.
Lita van Wel & Lambèr Royakkers (2004). Ethical Issues in Web Data Mining. Ethics and Information Technology 6 (2):129-140.
Anthony Danna & Oscar H. Gandy (2002). All That Glitters is Not Gold: Digging Beneath the Surface of Data Mining. [REVIEW] Journal of Business Ethics 40 (4):373 - 386.
Thomas Mayer (2000). Data Mining: A Reconsideration. Journal of Economic Methodology 7 (2):183-194.
Kamal Dahbur & Thomas Muscarello (2003). Classification System for Serial Criminal Patterns. Artificial Intelligence and Law 11 (4):251-269.
Added to index2012-02-20
Total downloads8 ( #250,895 of 1,700,363 )
Recent downloads (6 months)4 ( #161,079 of 1,700,363 )
How can I increase my downloads?