Administrative social science data: The challenge of reproducible research
Big Data and Society 3 (2) (2016)
Abstract
Powerful new social science data resources are emerging. One particularly important source is administrative data, which were originally collected for organisational purposes but often contain information that is suitable for social science research. In this paper we outline the concept of reproducible research in relation to micro-level administrative social science data. Our central claim is that a planned and organised workflow is essential for high quality research using micro-level administrative social science data. We argue that it is essential for researchers to share research code, because code sharing enables the elements of reproducible research. First, it enables results to be duplicated and therefore allows the accuracy and validity of analyses to be evaluated. Second, it facilitates further tests of the robustness of the original piece of research. Drawing on insights from computer science and other disciplines that have been engaged in e-Research, we discuss and advocate the use of Git repositories to provide a useable and effective solution to research code sharing and rendering social science research using micro-level administrative data reproducible.
Similar books and articles
Sharing Data is a Shared Responsibility: Commentary on: “The Essential Nature of Sharing in Science”. Joe Giffels - 2010 - Science and Engineering Ethics 16 (4):801-803.
Challenges in administrative data linkage for research. Harvey Goldstein, Mauricio L. Barreto, Mahmoud Azimaee, Anders Hjern, James Boyd, Chris Dibben & Katie Harron - 2017 - Big Data and Society 4 (2).
The role and nature of consent in government administrative data. Alexandra Eveleigh, Oliver Duke-Williams, Elizabeth Shepherd & Anna Sexton - 2018 - Big Data and Society 5 (2).
Openness in the social sciences: Sharing data. Joan E. Sieber - 1991 - Ethics and Behavior 1 (2):69-86.
Archiving information from geotagged tweets to promote reproducibility and comparability in social media research. Fred Morstatter, Jürgen Pfeffer, Wolfgang Zenk-Möltgen, Katrin Weller & Katharina Kinder-Kurlanda - 2017 - Big Data and Society 4 (2).
Where are human subjects in Big Data research? The emerging ethics divide. Kate Crawford & Jacob Metcalf - 2016 - Big Data and Society 3 (1).
Research Integrity in the Context of Social Science Research in Africa. Nico Nortjé & Willem A. Hoffmann - 2019 - In Nico Nortjé, Retha Visagie & J. S. Wessels (eds.), Social Science Research Ethics in Africa. Springer Verlag. pp. 117-123.
Editors' Overview: Topics in the Responsible Management of Research Data. Joe Giffels, Sara H. Vollmer & Stephanie J. Bird - 2010 - Science and Engineering Ethics 16 (4):631-637.
Values and Data Collection in Social Research. Julie Zahle - 2018 - Philosophy of Science 85 (1):144-163.
What enables ethically conducted clinical research in hospitals? Views of the administrative staff. Sanna-Maria Nurmi, Mari Kangasniemi, Arja Halkoaho & Anna-Maija Pietilä - 2016 - Clinical Ethics 11 (4):166-175.
Developing Current Research Information Systems as Data Sources for Studies of Research. Gunnar Sivertsen - 2019 - In Wolfgang Glänzel, Henk F. Moed, Ulrich Schmoch & Mike Thelwall (eds.), Springer Handbook of Science and Technology Indicators. Springer Verlag. pp. 667-683.
Publishing computational research - a review of infrastructures for reproducible and transparent scholarly communication. [REVIEW] Laura Goulier, Daniel Nüst & Markus Konkol - 2020 - Research Integrity and Peer Review 5 (1).
Openness in Big Data and Data Repositories: The Application of an Ethics Framework for Big Data in Health and Research. Vicki Xafis & Markus K. Labude - 2019 - Asian Bioethics Review 11 (3):255-273.
Predicting ethnicity with first names in online social media networks. Niek C. de Schipper & Bas Hofstra - 2018 - Big Data and Society 5 (1).
On the Emergence and the Research Outline of Social Information Science. Ouyang Kang - 2008 - Proceedings of the XXII World Congress of Philosophy 46:37-52.
Analytics
Added to PP: 2020-11-24
Downloads: 0
Downloads (6 months): 0
Citations of this work
Challenges in administrative data linkage for research. Harvey Goldstein, Mauricio L. Barreto, Mahmoud Azimaee, Anders Hjern, James Boyd, Chris Dibben & Katie Harron - 2017 - Big Data and Society 4 (2).
‘What about the dads?’ Linking fathers and children in administrative data: A systematic scoping review. Jenny Woodman, Margaret O’Brien, Pia Hardelid, Katie Harron & Irina Lut - 2022 - Big Data and Society 9 (1).
References found in this work
Cognitive neuroscience 2.0: building a cumulative science of human brain function. Tal Yarkoni, Russell A. Poldrack, David C. Van Essen & Tor D. Wager - 2010 - Trends in Cognitive Sciences 14 (11):489-496.