DataUp: A tool to help researchers describe and share tabular data

Abstract

Scientific datasets have immeasurable value, but they lose their value overtime without proper documentation, long-term storage, and easy discovery andaccess. Across disciplines as diverse as astronomy, demography, archeology,and ecology, large numbers of small heterogeneous datasets are especially at risk unless they are properly documented, saved, andshared. One unifying factor for many of these at-risk datasets is that they residein spreadsheets.In response to this need, the California Digital Library partneredwith Microsoft Research Connections and the Gordon and Betty MooreFoundation to create the DataUp data management tool for Microsoft Excel.Many researchers creating these small, heterogeneous datasets use Excel atsome point in their data collection and analysis workflow, so we were interestedin developing a data management tool that fits easily into those work flows andminimizes the learning curve for researchers.The DataUp project began in August 2011. We first formally assessedthe needs of researchers by conducting surveys and interviews of our targetresearch groups: earth, environmental, and ecological scientists. We foundthat, on average, researchers had very poor data management practices, werenot aware of data centers or metadata standards, and did not understand thebenefits of data management or sharing. Based on our survey results, wecomposed a list of desirable components and requirements and solicitedfeedback from the community to prioritize potential features of the DataUp tool.These requirements were then relayed to the software developers, and DataUpwas successfully launched in October 2012.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 92,705

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

  • Only published works are available at libraries.

Similar books and articles

Issues in Data Management.Sharon S. Krag - 2010 - Science and Engineering Ethics 16 (4):743-748.
Bodies of Data: Genomic Data and Bioscience Data Sharing.Pilar Ossorio - 2011 - Social Research: An International Quarterly 78 (4):907-932.
Bodies of data: genomic data and bioscience data sharing.Pilar N. Ossorio - 2011 - Social Research: An International Quarterly 78 (3):907-932.
The use of secondary data in business ethics research.Christopher J. Cowton - 1998 - Journal of Business Ethics 17 (4):423-434.

Analytics

Added to PP
2017-03-18

Downloads
2 (#1,813,371)

6 months
2 (#1,241,799)

Historical graph of downloads

Sorry, there are not enough data points to plot this chart.
How can I increase my downloads?

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references