Abstract
Scientific knowledge production is currently affected by the dissemination of data on an unprecedented scale. Technologies for the automated production and sharing of vast amounts of data have changed the way in which data are handled and interpreted in several scientific domains, most notably molecular biology and biomedicine. In these fields, the activity of data gathering has become increasingly technology-driven, with machines such as next generation genome sequencers and mass spectrometers generating billions of data points within hours, and with little need for human supervision. Given the relative ease and low costs with which datasets can be produced (that is, once a laboratory has been able to afford ..