Key components of data publishing : using current best practices to develop a reference model for data publishing
Austin, Claire C
MetadataShow full item record
Availability of workflows for data publishing could have an enormous impact on researchers, research practices and publishing paradigms, as well as on funding strategies and career and research evaluations. We present the generic components of such workflows in order to provide a reference model for these stakeholders. Methods: The RDA-WDS Data Publishing Workflows group set out to study the current data publishing workflow landscape across disciplines and institutions. A diverse set of workflows were examined to identify common components and standard practices, including basic self-publishing services, institutional data repositories, long term projects, curated data repositories, and joint data journal and repository arrangements. Results: The results of this examination have been used to derive a data publishing reference model comprised of generic components. From an assessment of the current data publishing landscape, we highlight important gaps and challenges to consider, especially when dealing with more complex workflows and their integration into wider community frameworks. Conclusions: It is clear that the data publishing landscape is varied and dynamic, and that there are important gaps and challenges. The different components of a data publishing system need to work, to the greatest extent possible, in a seamless and integrated way. We therefore advocate the implementation of existing standards for repositories and all parts of the data publishing process, and the development of new standards where necessary. Effective and trustworthy data publishing should be embedded in documented workflows. As more research communities seek to publish the data associated with their research, they can build on one or more of the components identified in this reference model.
Author Posting. © The Author(s), 2015. This is the author's version of the work. It is posted here by permission of Springer for personal use, not for redistribution. The definitive version was published in International Journal on Digital Libraries 18 (2017): 77-92, doi:10.1007/s00799-016-0178-2.
Suggested CitationPreprint: Austin, Claire C, Bloom, Theodora, Dallmeier-Tiessen, Sunje, Khodiyar, Varsha, Murphy, Fiona, Nurnberger, Amy, Raymond, Lisa, Stockhause, Martina, Tedds, Jonathan, Vardigan, Mary, Whyte, Angus, "Key components of data publishing : using current best practices to develop a reference model for data publishing", 2015-12-04, https://doi.org/10.1007/s00799-016-0178-2, https://hdl.handle.net/1912/8147
Showing items related by title, author, creator and subject.
High-resolution imaging of the Bear Valley section of the San Andreas fault at seismogenic depths with fault-zone head waves and relocated seismicity McGuire, Jeffrey J.; Ben-Zion, Yehuda (Blackwell Publishing, 2005-09-02)Detailed imaging of fault-zone (FZ) material properties at seismogenic depths is a difficult seismological problem owing to the short length scales of the structural features. Seismic energy trapped within a low-velocity ...
Thessen, Anne E.; Cui, Hong; Mozzherin, Dmitry (Hindawi Publishing, 2012)Centuries of biological knowledge are contained in the massive body of scientific literature, written for human-readability but too big for any one person to consume. Large-scale mining of information from the literature ...
Yilmaz, Pelin; Gilbert, Jack A.; Knight, Rob; Amaral-Zettler, Linda A.; Karsch-Mizrachi, Ilene; Cochrane, Guy R.; Nakamura, Yasukazu; Sansone, Susanna-Assunta; Glockner, Frank Oliver; Field, Dawn (Nature Publishing Group, 2011-04-07)Interest in sampling of diverse environments, combined with advances in high-throughput sequencing, vastly accelerates the pace at which new genomes and metagenomes are generated. For example, as of January 2011, 12 ...