Show simple item record

dc.contributor.authorShepherd, Adam  Concept link
dc.contributor.authorYork, Amber  Concept link
dc.contributor.authorSchloer, Conrad  Concept link
dc.contributor.authorKinkade, Danie  Concept link
dc.contributor.authorRauch, Shannon  Concept link
dc.contributor.authorBiddle, Matt  Concept link
dc.contributor.authorCopley, Nancy  Concept link
dc.contributor.authorHaskins, Christina  Concept link
dc.contributor.authorSoenen, Karen  Concept link
dc.contributor.authorSaito, Mak A.  Concept link
dc.contributor.authorWiebe, Peter  Concept link
dc.date.accessioned2020-05-15T21:31:48Z
dc.date.available2020-05-15T21:31:48Z
dc.date.issued2020-05-15
dc.identifier.urihttps://hdl.handle.net/1912/25777
dc.descriptionPresented at Data Curation Network, May 15, 2020en_US
dc.description.abstractAt domain-specific data repositories, curation that strives for FAIR principles often entails transforming data submissions to improve understanding and reuse. The Biological and Chemical Oceanography Data Management Office (BCO-DMO, https://www.bco-dmo.org) has been adopting the data containerization specification of the Frictionless Data project (https://frictionlessdata.io) in an effort to improve its data curation process efficiency. In doing so, BCO-DMO has been using the Frictionless Data Package Pipelines library (https://github.com/frictionlessdata/datapackage-pipelines) to define the processing steps that transform original submissions to final data products. Because these pipelines are defined using a declarative language they can be serialized into formal provenance data structures using the Provenance Ontology (PROV-O, https://www.w3.org/TR/prov-o/). While there may still be some curation steps that cannot be easily automated, this method is a step towards reproducible transforms that bridge the original data submission to its published state in machine-actionable ways that benefit the research community through transparency in the data curation process. BCO-DMO has built a user interface on top of these modular tools for making it easer for data managers to process submission, reuse existing workflows, and make transparent the added value of domain-specific data curation.en_US
dc.description.sponsorshipNSF #1924618en_US
dc.publisherWoods Hole Oceanographic Institutionen_US
dc.rightsAttribution 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.subjectData Curationen_US
dc.subjectProvenanceen_US
dc.subjectWorkflowsen_US
dc.subjectFrictionless Dataen_US
dc.subjectData managementen_US
dc.subjectData repositoryen_US
dc.titleCapturing Provenance of Data Curation at BCO-DMOen_US
dc.typePresentationen_US
dc.identifier.doi10.1575/1912/25777


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution 4.0 International
Except where otherwise noted, this item's license is described as Attribution 4.0 International