Capturing Provenance of Data Curation at BCO-DMO

dc.contributor.author Shepherd, Adam
dc.contributor.author York, Amber
dc.contributor.author Schloer, Conrad
dc.contributor.author Kinkade, Danie
dc.contributor.author Rauch, Shannon
dc.contributor.author Copley, Nancy
dc.contributor.author Gerlach, Dana
dc.contributor.author Haskins, Christina
dc.contributor.author Soenen, Karen
dc.contributor.author Saito, Mak A.
dc.contributor.author Wiebe, Peter
dc.date.accessioned 2020-11-10T19:32:03Z
dc.date.available 2020-11-10T19:32:03Z
dc.date.issued 2020-11-09
dc.description Presented at USGS Data Management Working Group, 9 November 2020 en_US
dc.description.abstract At domain-specific data repositories, curation that strives for FAIR principles often entails transforming data submissions to improve understanding and reuse. The Biological and Chemical Oceanography Data Management Office (BCO-DMO, https://www.bco-dmo.org) has been adopting the data containerization specification of the Frictionless Data project (https://frictionlessdata.io) to make its data curation process more efficient. In doing so, BCO-DMO has been using the Frictionless Data Package Pipelines library (https://github.com/frictionlessdata/datapackage-pipelines) to define the processing steps that transform original submissions into final data products. Because these pipelines are defined using a declarative language, they can be serialized into formal provenance data structures using the Provenance Ontology (PROV-O, https://www.w3.org/TR/prov-o/). While some curation steps may still be difficult to automate, this method is a step towards reproducible transforms that link the original data submission to its published state in machine-actionable ways, benefiting the research community by making the data curation process transparent. BCO-DMO has built a user interface on top of these modular tools to make it easier for data managers to process submissions, reuse existing workflows, and make transparent the added value of domain-specific data curation. en_US
dc.description.sponsorship NSF #1924618 en_US
dc.identifier.doi 10.1575/1912/26373
dc.identifier.uri https://hdl.handle.net/1912/26373
dc.publisher Woods Hole Oceanographic Institution en_US
dc.relation.isversionof https://doi.org/10.1575/1912/25777
dc.rights CC0 1.0 Universal *
dc.rights.uri http://creativecommons.org/publicdomain/zero/1.0/ *
dc.subject Data Curation en_US
dc.subject Provenance en_US
dc.subject Workflows en_US
dc.subject Frictionless Data en_US
dc.subject Data management en_US
dc.subject Data repository en_US
dc.title Capturing Provenance of Data Curation at BCO-DMO en_US
dc.type Presentation en_US
dspace.entity.type Publication
relation.isAuthorOfPublication fabbcd8e-ce7a-4ede-b638-e47af490c67c
relation.isAuthorOfPublication 18a11994-2e73-4de3-adf6-cda9a0ddddb6
relation.isAuthorOfPublication 09542e0b-ec3f-470c-910d-8047ae1c20a3
relation.isAuthorOfPublication 09cddcd0-c893-4334-8a78-292171f697b4
relation.isAuthorOfPublication acaa04eb-34c3-4dcd-a8a7-e2a6c525e6cb
relation.isAuthorOfPublication 0fd499a5-2c8f-4e73-afd8-b33db071dd97
relation.isAuthorOfPublication 4783fa1d-45df-4528-a64c-9aaf8717e17a
relation.isAuthorOfPublication 1a2c8da5-e47b-4780-8317-2fdee8a0ddda
relation.isAuthorOfPublication 5ca83620-c5f3-4f10-9ad0-1356498a329c
relation.isAuthorOfPublication cb145654-8987-45bf-8412-902f2c36b648
relation.isAuthorOfPublication 8c6806d4-c72e-47a8-b713-fe927d8dce80
relation.isAuthorOfPublication.latestForDiscovery fabbcd8e-ce7a-4ede-b638-e47af490c67c
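
The abstract in this record describes serializing declaratively defined pipeline steps into PROV-O structures. The sketch below illustrates that general idea only and is not BCO-DMO's implementation. It assumes a pipeline expressed as a Python dictionary shaped like a Data Package Pipelines spec (steps with `run` and `parameters`). The file names, the `urn:example:` identifiers, the `ex:parameters` property, and the `pipeline_to_prov` function are all hypothetical. Each step becomes a `prov:Activity` that uses the previous `prov:Entity` and generates the next one.

```python
import json

PROV = "http://www.w3.org/ns/prov#"
RDFS = "http://www.w3.org/2000/01/rdf-schema#"

# Hypothetical pipeline, shaped like a Data Package Pipelines spec.
pipeline = {
    "title": "example-submission",
    "pipeline": [
        {"run": "load", "parameters": {"from": "submission.csv", "name": "data"}},
        {"run": "sort", "parameters": {"resources": "data", "sort-by": "{date}"}},
        {"run": "dump_to_path", "parameters": {"out-path": "published"}},
    ],
}


def pipeline_to_prov(spec, base="urn:example:"):
    """Serialize pipeline steps as a chain of PROV-O activities and entities (JSON-LD)."""
    previous = f"{base}entity/0"
    graph = [
        {"@id": previous, "@type": "prov:Entity", "rdfs:label": "original submission"}
    ]
    for i, step in enumerate(spec["pipeline"], start=1):
        activity = f"{base}activity/{i}"
        entity = f"{base}entity/{i}"
        # The processing step itself, with its parameters kept as a JSON string.
        graph.append({
            "@id": activity,
            "@type": "prov:Activity",
            "rdfs:label": step["run"],
            "prov:used": {"@id": previous},
            "ex:parameters": json.dumps(step.get("parameters", {}), sort_keys=True),
        })
        # The intermediate (or final) data state produced by that step.
        graph.append({
            "@id": entity,
            "@type": "prov:Entity",
            "prov:wasGeneratedBy": {"@id": activity},
            "prov:wasDerivedFrom": {"@id": previous},
        })
        previous = entity
    return {
        "@context": {"prov": PROV, "rdfs": RDFS, "ex": base},
        "@graph": graph,
    }


prov_doc = pipeline_to_prov(pipeline)
print(json.dumps(prov_doc, indent=2))
```

Because every entity points back to the entity it was derived from, following `prov:wasDerivedFrom` from the last entity leads step by step to the original submission, which is the kind of traceable bridge between submitted and published data that the abstract describes.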
Files
Original bundle
Name:
USGS DM WG_ Capturing Provenance of Data Curation at BCO-DMO.pdf
Size:
1.86 MB
Format:
Adobe Portable Document Format
Description:
USGS_DM_WG_ Capturing_Provenance_Data_Curation_BCO_DMO