Capturing Provenance of Data Curation at BCO-DMO
Capturing Provenance of Data Curation at BCO-DMO
dc.contributor.author | Shepherd, Adam | |
dc.contributor.author | York, Amber | |
dc.contributor.author | Schloer, Conrad | |
dc.contributor.author | Kinkade, Danie | |
dc.contributor.author | Rauch, Shannon | |
dc.contributor.author | Biddle, Matt | |
dc.contributor.author | Copley, Nancy | |
dc.contributor.author | Haskins, Christina | |
dc.contributor.author | Soenen, Karen | |
dc.contributor.author | Saito, Mak A. | |
dc.contributor.author | Wiebe, Peter | |
dc.date.accessioned | 2020-05-15T21:31:48Z | |
dc.date.available | 2020-05-15T21:31:48Z | |
dc.date.issued | 2020-05-15 | |
dc.description | Presented at Data Curation Network, May 15, 2020 | en_US |
dc.description.abstract | At domain-specific data repositories, curation that strives for FAIR principles often entails transforming data submissions to improve understanding and reuse. The Biological and Chemical Oceanography Data Management Office (BCO-DMO, https://www.bco-dmo.org) has been adopting the data containerization specification of the Frictionless Data project (https://frictionlessdata.io) in an effort to improve its data curation process efficiency. In doing so, BCO-DMO has been using the Frictionless Data Package Pipelines library (https://github.com/frictionlessdata/datapackage-pipelines) to define the processing steps that transform original submissions to final data products. Because these pipelines are defined using a declarative language they can be serialized into formal provenance data structures using the Provenance Ontology (PROV-O, https://www.w3.org/TR/prov-o/). While there may still be some curation steps that cannot be easily automated, this method is a step towards reproducible transforms that bridge the original data submission to its published state in machine-actionable ways that benefit the research community through transparency in the data curation process. BCO-DMO has built a user interface on top of these modular tools for making it easer for data managers to process submission, reuse existing workflows, and make transparent the added value of domain-specific data curation. | en_US |
dc.description.sponsorship | NSF #1924618 | en_US |
dc.identifier.doi | 10.1575/1912/25777 | |
dc.identifier.uri | https://hdl.handle.net/1912/25777 | |
dc.publisher | Woods Hole Oceanographic Institution | en_US |
dc.rights | Attribution 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | * |
dc.subject | Data Curation | en_US |
dc.subject | Provenance | en_US |
dc.subject | Workflows | en_US |
dc.subject | Frictionless Data | en_US |
dc.subject | Data management | en_US |
dc.subject | Data repository | en_US |
dc.title | Capturing Provenance of Data Curation at BCO-DMO | en_US |
dc.type | Presentation | en_US |
dspace.entity.type | Publication | |
relation.isAuthorOfPublication | 5a1ec46b-03cf-40c3-b294-551ee5f54cf7 | |
relation.isAuthorOfPublication | fabbcd8e-ce7a-4ede-b638-e47af490c67c | |
relation.isAuthorOfPublication | 18a11994-2e73-4de3-adf6-cda9a0ddddb6 | |
relation.isAuthorOfPublication | 09542e0b-ec3f-470c-910d-8047ae1c20a3 | |
relation.isAuthorOfPublication | 09cddcd0-c893-4334-8a78-292171f697b4 | |
relation.isAuthorOfPublication | acaa04eb-34c3-4dcd-a8a7-e2a6c525e6cb | |
relation.isAuthorOfPublication | 0fd499a5-2c8f-4e73-afd8-b33db071dd97 | |
relation.isAuthorOfPublication | 4783fa1d-45df-4528-a64c-9aaf8717e17a | |
relation.isAuthorOfPublication | 5ca83620-c5f3-4f10-9ad0-1356498a329c | |
relation.isAuthorOfPublication | cb145654-8987-45bf-8412-902f2c36b648 | |
relation.isAuthorOfPublication | 8c6806d4-c72e-47a8-b713-fe927d8dce80 | |
relation.isAuthorOfPublication.latestForDiscovery | 5a1ec46b-03cf-40c3-b294-551ee5f54cf7 |
Files
Original bundle
1 - 1 of 1
- Name:
- DCN_Prov-Data-Curation_BCO-DMO.pdf
- Size:
- 1.83 MB
- Format:
- Adobe Portable Document Format
- Description:
- DCN_Prov-Data-Curation_BCO-DMO
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.88 KB
- Format:
- Item-specific license agreed upon to submission
- Description: