Frictionless Data Processing in the Wild

Date
2019-05-08
Author
York, Amber
Schloer, Conrad
Copley, Nancy
Biddle, Matt
Rauch, Shannon
Haskins, Christina
Soenen, Karen
Shepherd, Adam
Kinkade, Danie
Citable URI
https://hdl.handle.net/1912/24143
As published
https://doi.org/10.5281/zenodo.2687557
DOI
10.5281/zenodo.2687557
Keyword
Open data; Frictionless data; Datapackage-pipelines; Open knowledge; Data processing; Provenance; Interoperability; FAIR; Workflows
Abstract
Frictionless Data (FD) initiatives out of the Open Knowledge Foundation provide attractive informatics and processing capabilities. The BCO-DMO data repository used FD tools on real-world datasets, and we share the lessons we learned. By building upon existing FD tools, we found ways to reduce the amount of time data managers spend generating metadata and writing custom scripts. We are also developing ways for data managers with varying levels of scripting ability to make use of Frictionless Data tools.
Description
Presented at the csv,conf,v4 conference, Portland, Oregon, May 5-8, 2019.
Suggested Citation
Presentation: York, Amber, Schloer, Conrad, Copley, Nancy, Biddle, Matt, Rauch, Shannon, Haskins, Christina, Soenen, Karen, Shepherd, Adam, Kinkade, Danie, "Frictionless Data Processing in the Wild", Presented at csv,conf,v4 Conference, Portland, Oregon, May 5-8, 2019, DOI:10.5281/zenodo.2687557, https://hdl.handle.net/1912/24143
Related items
Showing items related by title, author, creator and subject.
- Towards Capturing Provenance of the Data Curation Process at Domain-specific Repositories
Shepherd, Adam; Rauch, Shannon; Schloer, Conrad; Kinkade, Danie; Biddle, Matt; Copley, Nancy; Saito, Mak A.; Wiebe, Peter; York, Amber (2018-12-14)
Data repositories often transform submissions to improve understanding and reuse of data by researchers other than the original submitter. However, scientific workflows built by the data submitters often depend on the ...
- In search of Frictionless Data
Shepherd, Adam (Biological and Chemical Oceanography Data Management Office, 2017-09-21)
- Towards capturing data curation provenance using Frictionless Data Package Pipelines [poster]
Shepherd, Adam; Schloer, Conrad; York, Amber; Kinkade, Danie (2018-10-10)
At domain-specific data repositories, curation that strives for FAIR principles often entails transforming data submissions to improve understanding and reuse. The Biological and Chemical Oceanography Data Management ...