The Frictionless Data Package : data containerization for automated scientific workflows [poster]

dc.contributor.author Shepherd, Adam
dc.contributor.author Fils, Douglas
dc.contributor.author Kinkade, Danie
dc.contributor.author Saito, Mak A.
dc.date.accessioned 2017-12-12T15:23:29Z
dc.date.available 2017-12-12T15:23:29Z
dc.date.issued 2017-12-13
dc.description Presented at the Fall AGU Meeting, New Orleans, LA, 11-15 December 2017 en_US
dc.description.abstract As cross-disciplinary geoscience research increasingly relies on machines to discover and access data, one of the critical questions facing data repositories is how data and supporting materials should be packaged for consumption. Traditionally, data repositories have relied on a human's involvement throughout discovery and access workflows. This human could assess fitness for purpose by reading loosely coupled, unstructured information from web pages and documentation. In attempts to shorten the time to science and access data resources across may disciplines, expectations for machines to mediate the process of discovery and access is challenging data repository infrastructure. This challenge is to find ways to deliver data and information in ways that enable machines to make better decisions by enabling them to understand the data and metadata of many data types. Additionally, once machines have recommended a data resource as relevant to an investigator's needs, the data resource should be easy to integrate into that investigator's toolkits for analysis and visualization. The Biological and Chemical Oceanography Data Management Office (BCO-DMO) supports NSF-funded OCE and PLR investigators with their project's data management needs. These needs involve a number of varying data types some of which require multiple files with differing formats. Presently, BCO-DMO has described these data types and the important relationships between the type's data files through human-readable documentation on web pages. For machines directly accessing data files from BCO-DMO, this documentation could be overlooked and lead to misinterpreting the data. Instead, BCO-DMO is exploring the idea of data containerization, or packaging data and related information for easier transport, interpretation, and use. In researching the landscape of data containerization, the Frictionlessdata Data Package (http://frictionlessdata.io/) provides a number of valuable advantages over similar solutions. This presentation will focus on these advantages and how the Frictionlessdata Data Package addresses a number of real-world use cases faced for data discovery, access, analysis and visualization. en_US
dc.description.sponsorship National Science Foundation Award #1435578, Award #1639714 en_US
dc.identifier.uri https://hdl.handle.net/1912/9418
dc.language.iso en_US en_US
dc.rights Attribution 4.0 International *
dc.rights.uri http://creativecommons.org/licenses/by/4.0/ *
dc.subject Frictionless Data en_US
dc.subject Data management en_US
dc.subject Data workflows en_US
dc.subject Data transport en_US
dc.title The Frictionless Data Package : data containerization for automated scientific workflows [poster] en_US
dc.type Presentation en_US
dspace.entity.type Publication
relation.isAuthorOfPublication acaa04eb-34c3-4dcd-a8a7-e2a6c525e6cb
relation.isAuthorOfPublication 0fd499a5-2c8f-4e73-afd8-b33db071dd97
relation.isAuthorOfPublication cb145654-8987-45bf-8412-902f2c36b648
relation.isAuthorOfPublication c4bdb97f-7a7b-4b96-8441-962b9ac43442
relation.isAuthorOfPublication.latestForDiscovery acaa04eb-34c3-4dcd-a8a7-e2a6c525e6cb
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
The-Frictionless-Data-Package_Data-Containerization-for-Automated-Scientific-Workflows.pdf
Size:
1.19 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.89 KB
Format:
Item-specific license agreed upon to submission
Description: