Dynamic reusable workflows for ocean science

Date
2016-10-25
Authors
Signell, Richard P.
Fernandes, Filipe
Wilcox, Kyle
DOI
10.3390/jmse4040068
Keywords
Numerical modeling
Reproducibility
Catalog services
Data services
Web services
Metadata
Ocean forecasting
Ocean modeling
Data management
Data system
Interoperability
OPeNDAP
THREDDS
CSW
Jupyter Notebooks
Abstract
Digital catalogs of ocean data have been available for decades, but advances in standardized services and software for catalog searches and data access now make it possible to create catalog-driven workflows that automate, end to end, the search, analysis, and visualization of data from multiple distributed sources. Further, these workflows may be shared, reused, and adapted with ease. Here we describe a workflow developed within the US Integrated Ocean Observing System (IOOS) that automates the skill assessment of water temperature forecasts from multiple ocean forecast models, allowing improved forecast products to be delivered for an open water swim event. A series of Jupyter Notebooks is used to capture and document the end-to-end workflow using a collection of Python tools that facilitate working with standardized catalog and data services. The workflow first searches a catalog of metadata using the Open Geospatial Consortium (OGC) Catalog Service for the Web (CSW), then accesses data service endpoints found in the metadata records using the OGC Sensor Observation Service (SOS) for in situ sensor data and OPeNDAP services for remotely sensed and model data. Skill metrics are computed and time series comparisons of forecast model and observed data are displayed interactively, leveraging the capabilities of modern web browsers. The resulting workflow not only solves a challenging specific problem but also highlights the benefits of dynamic, reusable workflows in general. These workflows adapt as new data enter the data system, facilitate reproducible science, provide templates from which new scientific workflows can be developed, and encourage data providers to use standardized services. As applied to the ocean swim event, the workflow exposed problems with two of the ocean forecast products, which led to improved regional forecasts once the errors were corrected. While the example is specific, the approach is general, and we hope to see increased use of dynamic notebooks across geoscience domains.
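The skill-assessment step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which skill metrics the authors compute, so the choice of bias, root-mean-square error, and Pearson correlation here is an assumption, as are the function name `skill_metrics`, the model labels, and the temperature values, all of which are hypothetical. The sketch also assumes the model and observed series have already been retrieved and aligned onto common time stamps; it does not touch the CSW, SOS, or OPeNDAP services.

```python
import math


def skill_metrics(model, observed):
    """Return bias, RMSE, and Pearson correlation for two paired series.

    Assumes both series are already aligned on the same time stamps.
    """
    if len(model) != len(observed) or len(model) < 2:
        raise ValueError("need two equal-length series with at least 2 points")
    n = len(model)

    # Model-minus-observation differences drive bias and RMSE.
    diffs = [m - o for m, o in zip(model, observed)]
    bias = sum(diffs) / n
    rmse = math.sqrt(sum(d * d for d in diffs) / n)

    # Pearson correlation from deviations about each series mean.
    mean_m = sum(model) / n
    mean_o = sum(observed) / n
    cov = sum((m - mean_m) * (o - mean_o) for m, o in zip(model, observed))
    var_m = sum((m - mean_m) ** 2 for m in model)
    var_o = sum((o - mean_o) ** 2 for o in observed)
    if var_m > 0 and var_o > 0:
        corr = cov / math.sqrt(var_m * var_o)
    else:
        corr = float("nan")  # correlation is undefined for a constant series

    return {"bias": bias, "rmse": rmse, "corr": corr}


# Hypothetical water temperatures in degrees Celsius (illustrative only).
observed = [15.0, 15.5, 16.0, 16.5]
forecasts = {
    "model_a": [15.5, 16.0, 16.5, 17.0],  # warm offset, correct trend
    "model_b": [16.5, 16.0, 15.5, 15.0],  # no mean offset, wrong trend
}

results = {name: skill_metrics(series, observed) for name, series in forecasts.items()}

# Rank the models from lowest to highest RMSE.
ranked = sorted(results, key=lambda name: results[name]["rmse"])
```

Comparing several metrics side by side is one way such a workflow could expose a problematic forecast product: in this toy data `model_b` has zero mean bias yet a negative correlation, which a bias-only check would miss.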
Description
© The Author(s), 2016. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in Journal of Marine Science and Engineering 4 (2016): 68, doi:10.3390/jmse4040068.
Citation
Journal of Marine Science and Engineering 4 (2016): 68
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-ShareAlike 4.0 International