Biological and Chemical Oceanography Data Management Office (BCO-DMO)
Permanent URI for this community
The Biological and Chemical Oceanography Data Management Office (BCO-DMO) staff members work with investigators to serve data online from research projects funded by the Biological and Chemical Oceanography Sections, the Division of Polar Programs Arctic Sciences and Antarctic Organisms & Ecosystems Program at the U.S. National Science Foundation.
BCO-DMO is a combination of the formerly independent Data Management Offices formed in support of the US JGOFS and US GLOBEC programs. The BCO-DMO staff members are the curators of the data collections created by those respective programs, as well as data from more recent NSF Geosciences Directorate (GEO) Division of Ocean Sciences (OCE) Biological and Chemical Oceanography Sections, Division of Polar Programs (PLR) Antarctic Sciences (ANT) Organisms & Ecosystems, and Arctic Sciences (ARC) awards. The BCO-DMO project is funded by NSF OCE and ANT programs, NSF award number OCE-1435578.
Data sets managed by BCO-DMO and hosted in WHOAS, can be found here.
Browse
Browsing Biological and Chemical Oceanography Data Management Office (BCO-DMO) by Title
Results Per Page
Sort Options
-
PresentationThe advantages of machine aided co-reference resolution for research cruise metadata( 2017-05-31) Shepherd, Adam ; Chandler, Cynthia L. ; Arko, Robert A. ; Fils, Douglas ; Kinkade, DanieOne of the central incentives of deploying linked open data is the opportunity to leverage the linkages between source datasets to retrieve related information. The Biological and Chemical Oceanography Data Management Office (BCO-DMO) reaps these benefits by linking its cruise-level metadata to the Rolling Deck to Repository (R2R) – the trusted, authoritative source for cruises undertaken by the U.S. academic research fleet. Even though the process of identifying a link between these two repositories is easy for a human, this talk will explore the advantages of using a machine-aided process to suggest links to R2R cruises to a BCO-DMO data manager.
-
PresentationAligned semantics to advance data interoperability across the ocean value chain - from raw data to societal goals [poster](Woods Hole Oceanographic Institution, 2019-09-16) Shepherd, Adam ; Caltagirone, Scott ; Kokkinaki, Alexandra ; Leadbetter, Adam ; Moncoiffe, Gwenaelle ; Simpson, Pauline ; Thomas, Robert ; Buttigieg, Pier LuigiThe FAIR principles (Findability, Accessibility, Interoperability, Re-usability) have pervaded discussions on data across disciplines and sectors.While data Findability and Accessibility has greatly improved, considerable difficulties in scalable interoperation remain. Without significant progress, the rapidly growing stores of ocean data risk being siloed for many years to come. A key aspect of Interoperability is "semantic": using knowledge representation (KR) to translate human understanding into machine-readable form. Quality KR allows machines to "understand" what any information artifact is about and relate it to similar artifacts, enabling discovery and enhancing reuse. KR products are usually expressed as vocabularies, glossaries, thesauri, or ontologies (collectively, terminologies), each with its own costs and benefits. Ironically, most marine terminologies are, themselves, not truly interoperable. This is an unfortunate but inevitable outcome of localised and transient funding, and the lack of sustained global infrastructures.Nonetheless, voluntary consortia are addressing this issue with urgency to realise the promise of KR in ocean observation. Here, we present 1) the alignment of well-adopted marine terminologies, 2) a collective strategy for sustained interoperability, and 3) a use case featuring the IOC-UNESCO Ocean Best Practice System. Initialised by the Biological and Chemical Oceanography Data Management Office, we are interlinking terminologies from the Natural Environment Research Council's Vocabulary Server, the Open Biological and Biomedical Ontologies Foundry, and the Earth Science Information Partners. To serve the UNESCO Decade of Ocean Science for Sustainable Development, this effort includes ontologies which represent both the Essential Ocean Variables and the UN Sustainable Development Goals. Finally, we provide perspectives on what measures are needed to meet the interoperability challenge at scale over the next decade.
-
OtherBCO-DMO Quick Guide( 2018-09-19) Kinkade, Danie ; Shepherd, Adam ; Ake, Hannah ; Biddle, Matt ; Copley, Nancy ; Rauch, Shannon ; York, AmberCurating and providing open access to research data is a collaborative process. This process may be thought of as a life cycle with data passing through various phases. Each phase has its own associated actors, roles, and critical activities. Good data management practices are necessary for all phases, from proposal to preservation.
-
PresentationBCO-DMO's migration to ERDDAP(Woods Hole Oceanographic Institution, 2019-10-24) Biddle, MattAs a domain specific repository, BCO-DMO supports data stewardship throughout the data lifecycle. One key aspect of that data lifecycle is making data and metadata available online in a variety of file formats. This presentation will walk through BCO-DMO's current data serving system, our migration to ERDDAP, and what that might mean for the future. There will be a focus on the nuts-and-bolts of our migration, the benefits of this activity, and some of the difficulties we've encountered along the way.
-
PresentationBCO-DMO: Surfing the Crests and Troughs of Data Sharing(Woods Hole Oceanographic Institution, 2022-03-02) Kinkade, DanieMany of the challenges currently associated with sharing oceanographic data currently facing researchers and the repositories through which they share their data, are cultural rather than technical. This talk presents an overview of obstacles and opportunities related to data sharing within the oceanographic community.
-
PresentationBiological & Chemical Oceanography Data Management Office : a domain-specific repository for oceanographic data from around the world [poster]( 2018-02-14) Ake, Hannah ; Biddle, Matt ; Copley, Nancy ; Kinkade, Danie ; Rauch, Shannon ; Saito, Mak A. ; Shepherd, Adam ; Switzer, Megan ; Wiebe, Peter ; York, AmberThe Biological and Chemical Oceanography Data Management Office (BCO-DMO) is a domain-specific digital data repository that works with investigators funded under the National Science Foundation’s Division of Ocean Sciences and Office of Polar Programs to manage their data free of charge. Data managers work closely with investigators to satisfy their data sharing requirements and to develop comprehensive Data Management Plans, as well as to ensure that their data will be well described with extensive metadata creation. Additionally, BCO-DMO offers tools to find and reuse these high-quality data and metadata packages, and services such as DOI generation for publication and attribution. These resources are free for all to discover, access, and utilize. As a repository embedded in our research community, BCO-DMO is well positioned to offer knowledge and expertise from both domain trained data managers and the scientific community at large. BCO-DMO is currently home to more than 9000 datasets and 900 projects, all of which are or will be submitted for archive at the National Centers for Environmental Information (NCEI). Our data holdings continue to grow, and encompass a wide range of oceanographic research areas, including biological, chemical, physical, and ecological. These data represent cruises and experiments from around the world, and are managed using community best practices, standards, and technologies to ensure accuracy and promote re-use. BCO-DMO is a repository and tool for investigators, offering both ocean science data and resources for data dissemination and publication.
-
PresentationThe Biological and Chemical Oceanography Data Management Office: Accelerating Scientific Discovery Through Responsive Management of Observational Oceanographic Data [poster](Woods Hole Oceanographic Institution, 2019-09-16) Kinkade, Danie ; Shepherd, Adam ; Biddle, Matt ; Copley, Nancy ; Haskins, Christina ; Soenen, Karen ; Rauch, Shannon ; York, Amber ; Saito, Mak A. ; Wiebe, PeterOceanographic data, when well-documented and stewarded toward preservation, have the potential to accelerate new science and facilitate our understanding of complex natural systems. The Biological and Chemical Oceanography Data Management Office (BCO-DMO) is funded by the NSF to document and manage marine biological, chemical, physical, and biogeochemical data, ensuring their discovery and access, and facilitating their reuse. The task of curating and providing access to research data is a collaborative process, with associated actors and critical activities occurring throughout the data’s life cycle. BCO-DMO supports all phases of the data life cycle and works closely with investigators to ensure open access of well-documented project data and information. Supporting this curation process is a flexible cyberinfrastructure that provides the means for data submission, discovery, and access; ultimately enabling reuse. This poster will introduce the repository and describe some of the strategic enhancements coming to BCO-DMO.
-
PresentationBiological and Chemical Oceanography Data Management Office: Supporting a New Vision for Adaptive Management of Oceanographic Data [poster](Woods Hole Oceanographic Institution, 2022-06-21) Shepherd, Adam ; Gerlach, Dana ; Heyl, Taylor ; Kinkade, Danie ; Nagala, Shravani ; Newman, Sawyer ; Rauch, Shannon ; Saito, Mak A. ; Schloer, Conrad ; Soenen, Karen ; Wiebe, Peter ; York, AmberAn unparalleled data catalog of well-documented, interoperable oceanographic data and information, openly accessible to all end-users through an intuitive web-based interface for the purposes of advancing marine research, education, and policy. Conference Website: https://web.whoi.edu/ocb-workshop/
-
ArticleBringing dark data into the light : a case study of the recovery of Northwestern Atlantic zooplankton data collected in the 1970s and 1980s(Elsevier, 2015-04-06) Wiebe, Peter H. ; Allison, M. DicksonData generated as a result of publicly funded research in the USA and other countries are now required to be available in public data repositories. However, many scientific data over the past 50+ years were collected at a time when the technology for curation, storage, and dissemination were primitive or non-existent and consequently many of these datasets are not available publicly. These so-called “dark data” sets are essential to the understanding of how the ocean has changed chemically and biologically in response to the documented shifts in temperature and salinity (aka climate change). An effort is underway to bring into the light, dark data about zooplankton collected in the 1970s and 1980s as part of the cold-core and warm-core rings multidisciplinary programs and other related projects. Zooplankton biomass and euphausiid species abundance from 306 tows and related environmental data including many depth specific tows taken on 34 research cruises in the Northwest Atlantic are online and accessible from the Biological and Chemical Oceanography Data Management Office (BCO-DMO).
-
PresentationCapturing Provenance of Data Curation at BCO-DMO(Woods Hole Oceanographic Institution, 2020-05-15) Shepherd, Adam ; York, Amber ; Schloer, Conrad ; Kinkade, Danie ; Rauch, Shannon ; Biddle, Matt ; Copley, Nancy ; Haskins, Christina ; Soenen, Karen ; Saito, Mak A. ; Wiebe, PeterAt domain-specific data repositories, curation that strives for FAIR principles often entails transforming data submissions to improve understanding and reuse. The Biological and Chemical Oceanography Data Management Office (BCO-DMO, https://www.bco-dmo.org) has been adopting the data containerization specification of the Frictionless Data project (https://frictionlessdata.io) in an effort to improve its data curation process efficiency. In doing so, BCO-DMO has been using the Frictionless Data Package Pipelines library (https://github.com/frictionlessdata/datapackage-pipelines) to define the processing steps that transform original submissions to final data products. Because these pipelines are defined using a declarative language they can be serialized into formal provenance data structures using the Provenance Ontology (PROV-O, https://www.w3.org/TR/prov-o/). While there may still be some curation steps that cannot be easily automated, this method is a step towards reproducible transforms that bridge the original data submission to its published state in machine-actionable ways that benefit the research community through transparency in the data curation process. BCO-DMO has built a user interface on top of these modular tools for making it easer for data managers to process submission, reuse existing workflows, and make transparent the added value of domain-specific data curation.
-
PresentationCapturing Provenance of Data Curation at BCO-DMO(Woods Hole Oceanographic Institution, 2020-11-09) Shepherd, Adam ; York, Amber ; Schloer, Conrad ; Kinkade, Danie ; Rauch, Shannon ; Copley, Nancy ; Gerlach, Dana ; Haskins, Christina ; Soenen, Karen ; Saito, Mak A. ; Wiebe, PeterAt domain-specific data repositories, curation that strives for FAIR principles often entails transforming data submissions to improve understanding and reuse. The Biological and Chemical Oceanography Data Management Office (BCO-DMO, https://www.bco-dmo.org) has been adopting the data containerization specification of the Frictionless Data project (https://frictionlessdata.io) in an effort to improve its data curation process efficiency. In doing so, BCO-DMO has been using the Frictionless Data Package Pipelines library (https://github.com/frictionlessdata/datapackage-pipelines) to define the processing steps that transform original submissions to final data products. Because these pipelines are defined using a declarative language they can be serialized into formal provenance data structures using the Provenance Ontology (PROV-O, https://www.w3.org/TR/prov-o/). While there may still be some curation steps that cannot be easily automated, this method is a step towards reproducible transforms that bridge the original data submission to its published state in machine-actionable ways that benefit the research community through transparency in the data curation process. BCO-DMO has built a user interface on top of these modular tools for making it easier for data managers to process submission, reuse existing workflows, and make transparent the added value of domain-specific data curation.
-
ArticleChallenges and future directions for data management in the geosciences(American Meteorological Society, 2019-06-04) Schuster, Douglas C. ; Mayernik, Matthew ; Hou, Chung-Yi ; Stossmeister, Greg ; Downs, Robert R. ; Kinkade, Danie ; Nguyen, Tran B. ; Ramamurthy, Mohan ; Zhang, FuqingThe open availability and wide accessibility of digital scientific resources, such as articles and datasets, is becoming the norm for twenty-first-century science. Geoscience researchers are now being asked by funding agencies and scientific publishers to archive and cite data to support open access but often struggle to understand, interpret, and fulfill these requirements. To fulfill the promise of new open data initiatives, 1) scientific resources (e.g., data and software) must be collected and documented properly; 2) repository services, including preservation and storage capabilities, must be maintained, supported, and improved over time; and 3) governance institutions must be established. These issues were discussed in the Geoscience Digital Data Resource and Repository Service (GeoDaRRS) workshop,1 held in August 2018, at NCAR. The workshop brought together more than 60 geoscience researchers, technology experts, scientific publishers, funders, and data repository personnel to discuss data management challenges and opportunities within the geosciences. This included exploring whether new services are needed to complement existing data facilities, particularly in the areas of 1) data management planning support resources and 2) repository services for geoscience researchers who have data that do not fit in any existing repository. More details on the workshop agenda and recommendations are available in the final workshop report (Mayernik et al. 2018).
-
PresentationCode and Software: How would you share yours? [poster](Woods Hole Oceanographic Institution, 2020-02-21) Biddle, Matt ; Copley, Nancy ; Haskins, Christina ; Rauch, Shannon ; Soenen, Karen ; York, Amber ; Kinkade, Danie ; Saito, Mak A. ; Shepherd, Adam ; Wiebe, PeterBCO-DMO curates earth science data where models become increasingly important. The Biological and Chemical Oceanography Data Management Office (BCO-DMO) is a publicly accessible earth science data repository created to curate, publicly serve (publish), and archive digital data and information from biological, chemical and biogeochemical research conducted in coastal, marine, great lakes and laboratory environments. Recently, more and more of the projects submitted to BCO-DMO represent modeling efforts which further increase our knowledge of chemical and biological properties within the ocean ecosystem. We feel the time is at hand for the scientific community to begin a concerted and holistic approach to the curation of code and software.
-
Moving ImageCollaborative research : EarthCube building blocks, leveraging semantics and linked data for geoscience data sharing and discovery, OceanLink( 2013-10-28) Wiebe, Peter H. ; Chandler, Cynthia L. ; Raymond, Lisa ; Shepherd, Adam ; Finin, Tim ; Narock, Tom ; Arko, Robert A. ; Carbotte, Suzanne M. ; Hitzler, Pascal ; Cheatham, Michelle ; Krisnadhi, AdilaThe OceanLink EarthCube project will apply state-of-the-art Semantic Web Technologies to support data representation, discovery, analysis, sharing, and integration of datasets from the global oceans, and related resources including meeting abstracts and library holdings. Ships are a principal platform from which a wide spectrum of oceanographic data are collected. At the University of Maryland, Baltimore County, semantic relationships will be extracted from text for use in developing methods that efficiently identify relationships across distributed oceanographic datasets. At Wright State University integration of disparate data will occur by refining and applying leading edge technology from the Semantic Web, ontologies, and linked data. From the MBLWHOI Library, DSpace content will be published as Linked Open Data, providing relationships between oceanographic datasets, publications, conference presentations, and funded National Science Foundation projects. Teams of researchers at the Lamont-Doherty Earth Observatory and the Woods Hole Oceanographic Institution will develop Use Cases that represent the needs of the oceanographic research community and will publish oceanographic dataset catalogs as Linked Open Data. A key contribution will be semantically-enabled cyberinfrastructure components capable of automated data integration across distributed repositories. These efforts will ultimately lead to generalized computational techniques applicable to all of EarthCube.
-
PresentationData Help Desk BCO-DMO Lightning Talk(Woods Hole Oceanographic Institution, 2020-02-18) Biddle, Matt ; Shepherd, Adam ; Kinkade, Danie ; Haskins, Christina ; Soenen, Karen ; Rauch, Shannon ; Copley, Nancy ; York, Amber ; Schloer, Conrad ; Saito, Mak A. ; Wiebe, PeterBCO-DMO is the Biological and Chemical Oceanography Data Management Office. We help oceanography researchers who are funded by the National Science Foundation’s (NSF's) Division of Ocean Sciences' (OCE) Biological or Chemical Oceanography Sections or the Division of Polar Programs' Antarctic Organisms & Ecosystems Program manage their data, making them accessible over the internet. This lightning talk gives a brief overview of who we are, who we work with, and the types of data we manage.
-
PresentationData Management and Reporting: BCO-DMO Data Management Services and Best Practices(Woods Hole Oceanographic Institution, 2019-06-14) Rauch, Shannon ; Kinkade, Danie ; Biddle, Matt ; Copley, Nancy ; York, Amber ; Soenen, Karen ; Shepherd, AdamThe University-National Oceanographic Laboratory System (UNOLS) hosted an Early Career Chief Scientist Training Workshop in June 2019. The goal of this workshop was to help early-career marine scientists plan and write effective cruise proposals, develop collaborative sampling strategies and plans, become familiar with shipboard equipment and sampling at sea, and communicate major findings through writing of manuscripts and cruise reports. This presentation provides information on data management and reporting best practices for chief scientists. It includes information on: National Science Foundation (NSF) data policy requirements, writing a Data Management Plan (DMP), the data lifecycle, data publication, and shipboard data management recommendations.
-
PresentationData Management in (Ocean) Sciences – Interactive Class(Woods Hole Oceanographic Institution, 2020-02-26) Soenen, Karen ; Harden, Benjamin E.Ocean 101, engaging classes to help SEA students understand the frontiers of ocean climate science. This particular class focuses on data management in oceanography. Covered topics are the importance of open data, the data life cycle and F.A.I.R. Principles. The interactive part consists of creating the content for a data management plan and applying general data management practices.
-
PresentationThe Data Management Process and Lessons Learned From U.S. GEOTRACES(Woods Hole Oceanographic Institution, 2018-11-09) Rauch, Shannon ; Kinkade, Danie ; Shepherd, Adam ; Copley, Nancy ; Biddle, Matt ; York, AmberIn an effort to explore and develop international community interest for a potential future "Biogeotraces-like" program, a working group of 28 scientists from 9 nations met in Woods Hole in November 2018. The result of this workshop is a new research effort termed "Biogeoscapes". This presentation highlighted data management lessons and recommendations from based on past experience handling data from a similarly-scaled global research project, GEOTRACES.
-
ArticleData management strategy to improve global use of ocean acidification data and information(The Oceanography Society, 2015-06) Garcia, Hernan E. ; Cosca, Catherine E. ; Kozyr, Alex ; Mayorga, Emilio ; Chandler, Cynthia L. ; Thomas, Robert W. ; O’Brien, Kevin ; Appeltans, Ward ; Hankin, Steve ; Newton, Jan A. ; Gutierrez, Angelica ; Gattuso, Jean-Pierre ; Hansson, Lina ; Zweng, Melissa ; Pfeil, BenjaminOcean acidification (OA) refers to the general decrease in pH of the global ocean as a result of absorbing anthropogenic CO2 emitted in the atmosphere since preindustrial times (Sabine et al., 2004). There is, however, considerable variability in ocean acidification, and many careful measurements need to be made and compared in order to obtain scientifically valid information for the assessment of patterns, trends, and impacts over a range of spatial and temporal scales, and to understand the processes involved. A single country or institution cannot undertake measurements of worldwide coastal and open ocean OA changes; therefore, international cooperation is needed to achieve that goal. The OA data that have been, and are being, collected represent a significant public investment. To this end, it is critically important that researchers (and others) around the world are easily able to find and use reliable OA information that range from observing data (from time-series moorings, process studies, and research cruises), to biological response experiments (e.g., mesocosm), data products, and model output.
-
PresentationData Science Training Camp at Woods Hole Oceanographic Institution: Syllabus and slide presentations in 2020(Woods Hole Oceanographic Institution, 2020-08-21) Beaulieu, Stace E. ; Raymond, Lisa ; Mickle, Audrey ; Futrelle, Joe ; Symmonds, Nick ; Mazzoli, Roberta ; Brey, Rich ; Kinkade, Danie ; Rauch, ShannonWith data and software increasingly recognized as scholarly research products, and aiming towards open science and reproducibility, it is imperative for today's oceanographers to learn foundational practices and skills for data management and research computing, as well as practices specific to the ocean sciences. This educational package was developed as a data science training camp for graduate students and professionals in the ocean sciences and implemented at the Woods Hole Oceanographic Institution (WHOI) in 2019 and 2020. Here we provide materials for the 2020 camp which was delivered in-person during two afternoons (total of 8 hours), with two modules per afternoon. We aimed for ~40 participants per camp, with disciplines spanning Earth and life sciences and engineering. Disciplines at each table were mixed on the first afternoon but similar on the second afternoon. Contents of this package include the syllabus and slide presentations for each of the four modules: 1 "Good enough practices in scientific computing," 2 Data management, 3 Software development and research computing, and 4 Best practices in the ocean sciences. The 3rd module is split into two parts. We also include a poster presented at the 2020 Ocean Science Meeting, which has some results from pre- and post-surveys. Funding: The camp was funded by WHOI Academic Programs Office through a Doherty Chair in Education Award, with additional support from WHOI Ocean Informatics Working Group, WHOI Information Services, MBLWHOI Library, the NSF-funded Biological and Chemical Oceanography Data Management Office (BCO-DMO), and an NSF-funded XSEDE Jetstream Education Allocation TG-OCE190011. We also utilized resources from the NSF-funded Pangeo project.