Eukaryotic genomes from a global metagenomic data set illuminate trophic modes and biogeography of ocean plankton

Thumbnail Image
Date
2023-11-10
Authors
Alexander, Harriet
Hu, Sarah K.
Krinos, Arianna I.
Pachiadaki, Maria G.
Tully, Benjamin J.
Neely, Christopher J.
Reiter, Taylor
Alternative Title
Date Created
Location
DOI
10.1128/mbio.01676-23
Related Materials
Replaces
Replaced By
Keywords
Metagenomics
Protists
Genomes
Eukaryotic metagenome-assembled genomes
Abstract
Metagenomics is a powerful method for interpreting the ecological roles and physiological capabilities of mixed microbial communities. Yet, many tools for processing metagenomic data are neither designed to consider eukaryotes nor are they built for an increasing amount of sequence data. EukHeist is an automated pipeline to retrieve eukaryotic and prokaryotic metagenome-assembled genomes (MAGs) from large-scale metagenomic sequence data sets. We developed the EukHeist workflow to specifically process large amounts of both metagenomic and/or metatranscriptomic sequence data in an automated and reproducible fashion. Here, we applied EukHeist to the large-size fraction data (0.8–2,000 µm) from Tara Oceans to recover both eukaryotic and prokaryotic MAGs, which we refer to as TOPAZ (Tara Oceans Particle-Associated MAGs). The TOPAZ MAGs consisted of >900 environmentally relevant eukaryotic MAGs and >4,000 bacterial and archaeal MAGs. The bacterial and archaeal TOPAZ MAGs expand upon the phylogenetic diversity of likely particle- and host-associated taxa. We use these MAGs to demonstrate an approach to infer the putative trophic mode of the recovered eukaryotic MAGs. We also identify ecological cohorts of co-occurring MAGs, which are driven by specific environmental factors and putative host-microbe associations. These data together add to a number of growing resources of environmentally relevant eukaryotic genomic information. Complementary and expanded databases of MAGs, such as those provided through scalable pipelines like EukHeist, stand to advance our understanding of eukaryotic diversity through increased coverage of genomic representatives across the tree of life.
Description
© The Author(s), 2023. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in Alexander, H., Hu, S., Krinos, A., Pachiadaki, M., Tully, B., Neely, C., & Reiter, T. (2023). Eukaryotic genomes from a global metagenomic data set illuminate trophic modes and biogeography of ocean plankton. mBio, e0167623, https://doi.org/10.1128/mbio.01676-23.
Embargo Date
Citation
Alexander, H., Hu, S., Krinos, A., Pachiadaki, M., Tully, B., Neely, C., & Reiter, T. (2023). Eukaryotic genomes from a global metagenomic data set illuminate trophic modes and biogeography of ocean plankton. mBio, e0167623.
Cruises
Cruise ID
Cruise DOI
Vessel Name
Except where otherwise noted, this item's license is described as Attribution 4.0 International