Linking pangenomes and metagenomes: the Prochlorococcus metapangenome
Keyword: Comparative genomics; Metagenomics; Microbial ecology; Metapangenomics; anvi’o; Hypervariable genomic islands; Sugar metabolism; Pangenomics; TARA Oceans
Pangenomes offer detailed characterizations of core and accessory genes found in a set of closely related microbial genomes, generally by clustering genes based on sequence homology. In comparison, metagenomes facilitate highly resolved investigations of the relative distribution of microbial genomes and individual genes across environments through read recruitment analyses. Combining these complementary approaches can yield unique insights into the functional basis of microbial niche partitioning and fitness; however, advanced software solutions are lacking. Here we present an integrated analysis and visualization strategy that provides an interactive and reproducible framework to generate pangenomes and to study them in conjunction with metagenomes. To investigate its utility, we applied this strategy to a Prochlorococcus pangenome in the context of a large-scale marine metagenomic survey. The resulting Prochlorococcus metapangenome revealed remarkable differential abundance patterns between very closely related isolates that belonged to the same phylogenetic cluster and that differed by only a small number of gene clusters in the pangenome. While the relationships between these genomes based on gene clusters correlated with their environmental distribution patterns, phylogenetic analyses using marker genes or concatenated single-copy core genes did not recapitulate these patterns. The metapangenome also revealed a small set of core genes that mostly occurred in hypervariable genomic islands of the Prochlorococcus populations, which systematically lacked read recruitment from surface ocean metagenomes. Notably, these core gene clusters were all linked to sugar metabolism, suggesting potential benefits to Prochlorococcus from a high sequence diversity of sugar metabolism genes.
The rapidly growing number of microbial genomes and increasing availability of environmental metagenomes provide new opportunities to investigate the functioning and the ecology of microbial populations, and metapangenomes can provide unique insights for any taxon and biome for which genomic and sufficiently deep metagenomic data are available.
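As an illustration of the two ideas the abstract combines, the following is a minimal sketch, not the authors' method and not the anvi’o API. It assumes a toy pangenome given as a mapping from gene cluster name to the set of genomes that contain it, and a toy table of mean metagenomic read-recruitment coverage per gene cluster. All names (`classify_gene_clusters`, `core_without_recruitment`, `GC_1`, the coverage threshold) and all values are hypothetical.

```python
from typing import Dict, Set, Tuple


def classify_gene_clusters(
    clusters: Dict[str, Set[str]], genomes: Set[str]
) -> Tuple[Set[str], Set[str]]:
    """Split gene clusters into core (found in every genome) and accessory."""
    core = {name for name, members in clusters.items() if members >= genomes}
    accessory = set(clusters) - core
    return core, accessory


def core_without_recruitment(
    core: Set[str], mean_coverage: Dict[str, float], threshold: float = 1.0
) -> Set[str]:
    """Return core clusters whose mean coverage falls below the threshold.

    A cluster missing from the coverage table is treated as having zero coverage.
    """
    return {c for c in core if mean_coverage.get(c, 0.0) < threshold}


# Toy data (hypothetical): three genomes, three gene clusters.
genomes = {"genome_A", "genome_B", "genome_C"}
clusters = {
    "GC_1": {"genome_A", "genome_B", "genome_C"},
    "GC_2": {"genome_A", "genome_B"},
    "GC_3": {"genome_A", "genome_B", "genome_C"},
}
# Hypothetical mean coverage of each core cluster across metagenomes.
mean_coverage = {"GC_1": 25.0, "GC_3": 0.2}

core, accessory = classify_gene_clusters(clusters, genomes)
poorly_recruited = core_without_recruitment(core, mean_coverage)
```

In this toy case, `GC_3` is a core gene cluster that nonetheless recruits almost no reads, which is the kind of pattern the abstract reports for the sugar metabolism genes in hypervariable genomic islands. A real analysis would derive both tables from homology clustering and read mapping rather than from hand-written dictionaries.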
© The Author(s), 2018. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in PeerJ 6 (2018): e4320, doi:10.7717/peerj.4320.
Suggested Citation: PeerJ 6 (2018): e4320