Massively parallel tag sequencing reveals the complexity of anaerobic marine protistan communities

dc.contributor.author Stoeck, Thorsten
dc.contributor.author Behnke, Anke
dc.contributor.author Christen, Richard
dc.contributor.author Amaral-Zettler, Linda A.
dc.contributor.author Rodriguez-Mora, Maria J.
dc.contributor.author Chistoserdov, Andrei Y.
dc.contributor.author Orsi, William D.
dc.contributor.author Edgcomb, Virginia P.
dc.date.accessioned 2009-11-30T16:02:00Z
dc.date.available 2009-11-30T16:02:00Z
dc.date.issued 2009-11-03
dc.description © 2009 The Authors. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in BMC Biology 7 (2009): 72, doi:10.1186/1741-7007-7-72. en_US
dc.description.abstract Recent advances in sequencing strategies make possible unprecedented depth and scale of sampling for molecular detection of microbial diversity. Two major paradigm-shifting discoveries include the detection of bacterial diversity that is one to two orders of magnitude greater than previous estimates, and the discovery of an exciting 'rare biosphere' of molecular signatures ('species') of poorly understood ecological significance. We applied a high-throughput parallel tag sequencing (454 sequencing) protocol adapted for eukaryotes to investigate protistan community complexity in two contrasting anoxic marine ecosystems (Framvaren Fjord, Norway; Cariaco deep-sea basin, Venezuela). Both sampling sites have previously been scrutinized for protistan diversity by traditional clone library construction and Sanger sequencing. By comparing these clone library data with 454 amplicon library data, we assess the efficiency of high-throughput tag sequencing strategies. We here present a novel, highly conservative bioinformatic analysis pipeline for the processing of large tag sequence data sets. The analyses of ca. 250,000 sequence reads revealed that the number of detected Operational Taxonomic Units (OTUs) far exceeded previous richness estimates from the same sites based on clone libraries and Sanger sequencing. More than 90% of this diversity was represented by OTUs with fewer than 10 sequence tags. We detected a substantial number of taxonomic groups, such as Apusozoa, Chrysomerophytes, Centroheliozoa, Eustigmatophytes, hyphochytriomycetes, Ichthyosporea, Oikomonads, Phaeothamniophytes, and rhodophytes, which remained undetected by previous clone library-based diversity surveys of the sampling sites.
The most important innovations in our newly developed bioinformatics pipeline are (i) the use of BLASTN with query parameters adjusted for highly variable domains and a complete database of public ribosomal RNA (rRNA) gene sequences for taxonomic assignments of tags; (ii) a clustering of tags at k differences (Levenshtein distance) with a newly developed algorithm enabling very fast OTU clustering for large tag sequence data sets; and (iii) a novel parsing procedure to combine the data from individual analyses. Our data highlight the magnitude of the under-sampled 'protistan gap' in the eukaryotic tree of life. This study illustrates that our current understanding of the ecological complexity of protist communities, and of the global species richness and genome diversity of protists, is severely limited. Even though 454 pyrosequencing is not a panacea, it allows for more comprehensive insights into the diversity of protistan communities and, combined with appropriate statistical tools, enables improved ecological interpretations of the data and projections of global diversity. en_US
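The idea of clustering tags at k differences (Levenshtein distance), mentioned in the abstract, can be illustrated with a small sketch. This is not the authors' algorithm, which the abstract describes only as newly developed and very fast. It is a naive quadratic-time illustration that assumes single-linkage grouping, meaning that two tags share an OTU whenever a chain of tags connects them with each step at most k edits apart. The function names and the toy tags are hypothetical.

```python
from typing import Dict, List


def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance (insert, delete, substitute)."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def cluster_at_k(tags: List[str], k: int) -> List[List[str]]:
    """Assumed single-linkage grouping: link any two tags within k edits."""
    parent = list(range(len(tags)))

    def find(i: int) -> int:
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(tags)):
        for j in range(i + 1, len(tags)):
            # The length difference is a lower bound on the edit distance,
            # so pairs that differ by more than k in length can be skipped.
            if abs(len(tags[i]) - len(tags[j])) > k:
                continue
            if levenshtein(tags[i], tags[j]) <= k:
                parent[find(i)] = find(j)

    groups: Dict[int, List[str]] = {}
    for i, tag in enumerate(tags):
        groups.setdefault(find(i), []).append(tag)
    return list(groups.values())


if __name__ == "__main__":
    # Hypothetical toy tags, far shorter than real sequence tags.
    toy_tags = ["ACGTACGT", "ACGTACGA", "ACGTTCGA", "TTTTGGGG", "TTTTGGGC"]
    for otu in cluster_at_k(toy_tags, k=1):
        print(otu)
```

Comparing every pair of tags is impractical for a data set of roughly 250,000 reads, which is presumably why the abstract stresses a dedicated fast algorithm. The sketch only shows what "k differences" means for OTU membership.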
dc.description.sponsorship The International Census of Marine Microbes and the W.M. Keck Foundation award to the Marine Biological Laboratory at Woods Hole (MA) supported the pyrosequencing part of this study. Further financial support came from a grant from the Deutsche Forschungsgemeinschaft to TS (STO414/3-1). Support for the unpublished work on Cariaco Basin protists came from NSF MCB-0348407 to VE (collaborative project with S Epstein at Northeastern University, Boston, MA, USA). Financial support to AC was provided by NSF MCB-0348045. Financial support to RC was provided by the ANR-Biodiversité project Aquaparadox. en_US
dc.format.mimetype application/pdf
dc.identifier.citation BMC Biology 7 (2009): 72 en_US
dc.identifier.doi 10.1186/1741-7007-7-72
dc.identifier.uri https://hdl.handle.net/1912/3082
dc.language.iso en en_US
dc.publisher BioMed Central en_US
dc.relation.uri https://doi.org/10.1186/1741-7007-7-72
dc.rights Attribution 2.0 Generic *
dc.rights.uri http://creativecommons.org/licenses/by/2.0 *
dc.title Massively parallel tag sequencing reveals the complexity of anaerobic marine protistan communities en_US
dc.type Article en_US
dspace.entity.type Publication
relation.isAuthorOfPublication 38be6ef1-043e-4d3c-bd9c-9ef5ffcd78b7
relation.isAuthorOfPublication 1bce92ae-a0b2-49ec-b0e3-83308ed7dc3e
relation.isAuthorOfPublication f32f672b-b054-4aaf-9f35-9ff0654d6c87
relation.isAuthorOfPublication b9072c87-5485-4987-9309-02751e6951d3
relation.isAuthorOfPublication 38b57316-7db1-4f6d-a938-11b1d55bf857
relation.isAuthorOfPublication a82c501e-a0f0-41b5-bc8c-f62cbfaf1483
relation.isAuthorOfPublication b73a0f8b-0d39-407f-a532-aa3c4f7569fa
relation.isAuthorOfPublication a8b5a5de-457b-4a1e-a069-cf078d133a07
relation.isAuthorOfPublication.latestForDiscovery 38be6ef1-043e-4d3c-bd9c-9ef5ffcd78b7
Files
Original bundle
Name:
1741-7007-7-72.pdf
Size:
2.4 MB
Format:
Adobe Portable Document Format
Description:
Article
Name:
1741-7007-7-72-s1.pdf
Size:
937.17 KB
Format:
Adobe Portable Document Format
Description:
Figure S1: Scanning electron micrograph of an unidentified ciliate isolated from anoxic, sulfidic waters of the Cariaco Basin.
Name:
1741-7007-7-72-s2.pdf
Size:
17.91 KB
Format:
Adobe Portable Document Format
Description:
Table S1: Taxonomy and proportion of abundant metazoan operational taxonomic units.
Name:
1741-7007-7-72-s3.pdf
Size:
81.85 KB
Format:
Adobe Portable Document Format
Description:
Figure S2: Numbers of unique metazoan operational taxonomic units.
Name:
1741-7007-7-72-s4.pdf
Size:
42.44 KB
Format:
Adobe Portable Document Format
Description:
Table S2: Relative contribution of metazoan operational taxonomic units to total eukaryote operational taxonomic units.
License bundle
Name:
license.txt
Size:
1.97 KB
Format:
Item-specific license agreed upon to submission