Identifying contamination with advanced visualization and analysis practices : metagenomic approaches for eukaryotic genome assemblies
Example uses of single-copy genes to estimate the number of bacterial genomes in an assembly (852.2Kb)
Visualization and curation of the raw tardigrade genome assembly from Koutsovoulos et al. (2016) (1.751Mb)
Summary of H. dujardini and bacterial genomes identified from the raw assembly results of Boothby et al. (2015) and Koutsovoulos et al. (2016) (31.29Kb)
Summary of functions identified by RAST in the bacterial draft genome #2 (selection #3 in Fig. 1) (373.8Kb)
MetadataShow full item record
High-throughput sequencing provides a fast and cost-effective mean to recover genomes of organisms from all domains of life. However, adequate curation of the assembly results against potential contamination of non-target organisms requires advanced bioinformatics approaches and practices. Here, we re-analyzed the sequencing data generated for the tardigrade Hypsibius dujardini, and created a holistic display of the eukaryotic genome assembly using DNA data originating from two groups and eleven sequencing libraries. By using bacterial single-copy genes, k-mer frequencies, and coverage values of scaffolds we could identify and characterize multiple near-complete bacterial genomes from the raw assembly, and curate a 182 Mbp draft genome for H. dujardini supported by RNA-Seq data. Our results indicate that most contaminant scaffolds were assembled from Moleculo long-read libraries, and most of these contaminants have differed between library preparations. Our re-analysis shows that visualization and curation of eukaryotic genome assemblies can benefit from tools designed to address the needs of today’s microbiologists, who are constantly challenged by the difficulties associated with the identification of distinct microbial genomes in complex environmental metagenomes.
© The Author(s), 2016. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in PeerJ 4 (2016): e1839, doi:10.7717/peerj.1839.
Suggested CitationPeerJ 4 (2016): e1839
The following license files are associated with this item:
Showing items related by title, author, creator and subject.
New biological insights into how deforestation in Amazonia affects soil microbial communities using metagenomics and metagenome-assembled genomes Kroeger, Marie E.; Delmont, Tom O.; Eren, A. Murat; Meyer, Kyle M.; Guo, Jiarong; Khan, Kiran; Rodrigues, Jorge L. M.; Bohannan, Brendan J. M.; Tringe, Susannah G.; Borges, Clovis D.; Tiedje, James M.; Tsai, Siu M.; Nüsslein, Klaus (Frontiers Media, 2018-07-23)Deforestation in the Brazilian Amazon occurs at an alarming rate, which has broad effects on global greenhouse gas emissions, carbon storage, and biogeochemical cycles. In this study, soil metagenomes and metagenome-assembled ...
Sharko, Fedor S.; Nedoluzhko, Artem V.; Lê, Brandon M.; Tsygankova, Svetlana V.; Boulygina, Eugenia S.; Rastorguev, Sergey M.; Sokolov, Alexey S.; Rodriguez, Fernando; Mazur, Alexander M.; Polilov, Alexey A.; Benton, Richard; Evgen'ev, Michael B.; Arkhipova, Irina R.; Prokhortchouk, Egor B.; Skryabin, Konstantin G. (Public Library of Science, 2019-12-23)Body size reduction, also known as miniaturization, is an important evolutionary process that affects a number of physiological and phenotypic traits and helps animals conquer new ecological niches. However, this process ...
Groen, Aaron C.; Maresca, Thomas J.; Gatlin, Jesse C.; Salmon, Edward D.; Mitchison, Timothy J. (American Society for Cell Biology, 2009-04-15)Distinct pathways from centrosomes and chromatin are thought to contribute in parallel to microtubule nucleation and stabilization during animal cell mitotic spindle assembly, but their full mechanisms are not known. We ...