DRISEE overestimates errors in metagenomic sequencing data
Eren, A. Murat
Morrison, Hilary G.
Huse, Susan M.
Sogin, Mitchell L.
The extremely high error rates reported by Keegan et al. in ‘A platform-independent method for detecting errors in metagenomic sequencing data: DRISEE’ (PLoS Comput Biol 2012;8:e1002541) for many next-generation sequencing datasets prompted us to re-examine their results. Our analysis reveals that the presence of conserved artificial sequences, e.g. Illumina adapters, and other naturally occurring sequence motifs accounts for most of the reported errors. We conclude that DRISEE reports inflated levels of sequencing error, particularly for Illumina data. Tools offered for evaluating large datasets need scrupulous review before they are implemented.
© The Author(s), 2013. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in Briefings in Bioinformatics 15 (2014): 783-787, doi:10.1093/bib/bbt010.
Except where otherwise noted, this item's license is described as Attribution-NonCommercial 3.0 Unported