Keeping it light: (re)analyzing community-wide datasets without major infrastructure
Date
2018-12-13
Authors
Alexander, Harriet
Johnson, Lisa K
Brown, C. Titus
DOI
10.1093/gigascience/giy159
Keywords
reproducibility
data reuse
open data
Abstract
DNA sequencing technology has revolutionized the field of biology, shifting biology from a data-limited to data-rich state. Central to the interpretation of sequencing data are the computational tools and approaches that convert raw data into biologically meaningful information. Both the tools and the generation of data are actively evolving, yet the practice of re-analysis of previously generated data with new tools is not commonplace. Re-analysis of existing data provides an affordable means of generating new information and will likely become more routine within biology, yet necessitates a new set of considerations for best practices and resource development. Here, we discuss several practices that we believe to be broadly applicable when re-analyzing data, especially when done by small research groups.
Description
© The Author(s), 2019. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in Alexander, H., Johnson, L. K., & Brown, C. T. Keeping it light: (re)analyzing community-wide datasets without major infrastructure. Gigascience, 8(2), (2019): giy159, doi:10.1093/gigascience/giy159.
Citation
Alexander, H., Johnson, L. K., & Brown, C. T. (2019). Keeping it light: (re)analyzing community-wide datasets without major infrastructure. Gigascience, 8(2), giy159.