Identifying reference genes with stable expression from high throughput sequence data
Jenkins, Bethany D.
Rynearson, Tatiana A.
Saito, Mak A.
Mercier, Melissa L.
Dyhrman, Sonya T.
MetadataShow full item record
KeywordThalassiosira pseudonana; Diatom; Phytoplankton; Housekeeping genes; RT-qPCR; Transcriptome; Relative gene expression; Reference gene
Genes that are constitutively expressed across multiple environmental stimuli are crucial to quantifying differentially expressed genes, particularly when employing quantitative reverse transcriptase polymerase chain reaction (RT-qPCR) assays. However, the identification of these potential reference genes in non-model organisms is challenging and is often guided by expression patterns in distantly related organisms. Here, transcriptome datasets from the diatom Thalassiosira pseudonana grown under replete, phosphorus-limited, iron-limited, and phosphorus and iron co-limited nutrient regimes were analyzed through literature-based searches for homologous reference genes, k-means clustering, and analysis of sequence counts (ASC) to identify putative reference genes. A total of 9759 genes were identified and screened for stable expression. Literature-based searches surveyed 18 generally accepted reference genes, revealing 101 homologs in T. pseudonana with variable expression and a wide range of mean tags per million. k-means analysis parsed the whole transcriptome into 15 clusters. The two most stable clusters contained 709 genes, but still had distinct patterns in expression. ASC analyses identified 179 genes that were stably expressed (posterior probability < 0.1 for 1.25 fold change). Genes known to have a stable expression pattern across the test treatments, like actin, were identified in this pool of 179 candidate genes. ASC can be employed on data without biological replicates and was more robust than the k-means approach in isolating genes with stable expression. The intersection of the genes identified through ASC with commonly used reference genes from the literature suggests that actin and ubiquitin ligase may be useful reference genes for T. pseudonana and potentially other diatoms. With the wealth of transcriptome sequence data becoming available, ASC can be easily applied to transcriptome datasets from other phytoplankton to identify reference genes.
© The Author(s), 2012. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in Frontiers in Microbiology 3 (2012): 385, doi:10.3389/fmicb.2012.00385.
Suggested CitationArticle: Alexander, Harriet, Jenkins, Bethany D., Rynearson, Tatiana A., Saito, Mak A., Mercier, Melissa L., Dyhrman, Sonya T., "Identifying reference genes with stable expression from high throughput sequence data", Frontiers in Microbiology 3 (2012): 385, DOI:10.3389/fmicb.2012.00385, https://hdl.handle.net/1912/6986
The following license files are associated with this item:
Showing items related by title, author, creator and subject.
Differential gene expression between fall- and spring-run Chinook salmon assessed by long serial analysis of gene expression Bernier, Jeremiah C.; Birkeland, Shanda R.; Cipriano, Michael J.; McArthur, Andrew G.; Banks, Michael A. (American Fisheries Society, 2008-09-15)Of all Pacific salmonids, Chinook salmon Oncorhynchus tshawytscha display the greatest variability in return times to freshwater. The molecular mechanisms of these differential return times have not been well described. ...
Watkins, William A.; Daher, Mary Ann; Haley, Nancy J. (Woods Hole Oceanographic Institution, 1990-06)This documentation for the CETACEA database of marine mammal literature references updates and expands the original work by Watkins, Bird, Moore, and Tyack 1988 (Reference Database Marine Mammal Literature, Technical ...