Cargando…
Identifying reference genes with stable expression from high throughput sequence data
Genes that are constitutively expressed across multiple environmental stimuli are crucial to quantifying differentially expressed genes, particularly when employing quantitative reverse transcriptase polymerase chain reaction (RT-qPCR) assays. However, the identification of these potential reference...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3494082/ https://www.ncbi.nlm.nih.gov/pubmed/23162540 http://dx.doi.org/10.3389/fmicb.2012.00385 |
_version_ | 1782249358789443584 |
---|---|
author | Alexander, Harriet Jenkins, Bethany D. Rynearson, Tatiana A. Saito, Mak A. Mercier, Melissa L. Dyhrman, Sonya T. |
author_facet | Alexander, Harriet Jenkins, Bethany D. Rynearson, Tatiana A. Saito, Mak A. Mercier, Melissa L. Dyhrman, Sonya T. |
author_sort | Alexander, Harriet |
collection | PubMed |
description | Genes that are constitutively expressed across multiple environmental stimuli are crucial to quantifying differentially expressed genes, particularly when employing quantitative reverse transcriptase polymerase chain reaction (RT-qPCR) assays. However, the identification of these potential reference genes in non-model organisms is challenging and is often guided by expression patterns in distantly related organisms. Here, transcriptome datasets from the diatom Thalassiosira pseudonana grown under replete, phosphorus-limited, iron-limited, and phosphorus and iron co-limited nutrient regimes were analyzed through literature-based searches for homologous reference genes, k-means clustering, and analysis of sequence counts (ASC) to identify putative reference genes. A total of 9759 genes were identified and screened for stable expression. Literature-based searches surveyed 18 generally accepted reference genes, revealing 101 homologs in T. pseudonana with variable expression and a wide range of mean tags per million. k-means analysis parsed the whole transcriptome into 15 clusters. The two most stable clusters contained 709 genes, but still had distinct patterns in expression. ASC analyses identified 179 genes that were stably expressed (posterior probability < 0.1 for 1.25 fold change). Genes known to have a stable expression pattern across the test treatments, like actin, were identified in this pool of 179 candidate genes. ASC can be employed on data without biological replicates and was more robust than the k-means approach in isolating genes with stable expression. The intersection of the genes identified through ASC with commonly used reference genes from the literature suggests that actin and ubiquitin ligase may be useful reference genes for T. pseudonana and potentially other diatoms. With the wealth of transcriptome sequence data becoming available, ASC can be easily applied to transcriptome datasets from other phytoplankton to identify reference genes. |
format | Online Article Text |
id | pubmed-3494082 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-34940822012-11-16 Identifying reference genes with stable expression from high throughput sequence data Alexander, Harriet Jenkins, Bethany D. Rynearson, Tatiana A. Saito, Mak A. Mercier, Melissa L. Dyhrman, Sonya T. Front Microbiol Microbiology Genes that are constitutively expressed across multiple environmental stimuli are crucial to quantifying differentially expressed genes, particularly when employing quantitative reverse transcriptase polymerase chain reaction (RT-qPCR) assays. However, the identification of these potential reference genes in non-model organisms is challenging and is often guided by expression patterns in distantly related organisms. Here, transcriptome datasets from the diatom Thalassiosira pseudonana grown under replete, phosphorus-limited, iron-limited, and phosphorus and iron co-limited nutrient regimes were analyzed through literature-based searches for homologous reference genes, k-means clustering, and analysis of sequence counts (ASC) to identify putative reference genes. A total of 9759 genes were identified and screened for stable expression. Literature-based searches surveyed 18 generally accepted reference genes, revealing 101 homologs in T. pseudonana with variable expression and a wide range of mean tags per million. k-means analysis parsed the whole transcriptome into 15 clusters. The two most stable clusters contained 709 genes, but still had distinct patterns in expression. ASC analyses identified 179 genes that were stably expressed (posterior probability < 0.1 for 1.25 fold change). Genes known to have a stable expression pattern across the test treatments, like actin, were identified in this pool of 179 candidate genes. ASC can be employed on data without biological replicates and was more robust than the k-means approach in isolating genes with stable expression. The intersection of the genes identified through ASC with commonly used reference genes from the literature suggests that actin and ubiquitin ligase may be useful reference genes for T. pseudonana and potentially other diatoms. With the wealth of transcriptome sequence data becoming available, ASC can be easily applied to transcriptome datasets from other phytoplankton to identify reference genes. Frontiers Media S.A. 2012-11-09 /pmc/articles/PMC3494082/ /pubmed/23162540 http://dx.doi.org/10.3389/fmicb.2012.00385 Text en Copyright © 2012 Alexander, Jenkins, Rynearson, Saito, Mercier and Dyhrman. http://www.frontiersin.org/licenseagreement This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc. |
spellingShingle | Microbiology Alexander, Harriet Jenkins, Bethany D. Rynearson, Tatiana A. Saito, Mak A. Mercier, Melissa L. Dyhrman, Sonya T. Identifying reference genes with stable expression from high throughput sequence data |
title | Identifying reference genes with stable expression from high throughput sequence data |
title_full | Identifying reference genes with stable expression from high throughput sequence data |
title_fullStr | Identifying reference genes with stable expression from high throughput sequence data |
title_full_unstemmed | Identifying reference genes with stable expression from high throughput sequence data |
title_short | Identifying reference genes with stable expression from high throughput sequence data |
title_sort | identifying reference genes with stable expression from high throughput sequence data |
topic | Microbiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3494082/ https://www.ncbi.nlm.nih.gov/pubmed/23162540 http://dx.doi.org/10.3389/fmicb.2012.00385 |
work_keys_str_mv | AT alexanderharriet identifyingreferencegeneswithstableexpressionfromhighthroughputsequencedata AT jenkinsbethanyd identifyingreferencegeneswithstableexpressionfromhighthroughputsequencedata AT rynearsontatianaa identifyingreferencegeneswithstableexpressionfromhighthroughputsequencedata AT saitomaka identifyingreferencegeneswithstableexpressionfromhighthroughputsequencedata AT merciermelissal identifyingreferencegeneswithstableexpressionfromhighthroughputsequencedata AT dyhrmansonyat identifyingreferencegeneswithstableexpressionfromhighthroughputsequencedata |