Cargando…

Identifying reference genes with stable expression from high throughput sequence data

Genes that are constitutively expressed across multiple environmental stimuli are crucial to quantifying differentially expressed genes, particularly when employing quantitative reverse transcriptase polymerase chain reaction (RT-qPCR) assays. However, the identification of these potential reference...

Descripción completa

Detalles Bibliográficos
Autores principales: Alexander, Harriet, Jenkins, Bethany D., Rynearson, Tatiana A., Saito, Mak A., Mercier, Melissa L., Dyhrman, Sonya T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3494082/
https://www.ncbi.nlm.nih.gov/pubmed/23162540
http://dx.doi.org/10.3389/fmicb.2012.00385
_version_ 1782249358789443584
author Alexander, Harriet
Jenkins, Bethany D.
Rynearson, Tatiana A.
Saito, Mak A.
Mercier, Melissa L.
Dyhrman, Sonya T.
author_facet Alexander, Harriet
Jenkins, Bethany D.
Rynearson, Tatiana A.
Saito, Mak A.
Mercier, Melissa L.
Dyhrman, Sonya T.
author_sort Alexander, Harriet
collection PubMed
description Genes that are constitutively expressed across multiple environmental stimuli are crucial to quantifying differentially expressed genes, particularly when employing quantitative reverse transcriptase polymerase chain reaction (RT-qPCR) assays. However, the identification of these potential reference genes in non-model organisms is challenging and is often guided by expression patterns in distantly related organisms. Here, transcriptome datasets from the diatom Thalassiosira pseudonana grown under replete, phosphorus-limited, iron-limited, and phosphorus and iron co-limited nutrient regimes were analyzed through literature-based searches for homologous reference genes, k-means clustering, and analysis of sequence counts (ASC) to identify putative reference genes. A total of 9759 genes were identified and screened for stable expression. Literature-based searches surveyed 18 generally accepted reference genes, revealing 101 homologs in T. pseudonana with variable expression and a wide range of mean tags per million. k-means analysis parsed the whole transcriptome into 15 clusters. The two most stable clusters contained 709 genes, but still had distinct patterns in expression. ASC analyses identified 179 genes that were stably expressed (posterior probability < 0.1 for 1.25 fold change). Genes known to have a stable expression pattern across the test treatments, like actin, were identified in this pool of 179 candidate genes. ASC can be employed on data without biological replicates and was more robust than the k-means approach in isolating genes with stable expression. The intersection of the genes identified through ASC with commonly used reference genes from the literature suggests that actin and ubiquitin ligase may be useful reference genes for T. pseudonana and potentially other diatoms. With the wealth of transcriptome sequence data becoming available, ASC can be easily applied to transcriptome datasets from other phytoplankton to identify reference genes.
format Online
Article
Text
id pubmed-3494082
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-34940822012-11-16 Identifying reference genes with stable expression from high throughput sequence data Alexander, Harriet Jenkins, Bethany D. Rynearson, Tatiana A. Saito, Mak A. Mercier, Melissa L. Dyhrman, Sonya T. Front Microbiol Microbiology Genes that are constitutively expressed across multiple environmental stimuli are crucial to quantifying differentially expressed genes, particularly when employing quantitative reverse transcriptase polymerase chain reaction (RT-qPCR) assays. However, the identification of these potential reference genes in non-model organisms is challenging and is often guided by expression patterns in distantly related organisms. Here, transcriptome datasets from the diatom Thalassiosira pseudonana grown under replete, phosphorus-limited, iron-limited, and phosphorus and iron co-limited nutrient regimes were analyzed through literature-based searches for homologous reference genes, k-means clustering, and analysis of sequence counts (ASC) to identify putative reference genes. A total of 9759 genes were identified and screened for stable expression. Literature-based searches surveyed 18 generally accepted reference genes, revealing 101 homologs in T. pseudonana with variable expression and a wide range of mean tags per million. k-means analysis parsed the whole transcriptome into 15 clusters. The two most stable clusters contained 709 genes, but still had distinct patterns in expression. ASC analyses identified 179 genes that were stably expressed (posterior probability < 0.1 for 1.25 fold change). Genes known to have a stable expression pattern across the test treatments, like actin, were identified in this pool of 179 candidate genes. ASC can be employed on data without biological replicates and was more robust than the k-means approach in isolating genes with stable expression. The intersection of the genes identified through ASC with commonly used reference genes from the literature suggests that actin and ubiquitin ligase may be useful reference genes for T. pseudonana and potentially other diatoms. With the wealth of transcriptome sequence data becoming available, ASC can be easily applied to transcriptome datasets from other phytoplankton to identify reference genes. Frontiers Media S.A. 2012-11-09 /pmc/articles/PMC3494082/ /pubmed/23162540 http://dx.doi.org/10.3389/fmicb.2012.00385 Text en Copyright © 2012 Alexander, Jenkins, Rynearson, Saito, Mercier and Dyhrman. http://www.frontiersin.org/licenseagreement This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.
spellingShingle Microbiology
Alexander, Harriet
Jenkins, Bethany D.
Rynearson, Tatiana A.
Saito, Mak A.
Mercier, Melissa L.
Dyhrman, Sonya T.
Identifying reference genes with stable expression from high throughput sequence data
title Identifying reference genes with stable expression from high throughput sequence data
title_full Identifying reference genes with stable expression from high throughput sequence data
title_fullStr Identifying reference genes with stable expression from high throughput sequence data
title_full_unstemmed Identifying reference genes with stable expression from high throughput sequence data
title_short Identifying reference genes with stable expression from high throughput sequence data
title_sort identifying reference genes with stable expression from high throughput sequence data
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3494082/
https://www.ncbi.nlm.nih.gov/pubmed/23162540
http://dx.doi.org/10.3389/fmicb.2012.00385
work_keys_str_mv AT alexanderharriet identifyingreferencegeneswithstableexpressionfromhighthroughputsequencedata
AT jenkinsbethanyd identifyingreferencegeneswithstableexpressionfromhighthroughputsequencedata
AT rynearsontatianaa identifyingreferencegeneswithstableexpressionfromhighthroughputsequencedata
AT saitomaka identifyingreferencegeneswithstableexpressionfromhighthroughputsequencedata
AT merciermelissal identifyingreferencegeneswithstableexpressionfromhighthroughputsequencedata
AT dyhrmansonyat identifyingreferencegeneswithstableexpressionfromhighthroughputsequencedata