Cargando…

Improved moderation for gene-wise variance estimation in RNA-Seq via the exploitation of external information

BACKGROUND: The cost of RNA-Seq has been decreasing over the last few years. Despite this, experiments with four or less biological replicates are still quite common. Estimating the variances of gene expression estimates becomes both a challenging and interesting problem in these situations of low r...

Descripción completa

Detalles Bibliográficos
Autores principales: Patrick, Ellis, Buckley, Michael, Lin, David Ming, Yang, Yee Hwa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3549800/
https://www.ncbi.nlm.nih.gov/pubmed/23368783
http://dx.doi.org/10.1186/1471-2164-14-S1-S9
_version_ 1782256472877432832
author Patrick, Ellis
Buckley, Michael
Lin, David Ming
Yang, Yee Hwa
author_facet Patrick, Ellis
Buckley, Michael
Lin, David Ming
Yang, Yee Hwa
author_sort Patrick, Ellis
collection PubMed
description BACKGROUND: The cost of RNA-Seq has been decreasing over the last few years. Despite this, experiments with four or less biological replicates are still quite common. Estimating the variances of gene expression estimates becomes both a challenging and interesting problem in these situations of low replication. However, with the wealth of microarray and other publicly available gene expression data readily accessible on public repositories, these sources of information can be leveraged to make improvements in variance estimation. RESULTS: We have proposed a novel approach called Tshrink+ for inferring differential gene expression through improved modelling of the gene-wise variances. Existing methods share information between genes of similar average expression by shrinking, or moderating, the gene-wise variances to a fitted common variance. We have been able to achieve improved estimation of the common variance by using gene-wise sample variances from external experiments, as well as gene length. CONCLUSIONS: Using biological data we show that utilising additional external information can improve the modelling of the common variance and hence the calling of differentially expressed genes. These sources of additional information include gene length and gene-wise sample variances from other RNA-Seq and microarray datasets, of both related and seemingly unrelated tissue types. The results of this are promising, with our differential expression test, Tshrink+, performing favourably when compared to existing methods such as DESeq and edgeR when considering both gene ranking and sensitivity. These improved variance models could easily be implemented in both DESeq and edgeR and highlight the need for a database that offers a profile of gene variances over a range of tissue types and organisms.
format Online
Article
Text
id pubmed-3549800
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35498002013-01-23 Improved moderation for gene-wise variance estimation in RNA-Seq via the exploitation of external information Patrick, Ellis Buckley, Michael Lin, David Ming Yang, Yee Hwa BMC Genomics Proceedings BACKGROUND: The cost of RNA-Seq has been decreasing over the last few years. Despite this, experiments with four or less biological replicates are still quite common. Estimating the variances of gene expression estimates becomes both a challenging and interesting problem in these situations of low replication. However, with the wealth of microarray and other publicly available gene expression data readily accessible on public repositories, these sources of information can be leveraged to make improvements in variance estimation. RESULTS: We have proposed a novel approach called Tshrink+ for inferring differential gene expression through improved modelling of the gene-wise variances. Existing methods share information between genes of similar average expression by shrinking, or moderating, the gene-wise variances to a fitted common variance. We have been able to achieve improved estimation of the common variance by using gene-wise sample variances from external experiments, as well as gene length. CONCLUSIONS: Using biological data we show that utilising additional external information can improve the modelling of the common variance and hence the calling of differentially expressed genes. These sources of additional information include gene length and gene-wise sample variances from other RNA-Seq and microarray datasets, of both related and seemingly unrelated tissue types. The results of this are promising, with our differential expression test, Tshrink+, performing favourably when compared to existing methods such as DESeq and edgeR when considering both gene ranking and sensitivity. These improved variance models could easily be implemented in both DESeq and edgeR and highlight the need for a database that offers a profile of gene variances over a range of tissue types and organisms. BioMed Central 2013-01-21 /pmc/articles/PMC3549800/ /pubmed/23368783 http://dx.doi.org/10.1186/1471-2164-14-S1-S9 Text en Copyright ©2013 Patrick et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Patrick, Ellis
Buckley, Michael
Lin, David Ming
Yang, Yee Hwa
Improved moderation for gene-wise variance estimation in RNA-Seq via the exploitation of external information
title Improved moderation for gene-wise variance estimation in RNA-Seq via the exploitation of external information
title_full Improved moderation for gene-wise variance estimation in RNA-Seq via the exploitation of external information
title_fullStr Improved moderation for gene-wise variance estimation in RNA-Seq via the exploitation of external information
title_full_unstemmed Improved moderation for gene-wise variance estimation in RNA-Seq via the exploitation of external information
title_short Improved moderation for gene-wise variance estimation in RNA-Seq via the exploitation of external information
title_sort improved moderation for gene-wise variance estimation in rna-seq via the exploitation of external information
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3549800/
https://www.ncbi.nlm.nih.gov/pubmed/23368783
http://dx.doi.org/10.1186/1471-2164-14-S1-S9
work_keys_str_mv AT patrickellis improvedmoderationforgenewisevarianceestimationinrnaseqviatheexploitationofexternalinformation
AT buckleymichael improvedmoderationforgenewisevarianceestimationinrnaseqviatheexploitationofexternalinformation
AT lindavidming improvedmoderationforgenewisevarianceestimationinrnaseqviatheexploitationofexternalinformation
AT yangyeehwa improvedmoderationforgenewisevarianceestimationinrnaseqviatheexploitationofexternalinformation