Cargando…

baySeq: Empirical Bayesian methods for identifying differential expression in sequence count data

BACKGROUND: High throughput sequencing has become an important technology for studying expression levels in many types of genomic, and particularly transcriptomic, data. One key way of analysing such data is to look for elements of the data which display particular patterns of differential expressio...

Descripción completa

Detalles Bibliográficos
Autores principales: Hardcastle, Thomas J, Kelly, Krystyna A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2928208/
https://www.ncbi.nlm.nih.gov/pubmed/20698981
http://dx.doi.org/10.1186/1471-2105-11-422
_version_ 1782185837670170624
author Hardcastle, Thomas J
Kelly, Krystyna A
author_facet Hardcastle, Thomas J
Kelly, Krystyna A
author_sort Hardcastle, Thomas J
collection PubMed
description BACKGROUND: High throughput sequencing has become an important technology for studying expression levels in many types of genomic, and particularly transcriptomic, data. One key way of analysing such data is to look for elements of the data which display particular patterns of differential expression in order to take these forward for further analysis and validation. RESULTS: We propose a framework for defining patterns of differential expression and develop a novel algorithm, baySeq, which uses an empirical Bayes approach to detect these patterns of differential expression within a set of sequencing samples. The method assumes a negative binomial distribution for the data and derives an empirically determined prior distribution from the entire dataset. We examine the performance of the method on real and simulated data. CONCLUSIONS: Our method performs at least as well, and often better, than existing methods for analyses of pairwise differential expression in both real and simulated data. When we compare methods for the analysis of data from experimental designs involving multiple sample groups, our method again shows substantial gains in performance. We believe that this approach thus represents an important step forward for the analysis of count data from sequencing experiments.
format Text
id pubmed-2928208
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29282082010-08-26 baySeq: Empirical Bayesian methods for identifying differential expression in sequence count data Hardcastle, Thomas J Kelly, Krystyna A BMC Bioinformatics Research Article BACKGROUND: High throughput sequencing has become an important technology for studying expression levels in many types of genomic, and particularly transcriptomic, data. One key way of analysing such data is to look for elements of the data which display particular patterns of differential expression in order to take these forward for further analysis and validation. RESULTS: We propose a framework for defining patterns of differential expression and develop a novel algorithm, baySeq, which uses an empirical Bayes approach to detect these patterns of differential expression within a set of sequencing samples. The method assumes a negative binomial distribution for the data and derives an empirically determined prior distribution from the entire dataset. We examine the performance of the method on real and simulated data. CONCLUSIONS: Our method performs at least as well, and often better, than existing methods for analyses of pairwise differential expression in both real and simulated data. When we compare methods for the analysis of data from experimental designs involving multiple sample groups, our method again shows substantial gains in performance. We believe that this approach thus represents an important step forward for the analysis of count data from sequencing experiments. BioMed Central 2010-08-10 /pmc/articles/PMC2928208/ /pubmed/20698981 http://dx.doi.org/10.1186/1471-2105-11-422 Text en Copyright ©2010 Hardcastle and Kelly; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Hardcastle, Thomas J
Kelly, Krystyna A
baySeq: Empirical Bayesian methods for identifying differential expression in sequence count data
title baySeq: Empirical Bayesian methods for identifying differential expression in sequence count data
title_full baySeq: Empirical Bayesian methods for identifying differential expression in sequence count data
title_fullStr baySeq: Empirical Bayesian methods for identifying differential expression in sequence count data
title_full_unstemmed baySeq: Empirical Bayesian methods for identifying differential expression in sequence count data
title_short baySeq: Empirical Bayesian methods for identifying differential expression in sequence count data
title_sort bayseq: empirical bayesian methods for identifying differential expression in sequence count data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2928208/
https://www.ncbi.nlm.nih.gov/pubmed/20698981
http://dx.doi.org/10.1186/1471-2105-11-422
work_keys_str_mv AT hardcastlethomasj bayseqempiricalbayesianmethodsforidentifyingdifferentialexpressioninsequencecountdata
AT kellykrystynaa bayseqempiricalbayesianmethodsforidentifyingdifferentialexpressioninsequencecountdata