Cargando…

Empirical Bayesian analysis of paired high-throughput sequencing data with a beta-binomial distribution

BACKGROUND: Pairing of samples arises naturally in many genomic experiments; for example, gene expression in tumour and normal tissue from the same patients. Methods for analysing high-throughput sequencing data from such experiments are required to identify differential expression, both within pair...

Descripción completa

Detalles Bibliográficos
Autores principales: Hardcastle, Thomas J, Kelly, Krystyna A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3658937/
https://www.ncbi.nlm.nih.gov/pubmed/23617841
http://dx.doi.org/10.1186/1471-2105-14-135
_version_ 1782270366261968896
author Hardcastle, Thomas J
Kelly, Krystyna A
author_facet Hardcastle, Thomas J
Kelly, Krystyna A
author_sort Hardcastle, Thomas J
collection PubMed
description BACKGROUND: Pairing of samples arises naturally in many genomic experiments; for example, gene expression in tumour and normal tissue from the same patients. Methods for analysing high-throughput sequencing data from such experiments are required to identify differential expression, both within paired samples and between pairs under different experimental conditions. RESULTS: We develop an empirical Bayesian method based on the beta-binomial distribution to model paired data from high-throughput sequencing experiments. We examine the performance of this method on simulated and real data in a variety of scenarios. Our methods are implemented as part of the RbaySeq package (versions 1.11.6 and greater) available from Bioconductor (http://www.bioconductor.org). CONCLUSIONS: We compare our approach to alternatives based on generalised linear modelling approaches and show that our method offers significant gains in performance on simulated data. In testing on real data from oral squamous cell carcinoma patients, we discover greater enrichment of previously identified head and neck squamous cell carcinoma associated gene sets than has previously been achieved through a generalised linear modelling approach, suggesting that similar gains in performance may be found in real data. Our methods thus show real and substantial improvements in analyses of high-throughput sequencing data from paired samples.
format Online
Article
Text
id pubmed-3658937
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36589372013-05-23 Empirical Bayesian analysis of paired high-throughput sequencing data with a beta-binomial distribution Hardcastle, Thomas J Kelly, Krystyna A BMC Bioinformatics Research Article BACKGROUND: Pairing of samples arises naturally in many genomic experiments; for example, gene expression in tumour and normal tissue from the same patients. Methods for analysing high-throughput sequencing data from such experiments are required to identify differential expression, both within paired samples and between pairs under different experimental conditions. RESULTS: We develop an empirical Bayesian method based on the beta-binomial distribution to model paired data from high-throughput sequencing experiments. We examine the performance of this method on simulated and real data in a variety of scenarios. Our methods are implemented as part of the RbaySeq package (versions 1.11.6 and greater) available from Bioconductor (http://www.bioconductor.org). CONCLUSIONS: We compare our approach to alternatives based on generalised linear modelling approaches and show that our method offers significant gains in performance on simulated data. In testing on real data from oral squamous cell carcinoma patients, we discover greater enrichment of previously identified head and neck squamous cell carcinoma associated gene sets than has previously been achieved through a generalised linear modelling approach, suggesting that similar gains in performance may be found in real data. Our methods thus show real and substantial improvements in analyses of high-throughput sequencing data from paired samples. BioMed Central 2013-04-23 /pmc/articles/PMC3658937/ /pubmed/23617841 http://dx.doi.org/10.1186/1471-2105-14-135 Text en Copyright © 2013 Hardcastle and Kelly; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Hardcastle, Thomas J
Kelly, Krystyna A
Empirical Bayesian analysis of paired high-throughput sequencing data with a beta-binomial distribution
title Empirical Bayesian analysis of paired high-throughput sequencing data with a beta-binomial distribution
title_full Empirical Bayesian analysis of paired high-throughput sequencing data with a beta-binomial distribution
title_fullStr Empirical Bayesian analysis of paired high-throughput sequencing data with a beta-binomial distribution
title_full_unstemmed Empirical Bayesian analysis of paired high-throughput sequencing data with a beta-binomial distribution
title_short Empirical Bayesian analysis of paired high-throughput sequencing data with a beta-binomial distribution
title_sort empirical bayesian analysis of paired high-throughput sequencing data with a beta-binomial distribution
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3658937/
https://www.ncbi.nlm.nih.gov/pubmed/23617841
http://dx.doi.org/10.1186/1471-2105-14-135
work_keys_str_mv AT hardcastlethomasj empiricalbayesiananalysisofpairedhighthroughputsequencingdatawithabetabinomialdistribution
AT kellykrystynaa empiricalbayesiananalysisofpairedhighthroughputsequencingdatawithabetabinomialdistribution