Cargando…
Extra-binomial variation approach for analysis of pooled DNA sequencing data
Motivation: The invention of next-generation sequencing technology has made it possible to study the rare variants that are more likely to pinpoint causal disease genes. To make such experiments financially viable, DNA samples from several subjects are often pooled before sequencing. This induces la...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3496343/ https://www.ncbi.nlm.nih.gov/pubmed/22976083 http://dx.doi.org/10.1093/bioinformatics/bts553 |
_version_ | 1782249619940442112 |
---|---|
author | Yang, Xin Todd, John A. Clayton, David Wallace, Chris |
author_facet | Yang, Xin Todd, John A. Clayton, David Wallace, Chris |
author_sort | Yang, Xin |
collection | PubMed |
description | Motivation: The invention of next-generation sequencing technology has made it possible to study the rare variants that are more likely to pinpoint causal disease genes. To make such experiments financially viable, DNA samples from several subjects are often pooled before sequencing. This induces large between-pool variation which, together with other sources of experimental error, creates over-dispersed data. Statistical analysis of pooled sequencing data needs to appropriately model this additional variance to avoid inflating the false-positive rate. Results: We propose a new statistical method based on an extra-binomial model to address the over-dispersion and apply it to pooled case-control data. We demonstrate that our model provides a better fit to the data than either a standard binomial model or a traditional extra-binomial model proposed by Williams and can analyse both rare and common variants with lower or more variable pool depths compared to the other methods. Availability: Package ‘extraBinomial’ is on http://cran.r-project.org/ Contact: chris.wallace@cimr.cam.ac.uk Supplementary information: Supplementary data are available at Bioinformatics Online. |
format | Online Article Text |
id | pubmed-3496343 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-34963432012-12-12 Extra-binomial variation approach for analysis of pooled DNA sequencing data Yang, Xin Todd, John A. Clayton, David Wallace, Chris Bioinformatics Original Papers Motivation: The invention of next-generation sequencing technology has made it possible to study the rare variants that are more likely to pinpoint causal disease genes. To make such experiments financially viable, DNA samples from several subjects are often pooled before sequencing. This induces large between-pool variation which, together with other sources of experimental error, creates over-dispersed data. Statistical analysis of pooled sequencing data needs to appropriately model this additional variance to avoid inflating the false-positive rate. Results: We propose a new statistical method based on an extra-binomial model to address the over-dispersion and apply it to pooled case-control data. We demonstrate that our model provides a better fit to the data than either a standard binomial model or a traditional extra-binomial model proposed by Williams and can analyse both rare and common variants with lower or more variable pool depths compared to the other methods. Availability: Package ‘extraBinomial’ is on http://cran.r-project.org/ Contact: chris.wallace@cimr.cam.ac.uk Supplementary information: Supplementary data are available at Bioinformatics Online. Oxford University Press 2012-11-15 2012-09-12 /pmc/articles/PMC3496343/ /pubmed/22976083 http://dx.doi.org/10.1093/bioinformatics/bts553 Text en © The Author 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Yang, Xin Todd, John A. Clayton, David Wallace, Chris Extra-binomial variation approach for analysis of pooled DNA sequencing data |
title | Extra-binomial variation approach for analysis of pooled DNA sequencing data |
title_full | Extra-binomial variation approach for analysis of pooled DNA sequencing data |
title_fullStr | Extra-binomial variation approach for analysis of pooled DNA sequencing data |
title_full_unstemmed | Extra-binomial variation approach for analysis of pooled DNA sequencing data |
title_short | Extra-binomial variation approach for analysis of pooled DNA sequencing data |
title_sort | extra-binomial variation approach for analysis of pooled dna sequencing data |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3496343/ https://www.ncbi.nlm.nih.gov/pubmed/22976083 http://dx.doi.org/10.1093/bioinformatics/bts553 |
work_keys_str_mv | AT yangxin extrabinomialvariationapproachforanalysisofpooleddnasequencingdata AT toddjohna extrabinomialvariationapproachforanalysisofpooleddnasequencingdata AT claytondavid extrabinomialvariationapproachforanalysisofpooleddnasequencingdata AT wallacechris extrabinomialvariationapproachforanalysisofpooleddnasequencingdata |