Cargando…

Extra-binomial variation approach for analysis of pooled DNA sequencing data

Motivation: The invention of next-generation sequencing technology has made it possible to study the rare variants that are more likely to pinpoint causal disease genes. To make such experiments financially viable, DNA samples from several subjects are often pooled before sequencing. This induces la...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Xin, Todd, John A., Clayton, David, Wallace, Chris
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3496343/
https://www.ncbi.nlm.nih.gov/pubmed/22976083
http://dx.doi.org/10.1093/bioinformatics/bts553
_version_ 1782249619940442112
author Yang, Xin
Todd, John A.
Clayton, David
Wallace, Chris
author_facet Yang, Xin
Todd, John A.
Clayton, David
Wallace, Chris
author_sort Yang, Xin
collection PubMed
description Motivation: The invention of next-generation sequencing technology has made it possible to study the rare variants that are more likely to pinpoint causal disease genes. To make such experiments financially viable, DNA samples from several subjects are often pooled before sequencing. This induces large between-pool variation which, together with other sources of experimental error, creates over-dispersed data. Statistical analysis of pooled sequencing data needs to appropriately model this additional variance to avoid inflating the false-positive rate. Results: We propose a new statistical method based on an extra-binomial model to address the over-dispersion and apply it to pooled case-control data. We demonstrate that our model provides a better fit to the data than either a standard binomial model or a traditional extra-binomial model proposed by Williams and can analyse both rare and common variants with lower or more variable pool depths compared to the other methods. Availability: Package ‘extraBinomial’ is on http://cran.r-project.org/ Contact: chris.wallace@cimr.cam.ac.uk Supplementary information: Supplementary data are available at Bioinformatics Online.
format Online
Article
Text
id pubmed-3496343
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-34963432012-12-12 Extra-binomial variation approach for analysis of pooled DNA sequencing data Yang, Xin Todd, John A. Clayton, David Wallace, Chris Bioinformatics Original Papers Motivation: The invention of next-generation sequencing technology has made it possible to study the rare variants that are more likely to pinpoint causal disease genes. To make such experiments financially viable, DNA samples from several subjects are often pooled before sequencing. This induces large between-pool variation which, together with other sources of experimental error, creates over-dispersed data. Statistical analysis of pooled sequencing data needs to appropriately model this additional variance to avoid inflating the false-positive rate. Results: We propose a new statistical method based on an extra-binomial model to address the over-dispersion and apply it to pooled case-control data. We demonstrate that our model provides a better fit to the data than either a standard binomial model or a traditional extra-binomial model proposed by Williams and can analyse both rare and common variants with lower or more variable pool depths compared to the other methods. Availability: Package ‘extraBinomial’ is on http://cran.r-project.org/ Contact: chris.wallace@cimr.cam.ac.uk Supplementary information: Supplementary data are available at Bioinformatics Online. Oxford University Press 2012-11-15 2012-09-12 /pmc/articles/PMC3496343/ /pubmed/22976083 http://dx.doi.org/10.1093/bioinformatics/bts553 Text en © The Author 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Yang, Xin
Todd, John A.
Clayton, David
Wallace, Chris
Extra-binomial variation approach for analysis of pooled DNA sequencing data
title Extra-binomial variation approach for analysis of pooled DNA sequencing data
title_full Extra-binomial variation approach for analysis of pooled DNA sequencing data
title_fullStr Extra-binomial variation approach for analysis of pooled DNA sequencing data
title_full_unstemmed Extra-binomial variation approach for analysis of pooled DNA sequencing data
title_short Extra-binomial variation approach for analysis of pooled DNA sequencing data
title_sort extra-binomial variation approach for analysis of pooled dna sequencing data
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3496343/
https://www.ncbi.nlm.nih.gov/pubmed/22976083
http://dx.doi.org/10.1093/bioinformatics/bts553
work_keys_str_mv AT yangxin extrabinomialvariationapproachforanalysisofpooleddnasequencingdata
AT toddjohna extrabinomialvariationapproachforanalysisofpooleddnasequencingdata
AT claytondavid extrabinomialvariationapproachforanalysisofpooleddnasequencingdata
AT wallacechris extrabinomialvariationapproachforanalysisofpooleddnasequencingdata