Cargando…

SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data

We present a statistical framework for estimation and application of sample allele frequency spectra from New-Generation Sequencing (NGS) data. In this method, we first estimate the allele frequency spectrum using maximum likelihood. In contrast to previous methods, the likelihood function is calcul...

Descripción completa

Detalles Bibliográficos
Autores principales: Nielsen, Rasmus, Korneliussen, Thorfinn, Albrechtsen, Anders, Li, Yingrui, Wang, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3404070/
https://www.ncbi.nlm.nih.gov/pubmed/22911679
http://dx.doi.org/10.1371/journal.pone.0037558
_version_ 1782238982677987328
author Nielsen, Rasmus
Korneliussen, Thorfinn
Albrechtsen, Anders
Li, Yingrui
Wang, Jun
author_facet Nielsen, Rasmus
Korneliussen, Thorfinn
Albrechtsen, Anders
Li, Yingrui
Wang, Jun
author_sort Nielsen, Rasmus
collection PubMed
description We present a statistical framework for estimation and application of sample allele frequency spectra from New-Generation Sequencing (NGS) data. In this method, we first estimate the allele frequency spectrum using maximum likelihood. In contrast to previous methods, the likelihood function is calculated using a dynamic programming algorithm and numerically optimized using analytical derivatives. We then use a Bayesian method for estimating the sample allele frequency in a single site, and show how the method can be used for genotype calling and SNP calling. We also show how the method can be extended to various other cases including cases with deviations from Hardy-Weinberg equilibrium. We evaluate the statistical properties of the methods using simulations and by application to a real data set.
format Online
Article
Text
id pubmed-3404070
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-34040702012-07-30 SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data Nielsen, Rasmus Korneliussen, Thorfinn Albrechtsen, Anders Li, Yingrui Wang, Jun PLoS One Research Article We present a statistical framework for estimation and application of sample allele frequency spectra from New-Generation Sequencing (NGS) data. In this method, we first estimate the allele frequency spectrum using maximum likelihood. In contrast to previous methods, the likelihood function is calculated using a dynamic programming algorithm and numerically optimized using analytical derivatives. We then use a Bayesian method for estimating the sample allele frequency in a single site, and show how the method can be used for genotype calling and SNP calling. We also show how the method can be extended to various other cases including cases with deviations from Hardy-Weinberg equilibrium. We evaluate the statistical properties of the methods using simulations and by application to a real data set. Public Library of Science 2012-07-24 /pmc/articles/PMC3404070/ /pubmed/22911679 http://dx.doi.org/10.1371/journal.pone.0037558 Text en © 2012 Nielsen et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Nielsen, Rasmus
Korneliussen, Thorfinn
Albrechtsen, Anders
Li, Yingrui
Wang, Jun
SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data
title SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data
title_full SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data
title_fullStr SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data
title_full_unstemmed SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data
title_short SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data
title_sort snp calling, genotype calling, and sample allele frequency estimation from new-generation sequencing data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3404070/
https://www.ncbi.nlm.nih.gov/pubmed/22911679
http://dx.doi.org/10.1371/journal.pone.0037558
work_keys_str_mv AT nielsenrasmus snpcallinggenotypecallingandsampleallelefrequencyestimationfromnewgenerationsequencingdata
AT korneliussenthorfinn snpcallinggenotypecallingandsampleallelefrequencyestimationfromnewgenerationsequencingdata
AT albrechtsenanders snpcallinggenotypecallingandsampleallelefrequencyestimationfromnewgenerationsequencingdata
AT liyingrui snpcallinggenotypecallingandsampleallelefrequencyestimationfromnewgenerationsequencingdata
AT wangjun snpcallinggenotypecallingandsampleallelefrequencyestimationfromnewgenerationsequencingdata