Cargando…
SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data
We present a statistical framework for estimation and application of sample allele frequency spectra from New-Generation Sequencing (NGS) data. In this method, we first estimate the allele frequency spectrum using maximum likelihood. In contrast to previous methods, the likelihood function is calcul...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3404070/ https://www.ncbi.nlm.nih.gov/pubmed/22911679 http://dx.doi.org/10.1371/journal.pone.0037558 |
_version_ | 1782238982677987328 |
---|---|
author | Nielsen, Rasmus Korneliussen, Thorfinn Albrechtsen, Anders Li, Yingrui Wang, Jun |
author_facet | Nielsen, Rasmus Korneliussen, Thorfinn Albrechtsen, Anders Li, Yingrui Wang, Jun |
author_sort | Nielsen, Rasmus |
collection | PubMed |
description | We present a statistical framework for estimation and application of sample allele frequency spectra from New-Generation Sequencing (NGS) data. In this method, we first estimate the allele frequency spectrum using maximum likelihood. In contrast to previous methods, the likelihood function is calculated using a dynamic programming algorithm and numerically optimized using analytical derivatives. We then use a Bayesian method for estimating the sample allele frequency in a single site, and show how the method can be used for genotype calling and SNP calling. We also show how the method can be extended to various other cases including cases with deviations from Hardy-Weinberg equilibrium. We evaluate the statistical properties of the methods using simulations and by application to a real data set. |
format | Online Article Text |
id | pubmed-3404070 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-34040702012-07-30 SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data Nielsen, Rasmus Korneliussen, Thorfinn Albrechtsen, Anders Li, Yingrui Wang, Jun PLoS One Research Article We present a statistical framework for estimation and application of sample allele frequency spectra from New-Generation Sequencing (NGS) data. In this method, we first estimate the allele frequency spectrum using maximum likelihood. In contrast to previous methods, the likelihood function is calculated using a dynamic programming algorithm and numerically optimized using analytical derivatives. We then use a Bayesian method for estimating the sample allele frequency in a single site, and show how the method can be used for genotype calling and SNP calling. We also show how the method can be extended to various other cases including cases with deviations from Hardy-Weinberg equilibrium. We evaluate the statistical properties of the methods using simulations and by application to a real data set. Public Library of Science 2012-07-24 /pmc/articles/PMC3404070/ /pubmed/22911679 http://dx.doi.org/10.1371/journal.pone.0037558 Text en © 2012 Nielsen et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Nielsen, Rasmus Korneliussen, Thorfinn Albrechtsen, Anders Li, Yingrui Wang, Jun SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data |
title | SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data |
title_full | SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data |
title_fullStr | SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data |
title_full_unstemmed | SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data |
title_short | SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data |
title_sort | snp calling, genotype calling, and sample allele frequency estimation from new-generation sequencing data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3404070/ https://www.ncbi.nlm.nih.gov/pubmed/22911679 http://dx.doi.org/10.1371/journal.pone.0037558 |
work_keys_str_mv | AT nielsenrasmus snpcallinggenotypecallingandsampleallelefrequencyestimationfromnewgenerationsequencingdata AT korneliussenthorfinn snpcallinggenotypecallingandsampleallelefrequencyestimationfromnewgenerationsequencingdata AT albrechtsenanders snpcallinggenotypecallingandsampleallelefrequencyestimationfromnewgenerationsequencingdata AT liyingrui snpcallinggenotypecallingandsampleallelefrequencyestimationfromnewgenerationsequencingdata AT wangjun snpcallinggenotypecallingandsampleallelefrequencyestimationfromnewgenerationsequencingdata |