Detecting SNPs and estimating allele frequencies in clonal bacterial populations by sequencing pooled DNA

Summary: Here, we present a method for estimating the frequencies of SNP alleles present within pooled samples of DNA using high-throughput short-read sequencing. The method was tested on real data from six strains of the highly monomorphic pathogen Salmonella Paratyphi A, sequenced individually and...

Bibliographic Details
Main Authors: Holt, Kathryn E., Teo, Yik Y., Li, Heng, Nair, Satheesh, Dougan, Gordon, Wain, John, Parkhill, Julian
Format: Text
Language: English
Published: Oxford University Press 2009
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2722999/
https://www.ncbi.nlm.nih.gov/pubmed/19497932
http://dx.doi.org/10.1093/bioinformatics/btp344
_version_ 1782170346203381760
author Holt, Kathryn E.
Teo, Yik Y.
Li, Heng
Nair, Satheesh
Dougan, Gordon
Wain, John
Parkhill, Julian
author_facet Holt, Kathryn E.
Teo, Yik Y.
Li, Heng
Nair, Satheesh
Dougan, Gordon
Wain, John
Parkhill, Julian
author_sort Holt, Kathryn E.
collection PubMed
description Summary: Here, we present a method for estimating the frequencies of SNP alleles present within pooled samples of DNA using high-throughput short-read sequencing. The method was tested on real data from six strains of the highly monomorphic pathogen Salmonella Paratyphi A, sequenced individually and in a pool. A variety of read mapping and quality-weighting procedures were tested to determine the optimal parameters, which afforded ≥80% sensitivity of SNP detection and strong correlation with true SNP frequency at poolwide read depth of 40×, declining only slightly at read depths 20–40×. Availability: The method was implemented in Perl and relies on the opensource software Maq for read mapping and SNP calling. The Perl script is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/pools/. Contact: kh2@sanger.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2722999
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-27229992009-08-07 Detecting SNPs and estimating allele frequencies in clonal bacterial populations by sequencing pooled DNA Holt, Kathryn E. Teo, Yik Y. Li, Heng Nair, Satheesh Dougan, Gordon Wain, John Parkhill, Julian Bioinformatics Applications Note Summary: Here, we present a method for estimating the frequencies of SNP alleles present within pooled samples of DNA using high-throughput short-read sequencing. The method was tested on real data from six strains of the highly monomorphic pathogen Salmonella Paratyphi A, sequenced individually and in a pool. A variety of read mapping and quality-weighting procedures were tested to determine the optimal parameters, which afforded ≥80% sensitivity of SNP detection and strong correlation with true SNP frequency at poolwide read depth of 40×, declining only slightly at read depths 20–40×. Availability: The method was implemented in Perl and relies on the opensource software Maq for read mapping and SNP calling. The Perl script is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/pools/. Contact: kh2@sanger.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2009-08-15 2009-06-03 /pmc/articles/PMC2722999/ /pubmed/19497932 http://dx.doi.org/10.1093/bioinformatics/btp344 Text en http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Holt, Kathryn E.
Teo, Yik Y.
Li, Heng
Nair, Satheesh
Dougan, Gordon
Wain, John
Parkhill, Julian
Detecting SNPs and estimating allele frequencies in clonal bacterial populations by sequencing pooled DNA
title Detecting SNPs and estimating allele frequencies in clonal bacterial populations by sequencing pooled DNA
title_sort detecting snps and estimating allele frequencies in clonal bacterial populations by sequencing pooled dna
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2722999/
https://www.ncbi.nlm.nih.gov/pubmed/19497932
http://dx.doi.org/10.1093/bioinformatics/btp344