Cargando…
Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus
Knowledge of the genomic variation among different strains of a pathogenic microbial species can help in selecting optimal candidates for diagnostic assays and vaccine development. Pooled sequencing (Pool-seq) is a cost effective approach for population level genetic studies that require large numbe...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374712/ https://www.ncbi.nlm.nih.gov/pubmed/28361960 http://dx.doi.org/10.1038/srep45771 |
_version_ | 1782518951181287424 |
---|---|
author | Weldatsadik, Rigbe G. Wang, Jingwen Puhakainen, Kai Jiao, Hong Jalava, Jari Räisänen, Kati Datta, Neeta Skoog, Tiina Vuopio, Jaana Jokiranta, T. Sakari Kere, Juha |
author_facet | Weldatsadik, Rigbe G. Wang, Jingwen Puhakainen, Kai Jiao, Hong Jalava, Jari Räisänen, Kati Datta, Neeta Skoog, Tiina Vuopio, Jaana Jokiranta, T. Sakari Kere, Juha |
author_sort | Weldatsadik, Rigbe G. |
collection | PubMed |
description | Knowledge of the genomic variation among different strains of a pathogenic microbial species can help in selecting optimal candidates for diagnostic assays and vaccine development. Pooled sequencing (Pool-seq) is a cost effective approach for population level genetic studies that require large numbers of samples such as various strains of a microbe. To test the use of Pool-seq in identifying variation, we pooled DNA of 100 Streptococcus pyogenes strains of different emm types in two pools, each containing 50 strains. We used four variant calling tools (Freebayes, UnifiedGenotyper, SNVer, and SAMtools) and one emm1 strain, SF370, as a reference genome. In total 63719 SNPs and 164 INDELs were identified in the two pools concordantly by at least two of the tools. Majority of the variants (93.4%) from six individually sequenced strains used in the pools could be identified from the two pools and 72.3% and 97.4% of the variants in the pools could be mined from the analysis of the 44 complete Str. pyogenes genomes and 3407 sequence runs deposited in the European Nucleotide Archive respectively. We conclude that DNA sequencing of pooled samples of large numbers of bacterial strains is a robust, rapid and cost-efficient way to discover sequence variation. |
format | Online Article Text |
id | pubmed-5374712 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-53747122017-04-03 Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus Weldatsadik, Rigbe G. Wang, Jingwen Puhakainen, Kai Jiao, Hong Jalava, Jari Räisänen, Kati Datta, Neeta Skoog, Tiina Vuopio, Jaana Jokiranta, T. Sakari Kere, Juha Sci Rep Article Knowledge of the genomic variation among different strains of a pathogenic microbial species can help in selecting optimal candidates for diagnostic assays and vaccine development. Pooled sequencing (Pool-seq) is a cost effective approach for population level genetic studies that require large numbers of samples such as various strains of a microbe. To test the use of Pool-seq in identifying variation, we pooled DNA of 100 Streptococcus pyogenes strains of different emm types in two pools, each containing 50 strains. We used four variant calling tools (Freebayes, UnifiedGenotyper, SNVer, and SAMtools) and one emm1 strain, SF370, as a reference genome. In total 63719 SNPs and 164 INDELs were identified in the two pools concordantly by at least two of the tools. Majority of the variants (93.4%) from six individually sequenced strains used in the pools could be identified from the two pools and 72.3% and 97.4% of the variants in the pools could be mined from the analysis of the 44 complete Str. pyogenes genomes and 3407 sequence runs deposited in the European Nucleotide Archive respectively. We conclude that DNA sequencing of pooled samples of large numbers of bacterial strains is a robust, rapid and cost-efficient way to discover sequence variation. Nature Publishing Group 2017-03-31 /pmc/articles/PMC5374712/ /pubmed/28361960 http://dx.doi.org/10.1038/srep45771 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Weldatsadik, Rigbe G. Wang, Jingwen Puhakainen, Kai Jiao, Hong Jalava, Jari Räisänen, Kati Datta, Neeta Skoog, Tiina Vuopio, Jaana Jokiranta, T. Sakari Kere, Juha Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus |
title | Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus |
title_full | Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus |
title_fullStr | Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus |
title_full_unstemmed | Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus |
title_short | Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus |
title_sort | sequence analysis of pooled bacterial samples enables identification of strain variation in group a streptococcus |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374712/ https://www.ncbi.nlm.nih.gov/pubmed/28361960 http://dx.doi.org/10.1038/srep45771 |
work_keys_str_mv | AT weldatsadikrigbeg sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus AT wangjingwen sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus AT puhakainenkai sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus AT jiaohong sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus AT jalavajari sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus AT raisanenkati sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus AT dattaneeta sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus AT skoogtiina sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus AT vuopiojaana sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus AT jokirantatsakari sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus AT kerejuha sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus |