Cargando…

Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus

Knowledge of the genomic variation among different strains of a pathogenic microbial species can help in selecting optimal candidates for diagnostic assays and vaccine development. Pooled sequencing (Pool-seq) is a cost effective approach for population level genetic studies that require large numbe...

Descripción completa

Detalles Bibliográficos
Autores principales: Weldatsadik, Rigbe G., Wang, Jingwen, Puhakainen, Kai, Jiao, Hong, Jalava, Jari, Räisänen, Kati, Datta, Neeta, Skoog, Tiina, Vuopio, Jaana, Jokiranta, T. Sakari, Kere, Juha
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374712/
https://www.ncbi.nlm.nih.gov/pubmed/28361960
http://dx.doi.org/10.1038/srep45771
_version_ 1782518951181287424
author Weldatsadik, Rigbe G.
Wang, Jingwen
Puhakainen, Kai
Jiao, Hong
Jalava, Jari
Räisänen, Kati
Datta, Neeta
Skoog, Tiina
Vuopio, Jaana
Jokiranta, T. Sakari
Kere, Juha
author_facet Weldatsadik, Rigbe G.
Wang, Jingwen
Puhakainen, Kai
Jiao, Hong
Jalava, Jari
Räisänen, Kati
Datta, Neeta
Skoog, Tiina
Vuopio, Jaana
Jokiranta, T. Sakari
Kere, Juha
author_sort Weldatsadik, Rigbe G.
collection PubMed
description Knowledge of the genomic variation among different strains of a pathogenic microbial species can help in selecting optimal candidates for diagnostic assays and vaccine development. Pooled sequencing (Pool-seq) is a cost effective approach for population level genetic studies that require large numbers of samples such as various strains of a microbe. To test the use of Pool-seq in identifying variation, we pooled DNA of 100 Streptococcus pyogenes strains of different emm types in two pools, each containing 50 strains. We used four variant calling tools (Freebayes, UnifiedGenotyper, SNVer, and SAMtools) and one emm1 strain, SF370, as a reference genome. In total 63719 SNPs and 164 INDELs were identified in the two pools concordantly by at least two of the tools. Majority of the variants (93.4%) from six individually sequenced strains used in the pools could be identified from the two pools and 72.3% and 97.4% of the variants in the pools could be mined from the analysis of the 44 complete Str. pyogenes genomes and 3407 sequence runs deposited in the European Nucleotide Archive respectively. We conclude that DNA sequencing of pooled samples of large numbers of bacterial strains is a robust, rapid and cost-efficient way to discover sequence variation.
format Online
Article
Text
id pubmed-5374712
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-53747122017-04-03 Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus Weldatsadik, Rigbe G. Wang, Jingwen Puhakainen, Kai Jiao, Hong Jalava, Jari Räisänen, Kati Datta, Neeta Skoog, Tiina Vuopio, Jaana Jokiranta, T. Sakari Kere, Juha Sci Rep Article Knowledge of the genomic variation among different strains of a pathogenic microbial species can help in selecting optimal candidates for diagnostic assays and vaccine development. Pooled sequencing (Pool-seq) is a cost effective approach for population level genetic studies that require large numbers of samples such as various strains of a microbe. To test the use of Pool-seq in identifying variation, we pooled DNA of 100 Streptococcus pyogenes strains of different emm types in two pools, each containing 50 strains. We used four variant calling tools (Freebayes, UnifiedGenotyper, SNVer, and SAMtools) and one emm1 strain, SF370, as a reference genome. In total 63719 SNPs and 164 INDELs were identified in the two pools concordantly by at least two of the tools. Majority of the variants (93.4%) from six individually sequenced strains used in the pools could be identified from the two pools and 72.3% and 97.4% of the variants in the pools could be mined from the analysis of the 44 complete Str. pyogenes genomes and 3407 sequence runs deposited in the European Nucleotide Archive respectively. We conclude that DNA sequencing of pooled samples of large numbers of bacterial strains is a robust, rapid and cost-efficient way to discover sequence variation. Nature Publishing Group 2017-03-31 /pmc/articles/PMC5374712/ /pubmed/28361960 http://dx.doi.org/10.1038/srep45771 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Weldatsadik, Rigbe G.
Wang, Jingwen
Puhakainen, Kai
Jiao, Hong
Jalava, Jari
Räisänen, Kati
Datta, Neeta
Skoog, Tiina
Vuopio, Jaana
Jokiranta, T. Sakari
Kere, Juha
Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus
title Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus
title_full Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus
title_fullStr Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus
title_full_unstemmed Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus
title_short Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus
title_sort sequence analysis of pooled bacterial samples enables identification of strain variation in group a streptococcus
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374712/
https://www.ncbi.nlm.nih.gov/pubmed/28361960
http://dx.doi.org/10.1038/srep45771
work_keys_str_mv AT weldatsadikrigbeg sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus
AT wangjingwen sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus
AT puhakainenkai sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus
AT jiaohong sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus
AT jalavajari sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus
AT raisanenkati sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus
AT dattaneeta sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus
AT skoogtiina sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus
AT vuopiojaana sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus
AT jokirantatsakari sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus
AT kerejuha sequenceanalysisofpooledbacterialsamplesenablesidentificationofstrainvariationingroupastreptococcus