Cargando…
The Listeria monocytogenes Core-Genome Sequence Typer (LmCGST): a bioinformatic pipeline for molecular characterization with next-generation sequence data
BACKGROUND: Next-generation sequencing provides a powerful means of molecular characterization. However, methods such as single-nucleotide polymorphism detection or whole-chromosome sequence analysis are computationally expensive, prone to errors, and are still less accessible than traditional typin...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4618880/ https://www.ncbi.nlm.nih.gov/pubmed/26490433 http://dx.doi.org/10.1186/s12866-015-0526-1 |
_version_ | 1782396991184044032 |
---|---|
author | Pightling, Arthur W. Petronella, Nicholas Pagotto, Franco |
author_facet | Pightling, Arthur W. Petronella, Nicholas Pagotto, Franco |
author_sort | Pightling, Arthur W. |
collection | PubMed |
description | BACKGROUND: Next-generation sequencing provides a powerful means of molecular characterization. However, methods such as single-nucleotide polymorphism detection or whole-chromosome sequence analysis are computationally expensive, prone to errors, and are still less accessible than traditional typing methods. Here, we present the Listeria monocytogenes core-genome sequence typing method for molecular characterization. This method uses a high-confidence core (HCC) genome, calculated to ensure accurate identification of orthologs. We also developed an evolutionarily relevant nomenclature based upon phylogenetic analysis of HCC genomes. Finally, we created a pipeline (LmCGST; https://sourceforge.net/projects/lmcgst/files/) that takes in raw next-generation sequencing reads, calculates a subject HCC profile, compares it to an expandable database, assigns a sequence type, and performs a phylogenetic analysis. RESULTS: We analyzed 29 high-quality, closed Listeria monocytogenes chromosome sequences and identified loci that are reliable targets for automated molecular characterization methods. We identified 1013 open-reading frames that comprise our high-confidence core (HCC) genome. We then populated a database with HCC profiles from 114 taxa. We sequenced 84 randomly selected isolates from the Listeriosis Reference Service for Canada’s collection and analysed them with the LmCGST pipeline. In addition, we generated pulsed-field gel electrophoresis, ribotyping, and in silico multi-locus sequence typing (MLST) data for the 84 isolates and compared the results to those obtained using the CGST method. We found that all of the methods yielded results that are generally congruent. However, due to the increased numbers of categories, the CGST method provides much greater discriminatory power than the other methods tested here. CONCLUSIONS: We show that the CGST method provides increased discriminatory power relative to typing methods such as pulsed-field gel electrophoresis, ribotyping, and multi-locus sequence typing while it addresses several shortcomings of other methods of molecular characterization with next-generation sequence data. It uses discrete, well-defined groupings (types) of organisms that are phylogenetically relevant and easily interpreted. In addition, the CGST scheme can be expanded to include additional loci and HCC profiles in the future. In total, the CGST method provides an approach to the molecular characterization of Listeria monocytogenes with next-generation sequence data that is highly reproducible, easily standardized, portable, and accessible. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12866-015-0526-1) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4618880 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-46188802015-10-25 The Listeria monocytogenes Core-Genome Sequence Typer (LmCGST): a bioinformatic pipeline for molecular characterization with next-generation sequence data Pightling, Arthur W. Petronella, Nicholas Pagotto, Franco BMC Microbiol Research Article BACKGROUND: Next-generation sequencing provides a powerful means of molecular characterization. However, methods such as single-nucleotide polymorphism detection or whole-chromosome sequence analysis are computationally expensive, prone to errors, and are still less accessible than traditional typing methods. Here, we present the Listeria monocytogenes core-genome sequence typing method for molecular characterization. This method uses a high-confidence core (HCC) genome, calculated to ensure accurate identification of orthologs. We also developed an evolutionarily relevant nomenclature based upon phylogenetic analysis of HCC genomes. Finally, we created a pipeline (LmCGST; https://sourceforge.net/projects/lmcgst/files/) that takes in raw next-generation sequencing reads, calculates a subject HCC profile, compares it to an expandable database, assigns a sequence type, and performs a phylogenetic analysis. RESULTS: We analyzed 29 high-quality, closed Listeria monocytogenes chromosome sequences and identified loci that are reliable targets for automated molecular characterization methods. We identified 1013 open-reading frames that comprise our high-confidence core (HCC) genome. We then populated a database with HCC profiles from 114 taxa. We sequenced 84 randomly selected isolates from the Listeriosis Reference Service for Canada’s collection and analysed them with the LmCGST pipeline. In addition, we generated pulsed-field gel electrophoresis, ribotyping, and in silico multi-locus sequence typing (MLST) data for the 84 isolates and compared the results to those obtained using the CGST method. We found that all of the methods yielded results that are generally congruent. However, due to the increased numbers of categories, the CGST method provides much greater discriminatory power than the other methods tested here. CONCLUSIONS: We show that the CGST method provides increased discriminatory power relative to typing methods such as pulsed-field gel electrophoresis, ribotyping, and multi-locus sequence typing while it addresses several shortcomings of other methods of molecular characterization with next-generation sequence data. It uses discrete, well-defined groupings (types) of organisms that are phylogenetically relevant and easily interpreted. In addition, the CGST scheme can be expanded to include additional loci and HCC profiles in the future. In total, the CGST method provides an approach to the molecular characterization of Listeria monocytogenes with next-generation sequence data that is highly reproducible, easily standardized, portable, and accessible. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12866-015-0526-1) contains supplementary material, which is available to authorized users. BioMed Central 2015-10-22 /pmc/articles/PMC4618880/ /pubmed/26490433 http://dx.doi.org/10.1186/s12866-015-0526-1 Text en © Pightling et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Pightling, Arthur W. Petronella, Nicholas Pagotto, Franco The Listeria monocytogenes Core-Genome Sequence Typer (LmCGST): a bioinformatic pipeline for molecular characterization with next-generation sequence data |
title | The Listeria monocytogenes Core-Genome Sequence Typer (LmCGST): a bioinformatic pipeline for molecular characterization with next-generation sequence data |
title_full | The Listeria monocytogenes Core-Genome Sequence Typer (LmCGST): a bioinformatic pipeline for molecular characterization with next-generation sequence data |
title_fullStr | The Listeria monocytogenes Core-Genome Sequence Typer (LmCGST): a bioinformatic pipeline for molecular characterization with next-generation sequence data |
title_full_unstemmed | The Listeria monocytogenes Core-Genome Sequence Typer (LmCGST): a bioinformatic pipeline for molecular characterization with next-generation sequence data |
title_short | The Listeria monocytogenes Core-Genome Sequence Typer (LmCGST): a bioinformatic pipeline for molecular characterization with next-generation sequence data |
title_sort | listeria monocytogenes core-genome sequence typer (lmcgst): a bioinformatic pipeline for molecular characterization with next-generation sequence data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4618880/ https://www.ncbi.nlm.nih.gov/pubmed/26490433 http://dx.doi.org/10.1186/s12866-015-0526-1 |
work_keys_str_mv | AT pightlingarthurw thelisteriamonocytogenescoregenomesequencetyperlmcgstabioinformaticpipelineformolecularcharacterizationwithnextgenerationsequencedata AT petronellanicholas thelisteriamonocytogenescoregenomesequencetyperlmcgstabioinformaticpipelineformolecularcharacterizationwithnextgenerationsequencedata AT pagottofranco thelisteriamonocytogenescoregenomesequencetyperlmcgstabioinformaticpipelineformolecularcharacterizationwithnextgenerationsequencedata AT pightlingarthurw listeriamonocytogenescoregenomesequencetyperlmcgstabioinformaticpipelineformolecularcharacterizationwithnextgenerationsequencedata AT petronellanicholas listeriamonocytogenescoregenomesequencetyperlmcgstabioinformaticpipelineformolecularcharacterizationwithnextgenerationsequencedata AT pagottofranco listeriamonocytogenescoregenomesequencetyperlmcgstabioinformaticpipelineformolecularcharacterizationwithnextgenerationsequencedata |