Cargando…

Gen2Epi: an automated whole-genome sequencing pipeline for linking full genomes to antimicrobial susceptibility and molecular epidemiological data in Neisseria gonorrhoeae

BACKGROUND: Recent adva1nces in whole genome sequencing (WGS) based technologies have facilitated multi-step applications for predicting antimicrobial resistance (AMR) and investigating the molecular epidemiology of Neisseria gonorrhoeae. However, generating full scaffolds of N. gonorrhoeae genomes...

Descripción completa

Detalles Bibliográficos
Autores principales: Singh, Reema, Dillon, Jo-Anne R., Demczuk, Walter, Kusalik, Anthony
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6398234/
https://www.ncbi.nlm.nih.gov/pubmed/30832565
http://dx.doi.org/10.1186/s12864-019-5542-3
_version_ 1783399545555648512
author Singh, Reema
Dillon, Jo-Anne R.
Demczuk, Walter
Kusalik, Anthony
author_facet Singh, Reema
Dillon, Jo-Anne R.
Demczuk, Walter
Kusalik, Anthony
author_sort Singh, Reema
collection PubMed
description BACKGROUND: Recent adva1nces in whole genome sequencing (WGS) based technologies have facilitated multi-step applications for predicting antimicrobial resistance (AMR) and investigating the molecular epidemiology of Neisseria gonorrhoeae. However, generating full scaffolds of N. gonorrhoeae genomes from short reads, and the assignment of molecular epidemiological information (NG-MLST, NG-MAST, and NG-STAR) to multiple assembled samples, is challenging due to required manual tasks such as annotating antimicrobial resistance determinants with standard nomenclature for a large number of genomes. RESULTS: We present Gen2Epi, a pipeline that assembles short reads into full scaffolds and automatically assigns molecular epidemiological and AMR information to the assembled genomes. Gen2Epi is a command-line tool integrating third-party software and tailored specifically for N. gonorrhoeae. For its evaluation, the Gen2Epi pipeline successfully assembled the WGS short reads from 1484 N. gonorrhoeae samples into full-length genomes for both chromosomes and plasmids and was able to assign in silico molecular determinant information to each dataset automatically. The assemblies were generated using raw as well as trimmed short reads. The median genome coverage of full-length scaffolds and “N” statistics (N50, NG50, and NGA50) were higher than, or comparable to, previously published results and the scaffolding process improved the quality of the draft genome assemblies. Molecular antimicrobial resistant (AMR) determinants identified by Gen2Epi reproduced information for the 1484 samples as previously reported, including NG-MLST, NG-MAST, and NG-STAR molecular sequence types. CONCLUSIONS: Gen2Epi can be used to assemble short reads into full-length genomes and assign accurate molecular marker and AMR information automatically from NG-STAR, NG-MAST, and NG-MLST. Gen2Epi is publicly available under “CC BY-NC 2.0 CA” Creative Commons licensing as a VirtualBox image containing the constituent software components running on the LINUX operating system (CentOS 7). The image and associated documentation are available via anonymous FTP at ftp://www.cs.usask.ca/pub/combi or ftp://ftp.cs.usask.ca/pub/combi ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5542-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6398234
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63982342019-03-13 Gen2Epi: an automated whole-genome sequencing pipeline for linking full genomes to antimicrobial susceptibility and molecular epidemiological data in Neisseria gonorrhoeae Singh, Reema Dillon, Jo-Anne R. Demczuk, Walter Kusalik, Anthony BMC Genomics Software Article BACKGROUND: Recent adva1nces in whole genome sequencing (WGS) based technologies have facilitated multi-step applications for predicting antimicrobial resistance (AMR) and investigating the molecular epidemiology of Neisseria gonorrhoeae. However, generating full scaffolds of N. gonorrhoeae genomes from short reads, and the assignment of molecular epidemiological information (NG-MLST, NG-MAST, and NG-STAR) to multiple assembled samples, is challenging due to required manual tasks such as annotating antimicrobial resistance determinants with standard nomenclature for a large number of genomes. RESULTS: We present Gen2Epi, a pipeline that assembles short reads into full scaffolds and automatically assigns molecular epidemiological and AMR information to the assembled genomes. Gen2Epi is a command-line tool integrating third-party software and tailored specifically for N. gonorrhoeae. For its evaluation, the Gen2Epi pipeline successfully assembled the WGS short reads from 1484 N. gonorrhoeae samples into full-length genomes for both chromosomes and plasmids and was able to assign in silico molecular determinant information to each dataset automatically. The assemblies were generated using raw as well as trimmed short reads. The median genome coverage of full-length scaffolds and “N” statistics (N50, NG50, and NGA50) were higher than, or comparable to, previously published results and the scaffolding process improved the quality of the draft genome assemblies. Molecular antimicrobial resistant (AMR) determinants identified by Gen2Epi reproduced information for the 1484 samples as previously reported, including NG-MLST, NG-MAST, and NG-STAR molecular sequence types. CONCLUSIONS: Gen2Epi can be used to assemble short reads into full-length genomes and assign accurate molecular marker and AMR information automatically from NG-STAR, NG-MAST, and NG-MLST. Gen2Epi is publicly available under “CC BY-NC 2.0 CA” Creative Commons licensing as a VirtualBox image containing the constituent software components running on the LINUX operating system (CentOS 7). The image and associated documentation are available via anonymous FTP at ftp://www.cs.usask.ca/pub/combi or ftp://ftp.cs.usask.ca/pub/combi ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5542-3) contains supplementary material, which is available to authorized users. BioMed Central 2019-03-04 /pmc/articles/PMC6398234/ /pubmed/30832565 http://dx.doi.org/10.1186/s12864-019-5542-3 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software Article
Singh, Reema
Dillon, Jo-Anne R.
Demczuk, Walter
Kusalik, Anthony
Gen2Epi: an automated whole-genome sequencing pipeline for linking full genomes to antimicrobial susceptibility and molecular epidemiological data in Neisseria gonorrhoeae
title Gen2Epi: an automated whole-genome sequencing pipeline for linking full genomes to antimicrobial susceptibility and molecular epidemiological data in Neisseria gonorrhoeae
title_full Gen2Epi: an automated whole-genome sequencing pipeline for linking full genomes to antimicrobial susceptibility and molecular epidemiological data in Neisseria gonorrhoeae
title_fullStr Gen2Epi: an automated whole-genome sequencing pipeline for linking full genomes to antimicrobial susceptibility and molecular epidemiological data in Neisseria gonorrhoeae
title_full_unstemmed Gen2Epi: an automated whole-genome sequencing pipeline for linking full genomes to antimicrobial susceptibility and molecular epidemiological data in Neisseria gonorrhoeae
title_short Gen2Epi: an automated whole-genome sequencing pipeline for linking full genomes to antimicrobial susceptibility and molecular epidemiological data in Neisseria gonorrhoeae
title_sort gen2epi: an automated whole-genome sequencing pipeline for linking full genomes to antimicrobial susceptibility and molecular epidemiological data in neisseria gonorrhoeae
topic Software Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6398234/
https://www.ncbi.nlm.nih.gov/pubmed/30832565
http://dx.doi.org/10.1186/s12864-019-5542-3
work_keys_str_mv AT singhreema gen2epianautomatedwholegenomesequencingpipelineforlinkingfullgenomestoantimicrobialsusceptibilityandmolecularepidemiologicaldatainneisseriagonorrhoeae
AT dillonjoanner gen2epianautomatedwholegenomesequencingpipelineforlinkingfullgenomestoantimicrobialsusceptibilityandmolecularepidemiologicaldatainneisseriagonorrhoeae
AT demczukwalter gen2epianautomatedwholegenomesequencingpipelineforlinkingfullgenomestoantimicrobialsusceptibilityandmolecularepidemiologicaldatainneisseriagonorrhoeae
AT kusalikanthony gen2epianautomatedwholegenomesequencingpipelineforlinkingfullgenomestoantimicrobialsusceptibilityandmolecularepidemiologicaldatainneisseriagonorrhoeae