Cargando…

Systems-Based Approach for Optimization of Assembly-Free Bacterial MLST Mapping

Epidemiological surveillance of bacterial pathogens requires real-time data analysis with a fast turnaround, while aiming at generating two main outcomes: (1) species-level identification and (2) variant mapping at different levels of genotypic resolution for population-based tracking and surveillan...

Descripción completa

Detalles Bibliográficos
Autores principales: Pavlovikj, Natasha, Gomes-Neto, Joao Carlos, Deogun, Jitender S., Benson, Andrew K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9147691/
https://www.ncbi.nlm.nih.gov/pubmed/35629339
http://dx.doi.org/10.3390/life12050670
_version_ 1784716870195085312
author Pavlovikj, Natasha
Gomes-Neto, Joao Carlos
Deogun, Jitender S.
Benson, Andrew K.
author_facet Pavlovikj, Natasha
Gomes-Neto, Joao Carlos
Deogun, Jitender S.
Benson, Andrew K.
author_sort Pavlovikj, Natasha
collection PubMed
description Epidemiological surveillance of bacterial pathogens requires real-time data analysis with a fast turnaround, while aiming at generating two main outcomes: (1) species-level identification and (2) variant mapping at different levels of genotypic resolution for population-based tracking and surveillance, in addition to predicting traits such as antimicrobial resistance (AMR). Multi-locus sequence typing (MLST) aids this process by identifying sequence types (ST) based on seven ubiquitous genome-scattered loci. In this paper, we selected one assembly-dependent and one assembly-free method for ST mapping and applied them with the default settings and ST schemes they are distributed with, and systematically assessed their accuracy and scalability across a wide array of phylogenetically divergent Public Health-relevant bacterial pathogens with available MLST databases. Our data show that the optimal k-mer length for stringMLST is species-specific and that genome-intrinsic and -extrinsic features can affect the performance and accuracy of the program. Although suitable parameters could be identified for most organisms, there were instances where this program may not be directly deployable in its current format. Next, we integrated stringMLST into our freely available and scalable hierarchical-based population genomics platform, ProkEvo, and further demonstrated how the implementation facilitates automated, reproducible bacterial population analysis.
format Online
Article
Text
id pubmed-9147691
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91476912022-05-29 Systems-Based Approach for Optimization of Assembly-Free Bacterial MLST Mapping Pavlovikj, Natasha Gomes-Neto, Joao Carlos Deogun, Jitender S. Benson, Andrew K. Life (Basel) Article Epidemiological surveillance of bacterial pathogens requires real-time data analysis with a fast turnaround, while aiming at generating two main outcomes: (1) species-level identification and (2) variant mapping at different levels of genotypic resolution for population-based tracking and surveillance, in addition to predicting traits such as antimicrobial resistance (AMR). Multi-locus sequence typing (MLST) aids this process by identifying sequence types (ST) based on seven ubiquitous genome-scattered loci. In this paper, we selected one assembly-dependent and one assembly-free method for ST mapping and applied them with the default settings and ST schemes they are distributed with, and systematically assessed their accuracy and scalability across a wide array of phylogenetically divergent Public Health-relevant bacterial pathogens with available MLST databases. Our data show that the optimal k-mer length for stringMLST is species-specific and that genome-intrinsic and -extrinsic features can affect the performance and accuracy of the program. Although suitable parameters could be identified for most organisms, there were instances where this program may not be directly deployable in its current format. Next, we integrated stringMLST into our freely available and scalable hierarchical-based population genomics platform, ProkEvo, and further demonstrated how the implementation facilitates automated, reproducible bacterial population analysis. MDPI 2022-04-30 /pmc/articles/PMC9147691/ /pubmed/35629339 http://dx.doi.org/10.3390/life12050670 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Pavlovikj, Natasha
Gomes-Neto, Joao Carlos
Deogun, Jitender S.
Benson, Andrew K.
Systems-Based Approach for Optimization of Assembly-Free Bacterial MLST Mapping
title Systems-Based Approach for Optimization of Assembly-Free Bacterial MLST Mapping
title_full Systems-Based Approach for Optimization of Assembly-Free Bacterial MLST Mapping
title_fullStr Systems-Based Approach for Optimization of Assembly-Free Bacterial MLST Mapping
title_full_unstemmed Systems-Based Approach for Optimization of Assembly-Free Bacterial MLST Mapping
title_short Systems-Based Approach for Optimization of Assembly-Free Bacterial MLST Mapping
title_sort systems-based approach for optimization of assembly-free bacterial mlst mapping
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9147691/
https://www.ncbi.nlm.nih.gov/pubmed/35629339
http://dx.doi.org/10.3390/life12050670
work_keys_str_mv AT pavlovikjnatasha systemsbasedapproachforoptimizationofassemblyfreebacterialmlstmapping
AT gomesnetojoaocarlos systemsbasedapproachforoptimizationofassemblyfreebacterialmlstmapping
AT deogunjitenders systemsbasedapproachforoptimizationofassemblyfreebacterialmlstmapping
AT bensonandrewk systemsbasedapproachforoptimizationofassemblyfreebacterialmlstmapping