Cargando…

A Practical Bioinformatics Workflow for Routine Analysis of Bacterial WGS Data

The use of whole-genome sequencing (WGS) for bacterial characterisation has increased substantially in the last decade. Its high throughput and decreasing cost have led to significant changes in outbreak investigations and surveillance of a wide variety of microbial pathogens. Despite the innumerabl...

Descripción completa

Detalles Bibliográficos
Autores principales: Atxaerandio-Landa, Aitor, Arrieta-Gisasola, Ainhoa, Laorden, Lorena, Bikandi, Joseba, Garaizar, Javier, Martinez-Malaxetxebarria, Irati, Martinez-Ballesteros, Ilargi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9781918/
https://www.ncbi.nlm.nih.gov/pubmed/36557617
http://dx.doi.org/10.3390/microorganisms10122364
_version_ 1784857192349827072
author Atxaerandio-Landa, Aitor
Arrieta-Gisasola, Ainhoa
Laorden, Lorena
Bikandi, Joseba
Garaizar, Javier
Martinez-Malaxetxebarria, Irati
Martinez-Ballesteros, Ilargi
author_facet Atxaerandio-Landa, Aitor
Arrieta-Gisasola, Ainhoa
Laorden, Lorena
Bikandi, Joseba
Garaizar, Javier
Martinez-Malaxetxebarria, Irati
Martinez-Ballesteros, Ilargi
author_sort Atxaerandio-Landa, Aitor
collection PubMed
description The use of whole-genome sequencing (WGS) for bacterial characterisation has increased substantially in the last decade. Its high throughput and decreasing cost have led to significant changes in outbreak investigations and surveillance of a wide variety of microbial pathogens. Despite the innumerable advantages of WGS, several drawbacks concerning data analysis and management, as well as a general lack of standardisation, hinder its integration in routine use. In this work, a bioinformatics workflow for (Illumina) WGS data is presented for bacterial characterisation including genome annotation, species identification, serotype prediction, antimicrobial resistance prediction, virulence-related genes and plasmid replicon detection, core-genome-based or single nucleotide polymorphism (SNP)-based phylogenetic clustering and sequence typing. Workflow was tested using a collection of 22 in-house sequences of Salmonella enterica isolates belonging to a local outbreak, coupled with a collection of 182 Salmonella genomes publicly available. No errors were reported during the execution period, and all genomes were analysed. The bioinformatics workflow can be tailored to other pathogens of interest and is freely available for academic and non-profit use as an uploadable file to the Galaxy platform.
format Online
Article
Text
id pubmed-9781918
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-97819182022-12-24 A Practical Bioinformatics Workflow for Routine Analysis of Bacterial WGS Data Atxaerandio-Landa, Aitor Arrieta-Gisasola, Ainhoa Laorden, Lorena Bikandi, Joseba Garaizar, Javier Martinez-Malaxetxebarria, Irati Martinez-Ballesteros, Ilargi Microorganisms Article The use of whole-genome sequencing (WGS) for bacterial characterisation has increased substantially in the last decade. Its high throughput and decreasing cost have led to significant changes in outbreak investigations and surveillance of a wide variety of microbial pathogens. Despite the innumerable advantages of WGS, several drawbacks concerning data analysis and management, as well as a general lack of standardisation, hinder its integration in routine use. In this work, a bioinformatics workflow for (Illumina) WGS data is presented for bacterial characterisation including genome annotation, species identification, serotype prediction, antimicrobial resistance prediction, virulence-related genes and plasmid replicon detection, core-genome-based or single nucleotide polymorphism (SNP)-based phylogenetic clustering and sequence typing. Workflow was tested using a collection of 22 in-house sequences of Salmonella enterica isolates belonging to a local outbreak, coupled with a collection of 182 Salmonella genomes publicly available. No errors were reported during the execution period, and all genomes were analysed. The bioinformatics workflow can be tailored to other pathogens of interest and is freely available for academic and non-profit use as an uploadable file to the Galaxy platform. MDPI 2022-11-29 /pmc/articles/PMC9781918/ /pubmed/36557617 http://dx.doi.org/10.3390/microorganisms10122364 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Atxaerandio-Landa, Aitor
Arrieta-Gisasola, Ainhoa
Laorden, Lorena
Bikandi, Joseba
Garaizar, Javier
Martinez-Malaxetxebarria, Irati
Martinez-Ballesteros, Ilargi
A Practical Bioinformatics Workflow for Routine Analysis of Bacterial WGS Data
title A Practical Bioinformatics Workflow for Routine Analysis of Bacterial WGS Data
title_full A Practical Bioinformatics Workflow for Routine Analysis of Bacterial WGS Data
title_fullStr A Practical Bioinformatics Workflow for Routine Analysis of Bacterial WGS Data
title_full_unstemmed A Practical Bioinformatics Workflow for Routine Analysis of Bacterial WGS Data
title_short A Practical Bioinformatics Workflow for Routine Analysis of Bacterial WGS Data
title_sort practical bioinformatics workflow for routine analysis of bacterial wgs data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9781918/
https://www.ncbi.nlm.nih.gov/pubmed/36557617
http://dx.doi.org/10.3390/microorganisms10122364
work_keys_str_mv AT atxaerandiolandaaitor apracticalbioinformaticsworkflowforroutineanalysisofbacterialwgsdata
AT arrietagisasolaainhoa apracticalbioinformaticsworkflowforroutineanalysisofbacterialwgsdata
AT laordenlorena apracticalbioinformaticsworkflowforroutineanalysisofbacterialwgsdata
AT bikandijoseba apracticalbioinformaticsworkflowforroutineanalysisofbacterialwgsdata
AT garaizarjavier apracticalbioinformaticsworkflowforroutineanalysisofbacterialwgsdata
AT martinezmalaxetxebarriairati apracticalbioinformaticsworkflowforroutineanalysisofbacterialwgsdata
AT martinezballesterosilargi apracticalbioinformaticsworkflowforroutineanalysisofbacterialwgsdata
AT atxaerandiolandaaitor practicalbioinformaticsworkflowforroutineanalysisofbacterialwgsdata
AT arrietagisasolaainhoa practicalbioinformaticsworkflowforroutineanalysisofbacterialwgsdata
AT laordenlorena practicalbioinformaticsworkflowforroutineanalysisofbacterialwgsdata
AT bikandijoseba practicalbioinformaticsworkflowforroutineanalysisofbacterialwgsdata
AT garaizarjavier practicalbioinformaticsworkflowforroutineanalysisofbacterialwgsdata
AT martinezmalaxetxebarriairati practicalbioinformaticsworkflowforroutineanalysisofbacterialwgsdata
AT martinezballesterosilargi practicalbioinformaticsworkflowforroutineanalysisofbacterialwgsdata