Cargando…

INSaFLU: an automated open web-based bioinformatics suite “from-reads” for influenza whole-genome-sequencing-based surveillance

BACKGROUND: A new era of flu surveillance has already started based on the genetic characterization and exploration of influenza virus evolution at whole-genome scale. Although this has been prioritized by national and international health authorities, the demanded technological transition to whole-...

Descripción completa

Detalles Bibliográficos
Autores principales: Borges, Vítor, Pinheiro, Miguel, Pechirra, Pedro, Guiomar, Raquel, Gomes, João Paulo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6027769/
https://www.ncbi.nlm.nih.gov/pubmed/29954441
http://dx.doi.org/10.1186/s13073-018-0555-0
_version_ 1783336666608435200
author Borges, Vítor
Pinheiro, Miguel
Pechirra, Pedro
Guiomar, Raquel
Gomes, João Paulo
author_facet Borges, Vítor
Pinheiro, Miguel
Pechirra, Pedro
Guiomar, Raquel
Gomes, João Paulo
author_sort Borges, Vítor
collection PubMed
description BACKGROUND: A new era of flu surveillance has already started based on the genetic characterization and exploration of influenza virus evolution at whole-genome scale. Although this has been prioritized by national and international health authorities, the demanded technological transition to whole-genome sequencing (WGS)-based flu surveillance has been particularly delayed by the lack of bioinformatics infrastructures and/or expertise to deal with primary next-generation sequencing (NGS) data. RESULTS: We developed and implemented INSaFLU (“INSide the FLU”), which is the first influenza-oriented bioinformatics free web-based suite that deals with primary NGS data (reads) towards the automatic generation of the output data that are actually the core first-line “genetic requests” for effective and timely influenza laboratory surveillance (e.g., type and sub-type, gene and whole-genome consensus sequences, variants’ annotation, alignments and phylogenetic trees). By handling NGS data collected from any amplicon-based schema, the implemented pipeline enables any laboratory to perform multi-step software intensive analyses in a user-friendly manner without previous advanced training in bioinformatics. INSaFLU gives access to user-restricted sample databases and projects management, being a transparent and flexible tool specifically designed to automatically update project outputs as more samples are uploaded. Data integration is thus cumulative and scalable, fitting the need for a continuous epidemiological surveillance during the flu epidemics. Multiple outputs are provided in nomenclature-stable and standardized formats that can be explored in situ or through multiple compatible downstream applications for fine-tuned data analysis. This platform additionally flags samples as “putative mixed infections” if the population admixture enrolls influenza viruses with clearly distinct genetic backgrounds, and enriches the traditional “consensus-based” influenza genetic characterization with relevant data on influenza sub-population diversification through a depth analysis of intra-patient minor variants. This dual approach is expected to strengthen our ability not only to detect the emergence of antigenic and drug resistance variants but also to decode alternative pathways of influenza evolution and to unveil intricate routes of transmission. CONCLUSIONS: In summary, INSaFLU supplies public health laboratories and influenza researchers with an open “one size fits all” framework, potentiating the operationalization of a harmonized multi-country WGS-based surveillance for influenza virus. INSaFLU can be accessed through https://insaflu.insa.pt. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13073-018-0555-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6027769
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-60277692018-07-09 INSaFLU: an automated open web-based bioinformatics suite “from-reads” for influenza whole-genome-sequencing-based surveillance Borges, Vítor Pinheiro, Miguel Pechirra, Pedro Guiomar, Raquel Gomes, João Paulo Genome Med Software BACKGROUND: A new era of flu surveillance has already started based on the genetic characterization and exploration of influenza virus evolution at whole-genome scale. Although this has been prioritized by national and international health authorities, the demanded technological transition to whole-genome sequencing (WGS)-based flu surveillance has been particularly delayed by the lack of bioinformatics infrastructures and/or expertise to deal with primary next-generation sequencing (NGS) data. RESULTS: We developed and implemented INSaFLU (“INSide the FLU”), which is the first influenza-oriented bioinformatics free web-based suite that deals with primary NGS data (reads) towards the automatic generation of the output data that are actually the core first-line “genetic requests” for effective and timely influenza laboratory surveillance (e.g., type and sub-type, gene and whole-genome consensus sequences, variants’ annotation, alignments and phylogenetic trees). By handling NGS data collected from any amplicon-based schema, the implemented pipeline enables any laboratory to perform multi-step software intensive analyses in a user-friendly manner without previous advanced training in bioinformatics. INSaFLU gives access to user-restricted sample databases and projects management, being a transparent and flexible tool specifically designed to automatically update project outputs as more samples are uploaded. Data integration is thus cumulative and scalable, fitting the need for a continuous epidemiological surveillance during the flu epidemics. Multiple outputs are provided in nomenclature-stable and standardized formats that can be explored in situ or through multiple compatible downstream applications for fine-tuned data analysis. This platform additionally flags samples as “putative mixed infections” if the population admixture enrolls influenza viruses with clearly distinct genetic backgrounds, and enriches the traditional “consensus-based” influenza genetic characterization with relevant data on influenza sub-population diversification through a depth analysis of intra-patient minor variants. This dual approach is expected to strengthen our ability not only to detect the emergence of antigenic and drug resistance variants but also to decode alternative pathways of influenza evolution and to unveil intricate routes of transmission. CONCLUSIONS: In summary, INSaFLU supplies public health laboratories and influenza researchers with an open “one size fits all” framework, potentiating the operationalization of a harmonized multi-country WGS-based surveillance for influenza virus. INSaFLU can be accessed through https://insaflu.insa.pt. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13073-018-0555-0) contains supplementary material, which is available to authorized users. BioMed Central 2018-06-29 /pmc/articles/PMC6027769/ /pubmed/29954441 http://dx.doi.org/10.1186/s13073-018-0555-0 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Borges, Vítor
Pinheiro, Miguel
Pechirra, Pedro
Guiomar, Raquel
Gomes, João Paulo
INSaFLU: an automated open web-based bioinformatics suite “from-reads” for influenza whole-genome-sequencing-based surveillance
title INSaFLU: an automated open web-based bioinformatics suite “from-reads” for influenza whole-genome-sequencing-based surveillance
title_full INSaFLU: an automated open web-based bioinformatics suite “from-reads” for influenza whole-genome-sequencing-based surveillance
title_fullStr INSaFLU: an automated open web-based bioinformatics suite “from-reads” for influenza whole-genome-sequencing-based surveillance
title_full_unstemmed INSaFLU: an automated open web-based bioinformatics suite “from-reads” for influenza whole-genome-sequencing-based surveillance
title_short INSaFLU: an automated open web-based bioinformatics suite “from-reads” for influenza whole-genome-sequencing-based surveillance
title_sort insaflu: an automated open web-based bioinformatics suite “from-reads” for influenza whole-genome-sequencing-based surveillance
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6027769/
https://www.ncbi.nlm.nih.gov/pubmed/29954441
http://dx.doi.org/10.1186/s13073-018-0555-0
work_keys_str_mv AT borgesvitor insafluanautomatedopenwebbasedbioinformaticssuitefromreadsforinfluenzawholegenomesequencingbasedsurveillance
AT pinheiromiguel insafluanautomatedopenwebbasedbioinformaticssuitefromreadsforinfluenzawholegenomesequencingbasedsurveillance
AT pechirrapedro insafluanautomatedopenwebbasedbioinformaticssuitefromreadsforinfluenzawholegenomesequencingbasedsurveillance
AT guiomarraquel insafluanautomatedopenwebbasedbioinformaticssuitefromreadsforinfluenzawholegenomesequencingbasedsurveillance
AT gomesjoaopaulo insafluanautomatedopenwebbasedbioinformaticssuitefromreadsforinfluenzawholegenomesequencingbasedsurveillance