Cargando…

Influenza classification from short reads with VAPOR facilitates robust mapping pipelines and zoonotic strain detection for routine surveillance applications

MOTIVATION: Influenza viruses represent a global public health burden due to annual epidemics and pandemic potential. Due to a rapidly evolving RNA genome, inter-species transmission, intra-host variation, and noise in short-read data, reads can be lost during mapping, and de novo assembly can be ti...

Descripción completa

Detalles Bibliográficos
Autores principales: Southgate, Joel A, Bull, Matthew J, Brown, Clare M, Watkins, Joanne, Corden, Sally, Southgate, Benjamin, Moore, Catherine, Connor, Thomas R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7703727/
https://www.ncbi.nlm.nih.gov/pubmed/31693070
http://dx.doi.org/10.1093/bioinformatics/btz814
_version_ 1783616682700308480
author Southgate, Joel A
Bull, Matthew J
Brown, Clare M
Watkins, Joanne
Corden, Sally
Southgate, Benjamin
Moore, Catherine
Connor, Thomas R
author_facet Southgate, Joel A
Bull, Matthew J
Brown, Clare M
Watkins, Joanne
Corden, Sally
Southgate, Benjamin
Moore, Catherine
Connor, Thomas R
author_sort Southgate, Joel A
collection PubMed
description MOTIVATION: Influenza viruses represent a global public health burden due to annual epidemics and pandemic potential. Due to a rapidly evolving RNA genome, inter-species transmission, intra-host variation, and noise in short-read data, reads can be lost during mapping, and de novo assembly can be time consuming and result in misassembly. We assessed read loss during mapping and designed a graph-based classifier, VAPOR, for selecting mapping references, assembly validation and detection of strains of non-human origin. RESULTS: Standard human reference viruses were insufficient for mapping diverse influenza samples in simulation. VAPOR retrieved references for 257 real whole-genome sequencing samples with a mean of [Formula: see text] identity to assemblies, and increased the proportion of mapped reads by up to 13.3% compared to standard references. VAPOR has the potential to improve the robustness of bioinformatics pipelines for surveillance and could be adapted to other RNA viruses. AVAILABILITY AND IMPLEMENTATION: VAPOR is available at https://github.com/connor-lab/vapor. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7703727
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-77037272020-12-07 Influenza classification from short reads with VAPOR facilitates robust mapping pipelines and zoonotic strain detection for routine surveillance applications Southgate, Joel A Bull, Matthew J Brown, Clare M Watkins, Joanne Corden, Sally Southgate, Benjamin Moore, Catherine Connor, Thomas R Bioinformatics Original Papers MOTIVATION: Influenza viruses represent a global public health burden due to annual epidemics and pandemic potential. Due to a rapidly evolving RNA genome, inter-species transmission, intra-host variation, and noise in short-read data, reads can be lost during mapping, and de novo assembly can be time consuming and result in misassembly. We assessed read loss during mapping and designed a graph-based classifier, VAPOR, for selecting mapping references, assembly validation and detection of strains of non-human origin. RESULTS: Standard human reference viruses were insufficient for mapping diverse influenza samples in simulation. VAPOR retrieved references for 257 real whole-genome sequencing samples with a mean of [Formula: see text] identity to assemblies, and increased the proportion of mapped reads by up to 13.3% compared to standard references. VAPOR has the potential to improve the robustness of bioinformatics pipelines for surveillance and could be adapted to other RNA viruses. AVAILABILITY AND IMPLEMENTATION: VAPOR is available at https://github.com/connor-lab/vapor. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-03-15 2019-11-06 /pmc/articles/PMC7703727/ /pubmed/31693070 http://dx.doi.org/10.1093/bioinformatics/btz814 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Southgate, Joel A
Bull, Matthew J
Brown, Clare M
Watkins, Joanne
Corden, Sally
Southgate, Benjamin
Moore, Catherine
Connor, Thomas R
Influenza classification from short reads with VAPOR facilitates robust mapping pipelines and zoonotic strain detection for routine surveillance applications
title Influenza classification from short reads with VAPOR facilitates robust mapping pipelines and zoonotic strain detection for routine surveillance applications
title_full Influenza classification from short reads with VAPOR facilitates robust mapping pipelines and zoonotic strain detection for routine surveillance applications
title_fullStr Influenza classification from short reads with VAPOR facilitates robust mapping pipelines and zoonotic strain detection for routine surveillance applications
title_full_unstemmed Influenza classification from short reads with VAPOR facilitates robust mapping pipelines and zoonotic strain detection for routine surveillance applications
title_short Influenza classification from short reads with VAPOR facilitates robust mapping pipelines and zoonotic strain detection for routine surveillance applications
title_sort influenza classification from short reads with vapor facilitates robust mapping pipelines and zoonotic strain detection for routine surveillance applications
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7703727/
https://www.ncbi.nlm.nih.gov/pubmed/31693070
http://dx.doi.org/10.1093/bioinformatics/btz814
work_keys_str_mv AT southgatejoela influenzaclassificationfromshortreadswithvaporfacilitatesrobustmappingpipelinesandzoonoticstraindetectionforroutinesurveillanceapplications
AT bullmatthewj influenzaclassificationfromshortreadswithvaporfacilitatesrobustmappingpipelinesandzoonoticstraindetectionforroutinesurveillanceapplications
AT brownclarem influenzaclassificationfromshortreadswithvaporfacilitatesrobustmappingpipelinesandzoonoticstraindetectionforroutinesurveillanceapplications
AT watkinsjoanne influenzaclassificationfromshortreadswithvaporfacilitatesrobustmappingpipelinesandzoonoticstraindetectionforroutinesurveillanceapplications
AT cordensally influenzaclassificationfromshortreadswithvaporfacilitatesrobustmappingpipelinesandzoonoticstraindetectionforroutinesurveillanceapplications
AT southgatebenjamin influenzaclassificationfromshortreadswithvaporfacilitatesrobustmappingpipelinesandzoonoticstraindetectionforroutinesurveillanceapplications
AT moorecatherine influenzaclassificationfromshortreadswithvaporfacilitatesrobustmappingpipelinesandzoonoticstraindetectionforroutinesurveillanceapplications
AT connorthomasr influenzaclassificationfromshortreadswithvaporfacilitatesrobustmappingpipelinesandzoonoticstraindetectionforroutinesurveillanceapplications