Cargando…

ViReMa: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids

BACKGROUND: Genetic recombination is a tremendous source of intrahost diversity in viruses and is critical for their ability to rapidly adapt to new environments or fitness challenges. While viruses are routinely characterized using high-throughput sequencing techniques, characterizing the genetic p...

Descripción completa

Detalles Bibliográficos
Autores principales: Sotcheff, Stephanea, Zhou, Yiyang, Yeung, Jason, Sun, Yan, Johnson, John E, Torbett, Bruce E, Routh, Andrew L
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025937/
https://www.ncbi.nlm.nih.gov/pubmed/36939008
http://dx.doi.org/10.1093/gigascience/giad009
_version_ 1784909440696188928
author Sotcheff, Stephanea
Zhou, Yiyang
Yeung, Jason
Sun, Yan
Johnson, John E
Torbett, Bruce E
Routh, Andrew L
author_facet Sotcheff, Stephanea
Zhou, Yiyang
Yeung, Jason
Sun, Yan
Johnson, John E
Torbett, Bruce E
Routh, Andrew L
author_sort Sotcheff, Stephanea
collection PubMed
description BACKGROUND: Genetic recombination is a tremendous source of intrahost diversity in viruses and is critical for their ability to rapidly adapt to new environments or fitness challenges. While viruses are routinely characterized using high-throughput sequencing techniques, characterizing the genetic products of recombination in next-generation sequencing data remains a challenge. Viral recombination events can be highly diverse and variable in nature, including simple duplications and deletions, or more complex events such as copy/snap-back recombination, intervirus or intersegment recombination, and insertions of host nucleic acids. Due to the variable mechanisms driving virus recombination and the different selection pressures acting on the progeny, recombination junctions rarely adhere to simple canonical sites or sequences. Furthermore, numerous different events may be present simultaneously in a viral population, yielding a complex mutational landscape. FINDINGS: We have previously developed an algorithm called ViReMa (Virus Recombination Mapper) that bootstraps the bowtie short-read aligner to capture and annotate a wide range of recombinant species found within virus populations. Here, we have updated ViReMa to provide an “error density” function designed to accurately detect recombination events in the longer reads now routinely generated by the Illumina platforms and provide output reports for multiple types of recombinant species using standardized formats. We demonstrate the utility and flexibility of ViReMa in different settings to report deletion events in simulated data from Flock House virus, copy-back RNA species in Sendai viruses, short duplication events in HIV, and virus-to-host recombination in an archaeal DNA virus.
format Online
Article
Text
id pubmed-10025937
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-100259372023-03-21 ViReMa: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids Sotcheff, Stephanea Zhou, Yiyang Yeung, Jason Sun, Yan Johnson, John E Torbett, Bruce E Routh, Andrew L Gigascience Technical Note BACKGROUND: Genetic recombination is a tremendous source of intrahost diversity in viruses and is critical for their ability to rapidly adapt to new environments or fitness challenges. While viruses are routinely characterized using high-throughput sequencing techniques, characterizing the genetic products of recombination in next-generation sequencing data remains a challenge. Viral recombination events can be highly diverse and variable in nature, including simple duplications and deletions, or more complex events such as copy/snap-back recombination, intervirus or intersegment recombination, and insertions of host nucleic acids. Due to the variable mechanisms driving virus recombination and the different selection pressures acting on the progeny, recombination junctions rarely adhere to simple canonical sites or sequences. Furthermore, numerous different events may be present simultaneously in a viral population, yielding a complex mutational landscape. FINDINGS: We have previously developed an algorithm called ViReMa (Virus Recombination Mapper) that bootstraps the bowtie short-read aligner to capture and annotate a wide range of recombinant species found within virus populations. Here, we have updated ViReMa to provide an “error density” function designed to accurately detect recombination events in the longer reads now routinely generated by the Illumina platforms and provide output reports for multiple types of recombinant species using standardized formats. We demonstrate the utility and flexibility of ViReMa in different settings to report deletion events in simulated data from Flock House virus, copy-back RNA species in Sendai viruses, short duplication events in HIV, and virus-to-host recombination in an archaeal DNA virus. Oxford University Press 2023-03-20 /pmc/articles/PMC10025937/ /pubmed/36939008 http://dx.doi.org/10.1093/gigascience/giad009 Text en © The Author(s) 2023. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Sotcheff, Stephanea
Zhou, Yiyang
Yeung, Jason
Sun, Yan
Johnson, John E
Torbett, Bruce E
Routh, Andrew L
ViReMa: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids
title ViReMa: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids
title_full ViReMa: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids
title_fullStr ViReMa: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids
title_full_unstemmed ViReMa: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids
title_short ViReMa: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids
title_sort virema: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025937/
https://www.ncbi.nlm.nih.gov/pubmed/36939008
http://dx.doi.org/10.1093/gigascience/giad009
work_keys_str_mv AT sotcheffstephanea viremaavirusrecombinationmapperofnextgenerationsequencingdatacharacterizesdiverserecombinantviralnucleicacids
AT zhouyiyang viremaavirusrecombinationmapperofnextgenerationsequencingdatacharacterizesdiverserecombinantviralnucleicacids
AT yeungjason viremaavirusrecombinationmapperofnextgenerationsequencingdatacharacterizesdiverserecombinantviralnucleicacids
AT sunyan viremaavirusrecombinationmapperofnextgenerationsequencingdatacharacterizesdiverserecombinantviralnucleicacids
AT johnsonjohne viremaavirusrecombinationmapperofnextgenerationsequencingdatacharacterizesdiverserecombinantviralnucleicacids
AT torbettbrucee viremaavirusrecombinationmapperofnextgenerationsequencingdatacharacterizesdiverserecombinantviralnucleicacids
AT routhandrewl viremaavirusrecombinationmapperofnextgenerationsequencingdatacharacterizesdiverserecombinantviralnucleicacids