Cargando…
ViReMa: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids
BACKGROUND: Genetic recombination is a tremendous source of intrahost diversity in viruses and is critical for their ability to rapidly adapt to new environments or fitness challenges. While viruses are routinely characterized using high-throughput sequencing techniques, characterizing the genetic p...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025937/ https://www.ncbi.nlm.nih.gov/pubmed/36939008 http://dx.doi.org/10.1093/gigascience/giad009 |
_version_ | 1784909440696188928 |
---|---|
author | Sotcheff, Stephanea Zhou, Yiyang Yeung, Jason Sun, Yan Johnson, John E Torbett, Bruce E Routh, Andrew L |
author_facet | Sotcheff, Stephanea Zhou, Yiyang Yeung, Jason Sun, Yan Johnson, John E Torbett, Bruce E Routh, Andrew L |
author_sort | Sotcheff, Stephanea |
collection | PubMed |
description | BACKGROUND: Genetic recombination is a tremendous source of intrahost diversity in viruses and is critical for their ability to rapidly adapt to new environments or fitness challenges. While viruses are routinely characterized using high-throughput sequencing techniques, characterizing the genetic products of recombination in next-generation sequencing data remains a challenge. Viral recombination events can be highly diverse and variable in nature, including simple duplications and deletions, or more complex events such as copy/snap-back recombination, intervirus or intersegment recombination, and insertions of host nucleic acids. Due to the variable mechanisms driving virus recombination and the different selection pressures acting on the progeny, recombination junctions rarely adhere to simple canonical sites or sequences. Furthermore, numerous different events may be present simultaneously in a viral population, yielding a complex mutational landscape. FINDINGS: We have previously developed an algorithm called ViReMa (Virus Recombination Mapper) that bootstraps the bowtie short-read aligner to capture and annotate a wide range of recombinant species found within virus populations. Here, we have updated ViReMa to provide an “error density” function designed to accurately detect recombination events in the longer reads now routinely generated by the Illumina platforms and provide output reports for multiple types of recombinant species using standardized formats. We demonstrate the utility and flexibility of ViReMa in different settings to report deletion events in simulated data from Flock House virus, copy-back RNA species in Sendai viruses, short duplication events in HIV, and virus-to-host recombination in an archaeal DNA virus. |
format | Online Article Text |
id | pubmed-10025937 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-100259372023-03-21 ViReMa: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids Sotcheff, Stephanea Zhou, Yiyang Yeung, Jason Sun, Yan Johnson, John E Torbett, Bruce E Routh, Andrew L Gigascience Technical Note BACKGROUND: Genetic recombination is a tremendous source of intrahost diversity in viruses and is critical for their ability to rapidly adapt to new environments or fitness challenges. While viruses are routinely characterized using high-throughput sequencing techniques, characterizing the genetic products of recombination in next-generation sequencing data remains a challenge. Viral recombination events can be highly diverse and variable in nature, including simple duplications and deletions, or more complex events such as copy/snap-back recombination, intervirus or intersegment recombination, and insertions of host nucleic acids. Due to the variable mechanisms driving virus recombination and the different selection pressures acting on the progeny, recombination junctions rarely adhere to simple canonical sites or sequences. Furthermore, numerous different events may be present simultaneously in a viral population, yielding a complex mutational landscape. FINDINGS: We have previously developed an algorithm called ViReMa (Virus Recombination Mapper) that bootstraps the bowtie short-read aligner to capture and annotate a wide range of recombinant species found within virus populations. Here, we have updated ViReMa to provide an “error density” function designed to accurately detect recombination events in the longer reads now routinely generated by the Illumina platforms and provide output reports for multiple types of recombinant species using standardized formats. We demonstrate the utility and flexibility of ViReMa in different settings to report deletion events in simulated data from Flock House virus, copy-back RNA species in Sendai viruses, short duplication events in HIV, and virus-to-host recombination in an archaeal DNA virus. Oxford University Press 2023-03-20 /pmc/articles/PMC10025937/ /pubmed/36939008 http://dx.doi.org/10.1093/gigascience/giad009 Text en © The Author(s) 2023. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Technical Note Sotcheff, Stephanea Zhou, Yiyang Yeung, Jason Sun, Yan Johnson, John E Torbett, Bruce E Routh, Andrew L ViReMa: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids |
title |
ViReMa: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids |
title_full |
ViReMa: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids |
title_fullStr |
ViReMa: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids |
title_full_unstemmed |
ViReMa: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids |
title_short |
ViReMa: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids |
title_sort | virema: a virus recombination mapper of next-generation sequencing data characterizes diverse recombinant viral nucleic acids |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025937/ https://www.ncbi.nlm.nih.gov/pubmed/36939008 http://dx.doi.org/10.1093/gigascience/giad009 |
work_keys_str_mv | AT sotcheffstephanea viremaavirusrecombinationmapperofnextgenerationsequencingdatacharacterizesdiverserecombinantviralnucleicacids AT zhouyiyang viremaavirusrecombinationmapperofnextgenerationsequencingdatacharacterizesdiverserecombinantviralnucleicacids AT yeungjason viremaavirusrecombinationmapperofnextgenerationsequencingdatacharacterizesdiverserecombinantviralnucleicacids AT sunyan viremaavirusrecombinationmapperofnextgenerationsequencingdatacharacterizesdiverserecombinantviralnucleicacids AT johnsonjohne viremaavirusrecombinationmapperofnextgenerationsequencingdatacharacterizesdiverserecombinantviralnucleicacids AT torbettbrucee viremaavirusrecombinationmapperofnextgenerationsequencingdatacharacterizesdiverserecombinantviralnucleicacids AT routhandrewl viremaavirusrecombinationmapperofnextgenerationsequencingdatacharacterizesdiverserecombinantviralnucleicacids |