Cargando…

DVGfinder: A Metasearch Tool for Identifying Defective Viral Genomes in RNA-Seq Data

The generation of different types of defective viral genomes (DVG) is an unavoidable consequence of the error-prone replication of RNA viruses. In recent years, a particular class of DVGs, those containing long deletions or genome rearrangements, has gain interest due to their potential therapeutic...

Descripción completa

Detalles Bibliográficos
Autores principales: Olmo-Uceda, Maria J., Muñoz-Sánchez, Juan C., Lasso-Giraldo, Wilberth, Arnau, Vicente, Díaz-Villanueva, Wladimiro, Elena, Santiago F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9144107/
https://www.ncbi.nlm.nih.gov/pubmed/35632855
http://dx.doi.org/10.3390/v14051114
_version_ 1784715968327450624
author Olmo-Uceda, Maria J.
Muñoz-Sánchez, Juan C.
Lasso-Giraldo, Wilberth
Arnau, Vicente
Díaz-Villanueva, Wladimiro
Elena, Santiago F.
author_facet Olmo-Uceda, Maria J.
Muñoz-Sánchez, Juan C.
Lasso-Giraldo, Wilberth
Arnau, Vicente
Díaz-Villanueva, Wladimiro
Elena, Santiago F.
author_sort Olmo-Uceda, Maria J.
collection PubMed
description The generation of different types of defective viral genomes (DVG) is an unavoidable consequence of the error-prone replication of RNA viruses. In recent years, a particular class of DVGs, those containing long deletions or genome rearrangements, has gain interest due to their potential therapeutic and biotechnological applications. Identifying such DVGs in high-throughput sequencing (HTS) data has become an interesting computational problem. Several algorithms have been proposed to accomplish this goal, though all incur false positives, a problem of practical interest if such DVGs have to be synthetized and tested in the laboratory. We present a metasearch tool, DVGfinder, that wraps the two most commonly used DVG search algorithms in a single workflow for the identification of the DVGs in HTS data. DVGfinder processes the results of ViReMa-a and DI-tector and uses a gradient boosting classifier machine learning algorithm to reduce the number of false-positive events. The program also generates output files in user-friendly HTML format, which can help users to explore the DVGs identified in the sample. We evaluated the performance of DVGfinder compared to the two search algorithms used separately and found that it slightly improves sensitivities for low-coverage synthetic HTS data and DI-tector precision for high-coverage samples. The metasearch program also showed higher sensitivity on a real sample for which a set of copy-backs were previously validated.
format Online
Article
Text
id pubmed-9144107
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91441072022-05-29 DVGfinder: A Metasearch Tool for Identifying Defective Viral Genomes in RNA-Seq Data Olmo-Uceda, Maria J. Muñoz-Sánchez, Juan C. Lasso-Giraldo, Wilberth Arnau, Vicente Díaz-Villanueva, Wladimiro Elena, Santiago F. Viruses Article The generation of different types of defective viral genomes (DVG) is an unavoidable consequence of the error-prone replication of RNA viruses. In recent years, a particular class of DVGs, those containing long deletions or genome rearrangements, has gain interest due to their potential therapeutic and biotechnological applications. Identifying such DVGs in high-throughput sequencing (HTS) data has become an interesting computational problem. Several algorithms have been proposed to accomplish this goal, though all incur false positives, a problem of practical interest if such DVGs have to be synthetized and tested in the laboratory. We present a metasearch tool, DVGfinder, that wraps the two most commonly used DVG search algorithms in a single workflow for the identification of the DVGs in HTS data. DVGfinder processes the results of ViReMa-a and DI-tector and uses a gradient boosting classifier machine learning algorithm to reduce the number of false-positive events. The program also generates output files in user-friendly HTML format, which can help users to explore the DVGs identified in the sample. We evaluated the performance of DVGfinder compared to the two search algorithms used separately and found that it slightly improves sensitivities for low-coverage synthetic HTS data and DI-tector precision for high-coverage samples. The metasearch program also showed higher sensitivity on a real sample for which a set of copy-backs were previously validated. MDPI 2022-05-23 /pmc/articles/PMC9144107/ /pubmed/35632855 http://dx.doi.org/10.3390/v14051114 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Olmo-Uceda, Maria J.
Muñoz-Sánchez, Juan C.
Lasso-Giraldo, Wilberth
Arnau, Vicente
Díaz-Villanueva, Wladimiro
Elena, Santiago F.
DVGfinder: A Metasearch Tool for Identifying Defective Viral Genomes in RNA-Seq Data
title DVGfinder: A Metasearch Tool for Identifying Defective Viral Genomes in RNA-Seq Data
title_full DVGfinder: A Metasearch Tool for Identifying Defective Viral Genomes in RNA-Seq Data
title_fullStr DVGfinder: A Metasearch Tool for Identifying Defective Viral Genomes in RNA-Seq Data
title_full_unstemmed DVGfinder: A Metasearch Tool for Identifying Defective Viral Genomes in RNA-Seq Data
title_short DVGfinder: A Metasearch Tool for Identifying Defective Viral Genomes in RNA-Seq Data
title_sort dvgfinder: a metasearch tool for identifying defective viral genomes in rna-seq data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9144107/
https://www.ncbi.nlm.nih.gov/pubmed/35632855
http://dx.doi.org/10.3390/v14051114
work_keys_str_mv AT olmoucedamariaj dvgfinderametasearchtoolforidentifyingdefectiveviralgenomesinrnaseqdata
AT munozsanchezjuanc dvgfinderametasearchtoolforidentifyingdefectiveviralgenomesinrnaseqdata
AT lassogiraldowilberth dvgfinderametasearchtoolforidentifyingdefectiveviralgenomesinrnaseqdata
AT arnauvicente dvgfinderametasearchtoolforidentifyingdefectiveviralgenomesinrnaseqdata
AT diazvillanuevawladimiro dvgfinderametasearchtoolforidentifyingdefectiveviralgenomesinrnaseqdata
AT elenasantiagof dvgfinderametasearchtoolforidentifyingdefectiveviralgenomesinrnaseqdata