Cargando…

FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA

DNA viruses are important infectious agents known to mediate a large number of human diseases, including cancer. Viral integration into the host genome and the formation of hybrid transcripts are also associated with increased pathogenicity. The high variability of viral genomes, however requires th...

Descripción completa

Detalles Bibliográficos
Autores principales: Javadzadeh, Sara, Rajkumar, Utkrisht, Nguyen, Nam, Sarmashghi, Shahab, Luebeck, Jens, Shang, Jingbo, Bafna, Vineet
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9041341/
https://www.ncbi.nlm.nih.gov/pubmed/35493723
http://dx.doi.org/10.1093/nargab/lqac032
_version_ 1784694525252337664
author Javadzadeh, Sara
Rajkumar, Utkrisht
Nguyen, Nam
Sarmashghi, Shahab
Luebeck, Jens
Shang, Jingbo
Bafna, Vineet
author_facet Javadzadeh, Sara
Rajkumar, Utkrisht
Nguyen, Nam
Sarmashghi, Shahab
Luebeck, Jens
Shang, Jingbo
Bafna, Vineet
author_sort Javadzadeh, Sara
collection PubMed
description DNA viruses are important infectious agents known to mediate a large number of human diseases, including cancer. Viral integration into the host genome and the formation of hybrid transcripts are also associated with increased pathogenicity. The high variability of viral genomes, however requires the use of sensitive ensemble hidden Markov models that add to the computational complexity, often requiring > 40 CPU-hours per sample. Here, we describe FastViFi, a fast 2-stage filtering method that reduces the computational burden. On simulated and cancer genomic data, FastViFi improved the running time by 2 orders of magnitude with comparable accuracy on challenging data sets. Recently published methods have focused on identification of location of viral integration into the human host genome using local assembly, but do not extend to RNA. To identify human viral hybrid transcripts, we additionally developed ensemble Hidden Markov Models for the Epstein Barr virus (EBV) to add to the models for Hepatitis B (HBV), Hepatitis C (HCV) viruses and the Human Papillomavirus (HPV), and used FastViFi to query RNA-seq data from Gastric cancer (EBV) and liver cancer (HBV/HCV). FastViFi ran in <10 minutes per sample and identified multiple hybrids that fuse viral and human genes suggesting new mechanisms for oncoviral pathogenicity. FastViFi is available at https://github.com/sara-javadzadeh/FastViFi.
format Online
Article
Text
id pubmed-9041341
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-90413412022-04-27 FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA Javadzadeh, Sara Rajkumar, Utkrisht Nguyen, Nam Sarmashghi, Shahab Luebeck, Jens Shang, Jingbo Bafna, Vineet NAR Genom Bioinform Standard Article DNA viruses are important infectious agents known to mediate a large number of human diseases, including cancer. Viral integration into the host genome and the formation of hybrid transcripts are also associated with increased pathogenicity. The high variability of viral genomes, however requires the use of sensitive ensemble hidden Markov models that add to the computational complexity, often requiring > 40 CPU-hours per sample. Here, we describe FastViFi, a fast 2-stage filtering method that reduces the computational burden. On simulated and cancer genomic data, FastViFi improved the running time by 2 orders of magnitude with comparable accuracy on challenging data sets. Recently published methods have focused on identification of location of viral integration into the human host genome using local assembly, but do not extend to RNA. To identify human viral hybrid transcripts, we additionally developed ensemble Hidden Markov Models for the Epstein Barr virus (EBV) to add to the models for Hepatitis B (HBV), Hepatitis C (HCV) viruses and the Human Papillomavirus (HPV), and used FastViFi to query RNA-seq data from Gastric cancer (EBV) and liver cancer (HBV/HCV). FastViFi ran in <10 minutes per sample and identified multiple hybrids that fuse viral and human genes suggesting new mechanisms for oncoviral pathogenicity. FastViFi is available at https://github.com/sara-javadzadeh/FastViFi. Oxford University Press 2022-04-26 /pmc/articles/PMC9041341/ /pubmed/35493723 http://dx.doi.org/10.1093/nargab/lqac032 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Standard Article
Javadzadeh, Sara
Rajkumar, Utkrisht
Nguyen, Nam
Sarmashghi, Shahab
Luebeck, Jens
Shang, Jingbo
Bafna, Vineet
FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA
title FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA
title_full FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA
title_fullStr FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA
title_full_unstemmed FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA
title_short FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA
title_sort fastvifi: fast and accurate detection of (hybrid) viral dna and rna
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9041341/
https://www.ncbi.nlm.nih.gov/pubmed/35493723
http://dx.doi.org/10.1093/nargab/lqac032
work_keys_str_mv AT javadzadehsara fastvififastandaccuratedetectionofhybridviraldnaandrna
AT rajkumarutkrisht fastvififastandaccuratedetectionofhybridviraldnaandrna
AT nguyennam fastvififastandaccuratedetectionofhybridviraldnaandrna
AT sarmashghishahab fastvififastandaccuratedetectionofhybridviraldnaandrna
AT luebeckjens fastvififastandaccuratedetectionofhybridviraldnaandrna
AT shangjingbo fastvififastandaccuratedetectionofhybridviraldnaandrna
AT bafnavineet fastvififastandaccuratedetectionofhybridviraldnaandrna