Cargando…
FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA
DNA viruses are important infectious agents known to mediate a large number of human diseases, including cancer. Viral integration into the host genome and the formation of hybrid transcripts are also associated with increased pathogenicity. The high variability of viral genomes, however requires th...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9041341/ https://www.ncbi.nlm.nih.gov/pubmed/35493723 http://dx.doi.org/10.1093/nargab/lqac032 |
_version_ | 1784694525252337664 |
---|---|
author | Javadzadeh, Sara Rajkumar, Utkrisht Nguyen, Nam Sarmashghi, Shahab Luebeck, Jens Shang, Jingbo Bafna, Vineet |
author_facet | Javadzadeh, Sara Rajkumar, Utkrisht Nguyen, Nam Sarmashghi, Shahab Luebeck, Jens Shang, Jingbo Bafna, Vineet |
author_sort | Javadzadeh, Sara |
collection | PubMed |
description | DNA viruses are important infectious agents known to mediate a large number of human diseases, including cancer. Viral integration into the host genome and the formation of hybrid transcripts are also associated with increased pathogenicity. The high variability of viral genomes, however requires the use of sensitive ensemble hidden Markov models that add to the computational complexity, often requiring > 40 CPU-hours per sample. Here, we describe FastViFi, a fast 2-stage filtering method that reduces the computational burden. On simulated and cancer genomic data, FastViFi improved the running time by 2 orders of magnitude with comparable accuracy on challenging data sets. Recently published methods have focused on identification of location of viral integration into the human host genome using local assembly, but do not extend to RNA. To identify human viral hybrid transcripts, we additionally developed ensemble Hidden Markov Models for the Epstein Barr virus (EBV) to add to the models for Hepatitis B (HBV), Hepatitis C (HCV) viruses and the Human Papillomavirus (HPV), and used FastViFi to query RNA-seq data from Gastric cancer (EBV) and liver cancer (HBV/HCV). FastViFi ran in <10 minutes per sample and identified multiple hybrids that fuse viral and human genes suggesting new mechanisms for oncoviral pathogenicity. FastViFi is available at https://github.com/sara-javadzadeh/FastViFi. |
format | Online Article Text |
id | pubmed-9041341 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-90413412022-04-27 FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA Javadzadeh, Sara Rajkumar, Utkrisht Nguyen, Nam Sarmashghi, Shahab Luebeck, Jens Shang, Jingbo Bafna, Vineet NAR Genom Bioinform Standard Article DNA viruses are important infectious agents known to mediate a large number of human diseases, including cancer. Viral integration into the host genome and the formation of hybrid transcripts are also associated with increased pathogenicity. The high variability of viral genomes, however requires the use of sensitive ensemble hidden Markov models that add to the computational complexity, often requiring > 40 CPU-hours per sample. Here, we describe FastViFi, a fast 2-stage filtering method that reduces the computational burden. On simulated and cancer genomic data, FastViFi improved the running time by 2 orders of magnitude with comparable accuracy on challenging data sets. Recently published methods have focused on identification of location of viral integration into the human host genome using local assembly, but do not extend to RNA. To identify human viral hybrid transcripts, we additionally developed ensemble Hidden Markov Models for the Epstein Barr virus (EBV) to add to the models for Hepatitis B (HBV), Hepatitis C (HCV) viruses and the Human Papillomavirus (HPV), and used FastViFi to query RNA-seq data from Gastric cancer (EBV) and liver cancer (HBV/HCV). FastViFi ran in <10 minutes per sample and identified multiple hybrids that fuse viral and human genes suggesting new mechanisms for oncoviral pathogenicity. FastViFi is available at https://github.com/sara-javadzadeh/FastViFi. Oxford University Press 2022-04-26 /pmc/articles/PMC9041341/ /pubmed/35493723 http://dx.doi.org/10.1093/nargab/lqac032 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Standard Article Javadzadeh, Sara Rajkumar, Utkrisht Nguyen, Nam Sarmashghi, Shahab Luebeck, Jens Shang, Jingbo Bafna, Vineet FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA |
title | FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA |
title_full | FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA |
title_fullStr | FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA |
title_full_unstemmed | FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA |
title_short | FastViFi: Fast and accurate detection of (Hybrid) Viral DNA and RNA |
title_sort | fastvifi: fast and accurate detection of (hybrid) viral dna and rna |
topic | Standard Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9041341/ https://www.ncbi.nlm.nih.gov/pubmed/35493723 http://dx.doi.org/10.1093/nargab/lqac032 |
work_keys_str_mv | AT javadzadehsara fastvififastandaccuratedetectionofhybridviraldnaandrna AT rajkumarutkrisht fastvififastandaccuratedetectionofhybridviraldnaandrna AT nguyennam fastvififastandaccuratedetectionofhybridviraldnaandrna AT sarmashghishahab fastvififastandaccuratedetectionofhybridviraldnaandrna AT luebeckjens fastvififastandaccuratedetectionofhybridviraldnaandrna AT shangjingbo fastvififastandaccuratedetectionofhybridviraldnaandrna AT bafnavineet fastvififastandaccuratedetectionofhybridviraldnaandrna |