Cargando…
BATVI: Fast, sensitive and accurate detection of virus integrations
BACKGROUND: The study of virus integrations in human genome is important since virus integrations were shown to be associated with diseases. In the literature, few methods have been proposed that predict virus integrations using next generation sequencing datasets. Although they work, they are slow...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374687/ https://www.ncbi.nlm.nih.gov/pubmed/28361674 http://dx.doi.org/10.1186/s12859-017-1470-x |
_version_ | 1782518943370444800 |
---|---|
author | Tennakoon, Chandana Sung, Wing Kin |
author_facet | Tennakoon, Chandana Sung, Wing Kin |
author_sort | Tennakoon, Chandana |
collection | PubMed |
description | BACKGROUND: The study of virus integrations in human genome is important since virus integrations were shown to be associated with diseases. In the literature, few methods have been proposed that predict virus integrations using next generation sequencing datasets. Although they work, they are slow and are not very sensitive. RESULTS AND DISCUSSION: This paper introduces a new method BatVI to predict viral integrations. Our method uses a fast screening method to filter out chimeric reads containing possible viral integrations. Next, sensitive alignments of these candidate chimeric reads are called by BLAST. Chimeric reads that are co-localized in the human genome are clustered. Finally, by assembling the chimeric reads in each cluster, high confident virus integration sites are extracted. CONCLUSION: We compared the performance of BatVI with existing methods VirusFinder and VirusSeq using both simulated and real-life datasets of liver cancer patients. BatVI ran an order of magnitude faster and was able to predict almost twice the number of true positives compared to other methods while maintaining a false positive rate less than 1%. For the liver cancer datasets, BatVI uncovered novel integrations to two important genes TERT and MLL4, which were missed by previous studies. Through gene expression data, we verified the correctness of these additional integrations. BatVI can be downloaded from http://biogpu.ddns.comp.nus.edu.sg/~ksung/batvi/index.html. |
format | Online Article Text |
id | pubmed-5374687 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-53746872017-04-03 BATVI: Fast, sensitive and accurate detection of virus integrations Tennakoon, Chandana Sung, Wing Kin BMC Bioinformatics Research BACKGROUND: The study of virus integrations in human genome is important since virus integrations were shown to be associated with diseases. In the literature, few methods have been proposed that predict virus integrations using next generation sequencing datasets. Although they work, they are slow and are not very sensitive. RESULTS AND DISCUSSION: This paper introduces a new method BatVI to predict viral integrations. Our method uses a fast screening method to filter out chimeric reads containing possible viral integrations. Next, sensitive alignments of these candidate chimeric reads are called by BLAST. Chimeric reads that are co-localized in the human genome are clustered. Finally, by assembling the chimeric reads in each cluster, high confident virus integration sites are extracted. CONCLUSION: We compared the performance of BatVI with existing methods VirusFinder and VirusSeq using both simulated and real-life datasets of liver cancer patients. BatVI ran an order of magnitude faster and was able to predict almost twice the number of true positives compared to other methods while maintaining a false positive rate less than 1%. For the liver cancer datasets, BatVI uncovered novel integrations to two important genes TERT and MLL4, which were missed by previous studies. Through gene expression data, we verified the correctness of these additional integrations. BatVI can be downloaded from http://biogpu.ddns.comp.nus.edu.sg/~ksung/batvi/index.html. BioMed Central 2017-03-14 /pmc/articles/PMC5374687/ /pubmed/28361674 http://dx.doi.org/10.1186/s12859-017-1470-x Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Tennakoon, Chandana Sung, Wing Kin BATVI: Fast, sensitive and accurate detection of virus integrations |
title | BATVI: Fast, sensitive and accurate detection of virus integrations |
title_full | BATVI: Fast, sensitive and accurate detection of virus integrations |
title_fullStr | BATVI: Fast, sensitive and accurate detection of virus integrations |
title_full_unstemmed | BATVI: Fast, sensitive and accurate detection of virus integrations |
title_short | BATVI: Fast, sensitive and accurate detection of virus integrations |
title_sort | batvi: fast, sensitive and accurate detection of virus integrations |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374687/ https://www.ncbi.nlm.nih.gov/pubmed/28361674 http://dx.doi.org/10.1186/s12859-017-1470-x |
work_keys_str_mv | AT tennakoonchandana batvifastsensitiveandaccuratedetectionofvirusintegrations AT sungwingkin batvifastsensitiveandaccuratedetectionofvirusintegrations |