Cargando…

BATVI: Fast, sensitive and accurate detection of virus integrations

BACKGROUND: The study of virus integrations in human genome is important since virus integrations were shown to be associated with diseases. In the literature, few methods have been proposed that predict virus integrations using next generation sequencing datasets. Although they work, they are slow...

Descripción completa

Detalles Bibliográficos
Autores principales: Tennakoon, Chandana, Sung, Wing Kin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374687/
https://www.ncbi.nlm.nih.gov/pubmed/28361674
http://dx.doi.org/10.1186/s12859-017-1470-x
_version_ 1782518943370444800
author Tennakoon, Chandana
Sung, Wing Kin
author_facet Tennakoon, Chandana
Sung, Wing Kin
author_sort Tennakoon, Chandana
collection PubMed
description BACKGROUND: The study of virus integrations in human genome is important since virus integrations were shown to be associated with diseases. In the literature, few methods have been proposed that predict virus integrations using next generation sequencing datasets. Although they work, they are slow and are not very sensitive. RESULTS AND DISCUSSION: This paper introduces a new method BatVI to predict viral integrations. Our method uses a fast screening method to filter out chimeric reads containing possible viral integrations. Next, sensitive alignments of these candidate chimeric reads are called by BLAST. Chimeric reads that are co-localized in the human genome are clustered. Finally, by assembling the chimeric reads in each cluster, high confident virus integration sites are extracted. CONCLUSION: We compared the performance of BatVI with existing methods VirusFinder and VirusSeq using both simulated and real-life datasets of liver cancer patients. BatVI ran an order of magnitude faster and was able to predict almost twice the number of true positives compared to other methods while maintaining a false positive rate less than 1%. For the liver cancer datasets, BatVI uncovered novel integrations to two important genes TERT and MLL4, which were missed by previous studies. Through gene expression data, we verified the correctness of these additional integrations. BatVI can be downloaded from http://biogpu.ddns.comp.nus.edu.sg/~ksung/batvi/index.html.
format Online
Article
Text
id pubmed-5374687
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-53746872017-04-03 BATVI: Fast, sensitive and accurate detection of virus integrations Tennakoon, Chandana Sung, Wing Kin BMC Bioinformatics Research BACKGROUND: The study of virus integrations in human genome is important since virus integrations were shown to be associated with diseases. In the literature, few methods have been proposed that predict virus integrations using next generation sequencing datasets. Although they work, they are slow and are not very sensitive. RESULTS AND DISCUSSION: This paper introduces a new method BatVI to predict viral integrations. Our method uses a fast screening method to filter out chimeric reads containing possible viral integrations. Next, sensitive alignments of these candidate chimeric reads are called by BLAST. Chimeric reads that are co-localized in the human genome are clustered. Finally, by assembling the chimeric reads in each cluster, high confident virus integration sites are extracted. CONCLUSION: We compared the performance of BatVI with existing methods VirusFinder and VirusSeq using both simulated and real-life datasets of liver cancer patients. BatVI ran an order of magnitude faster and was able to predict almost twice the number of true positives compared to other methods while maintaining a false positive rate less than 1%. For the liver cancer datasets, BatVI uncovered novel integrations to two important genes TERT and MLL4, which were missed by previous studies. Through gene expression data, we verified the correctness of these additional integrations. BatVI can be downloaded from http://biogpu.ddns.comp.nus.edu.sg/~ksung/batvi/index.html. BioMed Central 2017-03-14 /pmc/articles/PMC5374687/ /pubmed/28361674 http://dx.doi.org/10.1186/s12859-017-1470-x Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Tennakoon, Chandana
Sung, Wing Kin
BATVI: Fast, sensitive and accurate detection of virus integrations
title BATVI: Fast, sensitive and accurate detection of virus integrations
title_full BATVI: Fast, sensitive and accurate detection of virus integrations
title_fullStr BATVI: Fast, sensitive and accurate detection of virus integrations
title_full_unstemmed BATVI: Fast, sensitive and accurate detection of virus integrations
title_short BATVI: Fast, sensitive and accurate detection of virus integrations
title_sort batvi: fast, sensitive and accurate detection of virus integrations
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374687/
https://www.ncbi.nlm.nih.gov/pubmed/28361674
http://dx.doi.org/10.1186/s12859-017-1470-x
work_keys_str_mv AT tennakoonchandana batvifastsensitiveandaccuratedetectionofvirusintegrations
AT sungwingkin batvifastsensitiveandaccuratedetectionofvirusintegrations