Cargando…

Next-generation VariationHunter: combinatorial algorithms for transposon insertion discovery

Recent years have witnessed an increase in research activity for the detection of structural variants (SVs) and their association to human disease. The advent of next-generation sequencing technologies make it possible to extend the scope of structural variation studies to a point previously unimagi...

Descripción completa

Detalles Bibliográficos
Autores principales: Hormozdiari, Fereydoun, Hajirasouliha, Iman, Dao, Phuong, Hach, Faraz, Yorukoglu, Deniz, Alkan, Can, Eichler, Evan E., Sahinalp, S. Cenk
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881400/
https://www.ncbi.nlm.nih.gov/pubmed/20529927
http://dx.doi.org/10.1093/bioinformatics/btq216
_version_ 1782182115650043904
author Hormozdiari, Fereydoun
Hajirasouliha, Iman
Dao, Phuong
Hach, Faraz
Yorukoglu, Deniz
Alkan, Can
Eichler, Evan E.
Sahinalp, S. Cenk
author_facet Hormozdiari, Fereydoun
Hajirasouliha, Iman
Dao, Phuong
Hach, Faraz
Yorukoglu, Deniz
Alkan, Can
Eichler, Evan E.
Sahinalp, S. Cenk
author_sort Hormozdiari, Fereydoun
collection PubMed
description Recent years have witnessed an increase in research activity for the detection of structural variants (SVs) and their association to human disease. The advent of next-generation sequencing technologies make it possible to extend the scope of structural variation studies to a point previously unimaginable as exemplified by the 1000 Genomes Project. Although various computational methods have been described for the detection of SVs, no such algorithm is yet fully capable of discovering transposon insertions, a very important class of SVs to the study of human evolution and disease. In this article, we provide a complete and novel formulation to discover both loci and classes of transposons inserted into genomes sequenced with high-throughput sequencing technologies. In addition, we also present ‘conflict resolution’ improvements to our earlier combinatorial SV detection algorithm (VariationHunter) by taking the diploid nature of the human genome into consideration. We test our algorithms with simulated data from the Venter genome (HuRef) and are able to discover >85% of transposon insertion events with precision of >90%. We also demonstrate that our conflict resolution algorithm (denoted as VariationHunter-CR) outperforms current state of the art (such as original VariationHunter, BreakDancer and MoDIL) algorithms when tested on the genome of the Yoruba African individual (NA18507). Availability: The implementation of algorithm is available at http://compbio.cs.sfu.ca/strvar.htm. Contact: eee@gs.washington.edu; cenk@cs.sfu.ca Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2881400
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28814002010-06-08 Next-generation VariationHunter: combinatorial algorithms for transposon insertion discovery Hormozdiari, Fereydoun Hajirasouliha, Iman Dao, Phuong Hach, Faraz Yorukoglu, Deniz Alkan, Can Eichler, Evan E. Sahinalp, S. Cenk Bioinformatics Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa Recent years have witnessed an increase in research activity for the detection of structural variants (SVs) and their association to human disease. The advent of next-generation sequencing technologies make it possible to extend the scope of structural variation studies to a point previously unimaginable as exemplified by the 1000 Genomes Project. Although various computational methods have been described for the detection of SVs, no such algorithm is yet fully capable of discovering transposon insertions, a very important class of SVs to the study of human evolution and disease. In this article, we provide a complete and novel formulation to discover both loci and classes of transposons inserted into genomes sequenced with high-throughput sequencing technologies. In addition, we also present ‘conflict resolution’ improvements to our earlier combinatorial SV detection algorithm (VariationHunter) by taking the diploid nature of the human genome into consideration. We test our algorithms with simulated data from the Venter genome (HuRef) and are able to discover >85% of transposon insertion events with precision of >90%. We also demonstrate that our conflict resolution algorithm (denoted as VariationHunter-CR) outperforms current state of the art (such as original VariationHunter, BreakDancer and MoDIL) algorithms when tested on the genome of the Yoruba African individual (NA18507). Availability: The implementation of algorithm is available at http://compbio.cs.sfu.ca/strvar.htm. Contact: eee@gs.washington.edu; cenk@cs.sfu.ca Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2010-06-15 2010-06-01 /pmc/articles/PMC2881400/ /pubmed/20529927 http://dx.doi.org/10.1093/bioinformatics/btq216 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa
Hormozdiari, Fereydoun
Hajirasouliha, Iman
Dao, Phuong
Hach, Faraz
Yorukoglu, Deniz
Alkan, Can
Eichler, Evan E.
Sahinalp, S. Cenk
Next-generation VariationHunter: combinatorial algorithms for transposon insertion discovery
title Next-generation VariationHunter: combinatorial algorithms for transposon insertion discovery
title_full Next-generation VariationHunter: combinatorial algorithms for transposon insertion discovery
title_fullStr Next-generation VariationHunter: combinatorial algorithms for transposon insertion discovery
title_full_unstemmed Next-generation VariationHunter: combinatorial algorithms for transposon insertion discovery
title_short Next-generation VariationHunter: combinatorial algorithms for transposon insertion discovery
title_sort next-generation variationhunter: combinatorial algorithms for transposon insertion discovery
topic Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881400/
https://www.ncbi.nlm.nih.gov/pubmed/20529927
http://dx.doi.org/10.1093/bioinformatics/btq216
work_keys_str_mv AT hormozdiarifereydoun nextgenerationvariationhuntercombinatorialalgorithmsfortransposoninsertiondiscovery
AT hajirasoulihaiman nextgenerationvariationhuntercombinatorialalgorithmsfortransposoninsertiondiscovery
AT daophuong nextgenerationvariationhuntercombinatorialalgorithmsfortransposoninsertiondiscovery
AT hachfaraz nextgenerationvariationhuntercombinatorialalgorithmsfortransposoninsertiondiscovery
AT yorukogludeniz nextgenerationvariationhuntercombinatorialalgorithmsfortransposoninsertiondiscovery
AT alkancan nextgenerationvariationhuntercombinatorialalgorithmsfortransposoninsertiondiscovery
AT eichlerevane nextgenerationvariationhuntercombinatorialalgorithmsfortransposoninsertiondiscovery
AT sahinalpscenk nextgenerationvariationhuntercombinatorialalgorithmsfortransposoninsertiondiscovery