Cargando…

A robust framework for detecting structural variations in a genome

Motivation: Recently, structural genomic variants have come to the forefront as a significant source of variation in the human population, but the identification of these variants in a large genome remains a challenge. The complete sequencing of a human individual is prohibitive at current costs, wh...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Seunghak, Cheran, Elango, Brudno, Michael
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718654/
https://www.ncbi.nlm.nih.gov/pubmed/18586745
http://dx.doi.org/10.1093/bioinformatics/btn176
_version_ 1782170009092489216
author Lee, Seunghak
Cheran, Elango
Brudno, Michael
author_facet Lee, Seunghak
Cheran, Elango
Brudno, Michael
author_sort Lee, Seunghak
collection PubMed
description Motivation: Recently, structural genomic variants have come to the forefront as a significant source of variation in the human population, but the identification of these variants in a large genome remains a challenge. The complete sequencing of a human individual is prohibitive at current costs, while current polymorphism detection technologies, such as SNP arrays, are not able to identify many of the large scale events. One of the most promising methods to detect such variants is the computational mapping of clone-end sequences to a reference genome. Results: Here, we present a probabilistic framework for the identification of structural variants using clone-end sequencing. Unlike previous methods, our approach does not rely on an a priori determined mapping of all reads to the reference. Instead, we build a framework for finding the most probable assignment of sequenced clones to potential structural variants based on the other clones. We compare our predictions with the structural variants identified in three previous studies. While there is a statistically significant correlation between the predictions, we also find a significant number of previously uncharacterized structural variants. Furthermore, we identify a number of putative cross-chromosomal events, primarily located proximally to the centromeres of the chromosomes. Availability: Our dataset, results and source code are available at http://compbio.cs.toronto.edu/structvar/ Contact:seunghak@cs.toronto.edu,echeran@cs.toronto.edu,brudno@cs.toronto.edu
format Text
id pubmed-2718654
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-27186542009-07-31 A robust framework for detecting structural variations in a genome Lee, Seunghak Cheran, Elango Brudno, Michael Bioinformatics Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto Motivation: Recently, structural genomic variants have come to the forefront as a significant source of variation in the human population, but the identification of these variants in a large genome remains a challenge. The complete sequencing of a human individual is prohibitive at current costs, while current polymorphism detection technologies, such as SNP arrays, are not able to identify many of the large scale events. One of the most promising methods to detect such variants is the computational mapping of clone-end sequences to a reference genome. Results: Here, we present a probabilistic framework for the identification of structural variants using clone-end sequencing. Unlike previous methods, our approach does not rely on an a priori determined mapping of all reads to the reference. Instead, we build a framework for finding the most probable assignment of sequenced clones to potential structural variants based on the other clones. We compare our predictions with the structural variants identified in three previous studies. While there is a statistically significant correlation between the predictions, we also find a significant number of previously uncharacterized structural variants. Furthermore, we identify a number of putative cross-chromosomal events, primarily located proximally to the centromeres of the chromosomes. Availability: Our dataset, results and source code are available at http://compbio.cs.toronto.edu/structvar/ Contact:seunghak@cs.toronto.edu,echeran@cs.toronto.edu,brudno@cs.toronto.edu Oxford University Press 2008-07-01 /pmc/articles/PMC2718654/ /pubmed/18586745 http://dx.doi.org/10.1093/bioinformatics/btn176 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto
Lee, Seunghak
Cheran, Elango
Brudno, Michael
A robust framework for detecting structural variations in a genome
title A robust framework for detecting structural variations in a genome
title_full A robust framework for detecting structural variations in a genome
title_fullStr A robust framework for detecting structural variations in a genome
title_full_unstemmed A robust framework for detecting structural variations in a genome
title_short A robust framework for detecting structural variations in a genome
title_sort robust framework for detecting structural variations in a genome
topic Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718654/
https://www.ncbi.nlm.nih.gov/pubmed/18586745
http://dx.doi.org/10.1093/bioinformatics/btn176
work_keys_str_mv AT leeseunghak arobustframeworkfordetectingstructuralvariationsinagenome
AT cheranelango arobustframeworkfordetectingstructuralvariationsinagenome
AT brudnomichael arobustframeworkfordetectingstructuralvariationsinagenome
AT leeseunghak robustframeworkfordetectingstructuralvariationsinagenome
AT cheranelango robustframeworkfordetectingstructuralvariationsinagenome
AT brudnomichael robustframeworkfordetectingstructuralvariationsinagenome