Cargando…
Representing and decomposing genomic structural variants as balanced integer flows on sequence graphs
BACKGROUND: The study of genomic variation has provided key insights into the functional role of mutations. Predominantly, studies have focused on single nucleotide variants (SNV), which are relatively easy to detect and can be described with rich mathematical models. However, it has been observed t...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5043639/ https://www.ncbi.nlm.nih.gov/pubmed/27687569 http://dx.doi.org/10.1186/s12859-016-1258-4 |
_version_ | 1782456792867930112 |
---|---|
author | Zerbino, Daniel R. Ballinger, Tracy Paten, Benedict Hickey, Glenn Haussler, David |
author_facet | Zerbino, Daniel R. Ballinger, Tracy Paten, Benedict Hickey, Glenn Haussler, David |
author_sort | Zerbino, Daniel R. |
collection | PubMed |
description | BACKGROUND: The study of genomic variation has provided key insights into the functional role of mutations. Predominantly, studies have focused on single nucleotide variants (SNV), which are relatively easy to detect and can be described with rich mathematical models. However, it has been observed that genomes are highly plastic, and that whole regions can be moved, removed or duplicated in bulk. These structural variants (SV) have been shown to have significant impact on phenotype, but their study has been held back by the combinatorial complexity of the underlying models. RESULTS: We describe here a general model of structural variation that encompasses both balanced rearrangements and arbitrary copy-number variants (CNV). CONCLUSIONS: In this model, we show that the space of possible evolutionary histories that explain the structural differences between any two genomes can be sampled ergodically. |
format | Online Article Text |
id | pubmed-5043639 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-50436392016-10-05 Representing and decomposing genomic structural variants as balanced integer flows on sequence graphs Zerbino, Daniel R. Ballinger, Tracy Paten, Benedict Hickey, Glenn Haussler, David BMC Bioinformatics Methodology Article BACKGROUND: The study of genomic variation has provided key insights into the functional role of mutations. Predominantly, studies have focused on single nucleotide variants (SNV), which are relatively easy to detect and can be described with rich mathematical models. However, it has been observed that genomes are highly plastic, and that whole regions can be moved, removed or duplicated in bulk. These structural variants (SV) have been shown to have significant impact on phenotype, but their study has been held back by the combinatorial complexity of the underlying models. RESULTS: We describe here a general model of structural variation that encompasses both balanced rearrangements and arbitrary copy-number variants (CNV). CONCLUSIONS: In this model, we show that the space of possible evolutionary histories that explain the structural differences between any two genomes can be sampled ergodically. BioMed Central 2016-09-29 /pmc/articles/PMC5043639/ /pubmed/27687569 http://dx.doi.org/10.1186/s12859-016-1258-4 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Zerbino, Daniel R. Ballinger, Tracy Paten, Benedict Hickey, Glenn Haussler, David Representing and decomposing genomic structural variants as balanced integer flows on sequence graphs |
title | Representing and decomposing genomic structural variants as balanced integer flows on sequence graphs |
title_full | Representing and decomposing genomic structural variants as balanced integer flows on sequence graphs |
title_fullStr | Representing and decomposing genomic structural variants as balanced integer flows on sequence graphs |
title_full_unstemmed | Representing and decomposing genomic structural variants as balanced integer flows on sequence graphs |
title_short | Representing and decomposing genomic structural variants as balanced integer flows on sequence graphs |
title_sort | representing and decomposing genomic structural variants as balanced integer flows on sequence graphs |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5043639/ https://www.ncbi.nlm.nih.gov/pubmed/27687569 http://dx.doi.org/10.1186/s12859-016-1258-4 |
work_keys_str_mv | AT zerbinodanielr representinganddecomposinggenomicstructuralvariantsasbalancedintegerflowsonsequencegraphs AT ballingertracy representinganddecomposinggenomicstructuralvariantsasbalancedintegerflowsonsequencegraphs AT patenbenedict representinganddecomposinggenomicstructuralvariantsasbalancedintegerflowsonsequencegraphs AT hickeyglenn representinganddecomposinggenomicstructuralvariantsasbalancedintegerflowsonsequencegraphs AT hausslerdavid representinganddecomposinggenomicstructuralvariantsasbalancedintegerflowsonsequencegraphs |