Cargando…

CLOVE: classification of genomic fusions into structural variation events

BACKGROUND: A precise understanding of structural variants (SVs) in DNA is important in the study of cancer and population diversity. Many methods have been designed to identify SVs from DNA sequencing data. However, the problem remains challenging because existing approaches suffer from low sensiti...

Descripción completa

Detalles Bibliográficos
Autores principales: Schröder, Jan, Wirawan, Adrianto, Schmidt, Bertil, Papenfuss, Anthony T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5520322/
https://www.ncbi.nlm.nih.gov/pubmed/28728542
http://dx.doi.org/10.1186/s12859-017-1760-3
_version_ 1783251794987581440
author Schröder, Jan
Wirawan, Adrianto
Schmidt, Bertil
Papenfuss, Anthony T.
author_facet Schröder, Jan
Wirawan, Adrianto
Schmidt, Bertil
Papenfuss, Anthony T.
author_sort Schröder, Jan
collection PubMed
description BACKGROUND: A precise understanding of structural variants (SVs) in DNA is important in the study of cancer and population diversity. Many methods have been designed to identify SVs from DNA sequencing data. However, the problem remains challenging because existing approaches suffer from low sensitivity, precision, and positional accuracy. Furthermore, many existing tools only identify breakpoints, and so not collect related breakpoints and classify them as a particular type of SV. Due to the rapidly increasing usage of high throughput sequencing technologies in this area, there is an urgent need for algorithms that can accurately classify complex genomic rearrangements (involving more than one breakpoint or fusion). RESULTS: We present CLOVE, an algorithm for integrating the results of multiple breakpoint or SV callers and classifying the results as a particular SV. CLOVE is based on a graph data structure that is created from the breakpoint information. The algorithm looks for patterns in the graph that are characteristic of more complex rearrangement types. CLOVE is able to integrate the results of multiple callers, producing a consensus call. CONCLUSIONS: We demonstrate using simulated and real data that re-classified SV calls produced by CLOVE improve on the raw call set of existing SV algorithms, particularly in terms of accuracy. CLOVE is freely available from http://www.github.com/PapenfussLab. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1760-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5520322
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55203222017-07-21 CLOVE: classification of genomic fusions into structural variation events Schröder, Jan Wirawan, Adrianto Schmidt, Bertil Papenfuss, Anthony T. BMC Bioinformatics Methodology Article BACKGROUND: A precise understanding of structural variants (SVs) in DNA is important in the study of cancer and population diversity. Many methods have been designed to identify SVs from DNA sequencing data. However, the problem remains challenging because existing approaches suffer from low sensitivity, precision, and positional accuracy. Furthermore, many existing tools only identify breakpoints, and so not collect related breakpoints and classify them as a particular type of SV. Due to the rapidly increasing usage of high throughput sequencing technologies in this area, there is an urgent need for algorithms that can accurately classify complex genomic rearrangements (involving more than one breakpoint or fusion). RESULTS: We present CLOVE, an algorithm for integrating the results of multiple breakpoint or SV callers and classifying the results as a particular SV. CLOVE is based on a graph data structure that is created from the breakpoint information. The algorithm looks for patterns in the graph that are characteristic of more complex rearrangement types. CLOVE is able to integrate the results of multiple callers, producing a consensus call. CONCLUSIONS: We demonstrate using simulated and real data that re-classified SV calls produced by CLOVE improve on the raw call set of existing SV algorithms, particularly in terms of accuracy. CLOVE is freely available from http://www.github.com/PapenfussLab. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1760-3) contains supplementary material, which is available to authorized users. BioMed Central 2017-07-20 /pmc/articles/PMC5520322/ /pubmed/28728542 http://dx.doi.org/10.1186/s12859-017-1760-3 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Schröder, Jan
Wirawan, Adrianto
Schmidt, Bertil
Papenfuss, Anthony T.
CLOVE: classification of genomic fusions into structural variation events
title CLOVE: classification of genomic fusions into structural variation events
title_full CLOVE: classification of genomic fusions into structural variation events
title_fullStr CLOVE: classification of genomic fusions into structural variation events
title_full_unstemmed CLOVE: classification of genomic fusions into structural variation events
title_short CLOVE: classification of genomic fusions into structural variation events
title_sort clove: classification of genomic fusions into structural variation events
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5520322/
https://www.ncbi.nlm.nih.gov/pubmed/28728542
http://dx.doi.org/10.1186/s12859-017-1760-3
work_keys_str_mv AT schroderjan cloveclassificationofgenomicfusionsintostructuralvariationevents
AT wirawanadrianto cloveclassificationofgenomicfusionsintostructuralvariationevents
AT schmidtbertil cloveclassificationofgenomicfusionsintostructuralvariationevents
AT papenfussanthonyt cloveclassificationofgenomicfusionsintostructuralvariationevents