Cargando…

DBS: a fast and informative segmentation algorithm for DNA copy number analysis

BACKGROUND: Genome-wide DNA copy number changes are the hallmark events in the initiation and progression of cancers. Quantitative analysis of somatic copy number alterations (CNAs) has broad applications in cancer research. With the increasing capacity of high-throughput sequencing technologies, fa...

Descripción completa

Detalles Bibliográficos
Autores principales: Ruan, Jun, Liu, Zhen, Sun, Ming, Wang, Yue, Yue, Junqiu, Yu, Guoqiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6318921/
https://www.ncbi.nlm.nih.gov/pubmed/30606105
http://dx.doi.org/10.1186/s12859-018-2565-8
_version_ 1783384972045844480
author Ruan, Jun
Liu, Zhen
Sun, Ming
Wang, Yue
Yue, Junqiu
Yu, Guoqiang
author_facet Ruan, Jun
Liu, Zhen
Sun, Ming
Wang, Yue
Yue, Junqiu
Yu, Guoqiang
author_sort Ruan, Jun
collection PubMed
description BACKGROUND: Genome-wide DNA copy number changes are the hallmark events in the initiation and progression of cancers. Quantitative analysis of somatic copy number alterations (CNAs) has broad applications in cancer research. With the increasing capacity of high-throughput sequencing technologies, fast and efficient segmentation algorithms are required when characterizing high density CNAs data. RESULTS: A fast and informative segmentation algorithm, DBS (Deviation Binary Segmentation), is developed and discussed. The DBS method is based on the least absolute error principles and is inspired by the segmentation method rooted in the circular binary segmentation procedure. DBS uses point-by-point model calculation to ensure the accuracy of segmentation and combines a binary search algorithm with heuristics derived from the Central Limit Theorem. The DBS algorithm is very efficient requiring a computational complexity of O(n*log n), and is faster than its predecessors. Moreover, DBS measures the change-point amplitude of mean values of two adjacent segments at a breakpoint, where the significant degree of change-point amplitude is determined by the weighted average deviation at breakpoints. Accordingly, using the constructed binary tree of significant degree, DBS informs whether the results of segmentation are over- or under-segmented. CONCLUSION: DBS is implemented in a platform-independent and open-source Java application (ToolSeg), including a graphical user interface and simulation data generation, as well as various segmentation methods in the native Java language.
format Online
Article
Text
id pubmed-6318921
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63189212019-01-08 DBS: a fast and informative segmentation algorithm for DNA copy number analysis Ruan, Jun Liu, Zhen Sun, Ming Wang, Yue Yue, Junqiu Yu, Guoqiang BMC Bioinformatics Methodology Article BACKGROUND: Genome-wide DNA copy number changes are the hallmark events in the initiation and progression of cancers. Quantitative analysis of somatic copy number alterations (CNAs) has broad applications in cancer research. With the increasing capacity of high-throughput sequencing technologies, fast and efficient segmentation algorithms are required when characterizing high density CNAs data. RESULTS: A fast and informative segmentation algorithm, DBS (Deviation Binary Segmentation), is developed and discussed. The DBS method is based on the least absolute error principles and is inspired by the segmentation method rooted in the circular binary segmentation procedure. DBS uses point-by-point model calculation to ensure the accuracy of segmentation and combines a binary search algorithm with heuristics derived from the Central Limit Theorem. The DBS algorithm is very efficient requiring a computational complexity of O(n*log n), and is faster than its predecessors. Moreover, DBS measures the change-point amplitude of mean values of two adjacent segments at a breakpoint, where the significant degree of change-point amplitude is determined by the weighted average deviation at breakpoints. Accordingly, using the constructed binary tree of significant degree, DBS informs whether the results of segmentation are over- or under-segmented. CONCLUSION: DBS is implemented in a platform-independent and open-source Java application (ToolSeg), including a graphical user interface and simulation data generation, as well as various segmentation methods in the native Java language. BioMed Central 2019-01-03 /pmc/articles/PMC6318921/ /pubmed/30606105 http://dx.doi.org/10.1186/s12859-018-2565-8 Text en © The Author(s). 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Ruan, Jun
Liu, Zhen
Sun, Ming
Wang, Yue
Yue, Junqiu
Yu, Guoqiang
DBS: a fast and informative segmentation algorithm for DNA copy number analysis
title DBS: a fast and informative segmentation algorithm for DNA copy number analysis
title_full DBS: a fast and informative segmentation algorithm for DNA copy number analysis
title_fullStr DBS: a fast and informative segmentation algorithm for DNA copy number analysis
title_full_unstemmed DBS: a fast and informative segmentation algorithm for DNA copy number analysis
title_short DBS: a fast and informative segmentation algorithm for DNA copy number analysis
title_sort dbs: a fast and informative segmentation algorithm for dna copy number analysis
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6318921/
https://www.ncbi.nlm.nih.gov/pubmed/30606105
http://dx.doi.org/10.1186/s12859-018-2565-8
work_keys_str_mv AT ruanjun dbsafastandinformativesegmentationalgorithmfordnacopynumberanalysis
AT liuzhen dbsafastandinformativesegmentationalgorithmfordnacopynumberanalysis
AT sunming dbsafastandinformativesegmentationalgorithmfordnacopynumberanalysis
AT wangyue dbsafastandinformativesegmentationalgorithmfordnacopynumberanalysis
AT yuejunqiu dbsafastandinformativesegmentationalgorithmfordnacopynumberanalysis
AT yuguoqiang dbsafastandinformativesegmentationalgorithmfordnacopynumberanalysis