Cargando…
DBS: a fast and informative segmentation algorithm for DNA copy number analysis
BACKGROUND: Genome-wide DNA copy number changes are the hallmark events in the initiation and progression of cancers. Quantitative analysis of somatic copy number alterations (CNAs) has broad applications in cancer research. With the increasing capacity of high-throughput sequencing technologies, fa...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6318921/ https://www.ncbi.nlm.nih.gov/pubmed/30606105 http://dx.doi.org/10.1186/s12859-018-2565-8 |
_version_ | 1783384972045844480 |
---|---|
author | Ruan, Jun Liu, Zhen Sun, Ming Wang, Yue Yue, Junqiu Yu, Guoqiang |
author_facet | Ruan, Jun Liu, Zhen Sun, Ming Wang, Yue Yue, Junqiu Yu, Guoqiang |
author_sort | Ruan, Jun |
collection | PubMed |
description | BACKGROUND: Genome-wide DNA copy number changes are the hallmark events in the initiation and progression of cancers. Quantitative analysis of somatic copy number alterations (CNAs) has broad applications in cancer research. With the increasing capacity of high-throughput sequencing technologies, fast and efficient segmentation algorithms are required when characterizing high density CNAs data. RESULTS: A fast and informative segmentation algorithm, DBS (Deviation Binary Segmentation), is developed and discussed. The DBS method is based on the least absolute error principles and is inspired by the segmentation method rooted in the circular binary segmentation procedure. DBS uses point-by-point model calculation to ensure the accuracy of segmentation and combines a binary search algorithm with heuristics derived from the Central Limit Theorem. The DBS algorithm is very efficient requiring a computational complexity of O(n*log n), and is faster than its predecessors. Moreover, DBS measures the change-point amplitude of mean values of two adjacent segments at a breakpoint, where the significant degree of change-point amplitude is determined by the weighted average deviation at breakpoints. Accordingly, using the constructed binary tree of significant degree, DBS informs whether the results of segmentation are over- or under-segmented. CONCLUSION: DBS is implemented in a platform-independent and open-source Java application (ToolSeg), including a graphical user interface and simulation data generation, as well as various segmentation methods in the native Java language. |
format | Online Article Text |
id | pubmed-6318921 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-63189212019-01-08 DBS: a fast and informative segmentation algorithm for DNA copy number analysis Ruan, Jun Liu, Zhen Sun, Ming Wang, Yue Yue, Junqiu Yu, Guoqiang BMC Bioinformatics Methodology Article BACKGROUND: Genome-wide DNA copy number changes are the hallmark events in the initiation and progression of cancers. Quantitative analysis of somatic copy number alterations (CNAs) has broad applications in cancer research. With the increasing capacity of high-throughput sequencing technologies, fast and efficient segmentation algorithms are required when characterizing high density CNAs data. RESULTS: A fast and informative segmentation algorithm, DBS (Deviation Binary Segmentation), is developed and discussed. The DBS method is based on the least absolute error principles and is inspired by the segmentation method rooted in the circular binary segmentation procedure. DBS uses point-by-point model calculation to ensure the accuracy of segmentation and combines a binary search algorithm with heuristics derived from the Central Limit Theorem. The DBS algorithm is very efficient requiring a computational complexity of O(n*log n), and is faster than its predecessors. Moreover, DBS measures the change-point amplitude of mean values of two adjacent segments at a breakpoint, where the significant degree of change-point amplitude is determined by the weighted average deviation at breakpoints. Accordingly, using the constructed binary tree of significant degree, DBS informs whether the results of segmentation are over- or under-segmented. CONCLUSION: DBS is implemented in a platform-independent and open-source Java application (ToolSeg), including a graphical user interface and simulation data generation, as well as various segmentation methods in the native Java language. BioMed Central 2019-01-03 /pmc/articles/PMC6318921/ /pubmed/30606105 http://dx.doi.org/10.1186/s12859-018-2565-8 Text en © The Author(s). 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Ruan, Jun Liu, Zhen Sun, Ming Wang, Yue Yue, Junqiu Yu, Guoqiang DBS: a fast and informative segmentation algorithm for DNA copy number analysis |
title | DBS: a fast and informative segmentation algorithm for DNA copy number analysis |
title_full | DBS: a fast and informative segmentation algorithm for DNA copy number analysis |
title_fullStr | DBS: a fast and informative segmentation algorithm for DNA copy number analysis |
title_full_unstemmed | DBS: a fast and informative segmentation algorithm for DNA copy number analysis |
title_short | DBS: a fast and informative segmentation algorithm for DNA copy number analysis |
title_sort | dbs: a fast and informative segmentation algorithm for dna copy number analysis |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6318921/ https://www.ncbi.nlm.nih.gov/pubmed/30606105 http://dx.doi.org/10.1186/s12859-018-2565-8 |
work_keys_str_mv | AT ruanjun dbsafastandinformativesegmentationalgorithmfordnacopynumberanalysis AT liuzhen dbsafastandinformativesegmentationalgorithmfordnacopynumberanalysis AT sunming dbsafastandinformativesegmentationalgorithmfordnacopynumberanalysis AT wangyue dbsafastandinformativesegmentationalgorithmfordnacopynumberanalysis AT yuejunqiu dbsafastandinformativesegmentationalgorithmfordnacopynumberanalysis AT yuguoqiang dbsafastandinformativesegmentationalgorithmfordnacopynumberanalysis |