Cargando…

Simple binary segmentation frameworks for identifying variation in DNA copy number

BACKGROUND: Variation in DNA copy number, due to gains and losses of chromosome segments, is common. A first step for analyzing DNA copy number data is to identify amplified or deleted regions in individuals. To locate such regions, we propose a circular binary segmentation procedure, which is based...

Descripción completa

Detalles Bibliográficos
Autor principal:	Yang, Tae Young
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2012
Materias:	Methodology Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3571941/ https://www.ncbi.nlm.nih.gov/pubmed/23107320 http://dx.doi.org/10.1186/1471-2105-13-277

_version_	1782259240036990976
author	Yang, Tae Young
author_facet	Yang, Tae Young
author_sort	Yang, Tae Young
collection	PubMed
description	BACKGROUND: Variation in DNA copy number, due to gains and losses of chromosome segments, is common. A first step for analyzing DNA copy number data is to identify amplified or deleted regions in individuals. To locate such regions, we propose a circular binary segmentation procedure, which is based on a sequence of nested hypothesis tests, each using the Bayesian information criterion. RESULTS: Our procedure is convenient for analyzing DNA copy number in two general situations: (1) when using data from multiple sources and (2) when using cohort analysis of multiple patients suffering from the same type of cancer. In the first case, data from multiple sources such as different platforms, labs, or preprocessing methods are used to study variation in copy number in the same individual. Combining these sources provides a higher resolution, which leads to a more detailed genome-wide survey of the individual. In this case, we provide a simple statistical framework to derive a consensus molecular signature. In the framework, the multiple sequences from various sources are integrated into a single sequence, and then the proposed segmentation procedure is applied to this sequence to detect aberrant regions. In the second case, cohort analysis of multiple patients is carried out to derive overall molecular signatures for the cohort. For this case, we provide another simple statistical framework in which data across multiple profiles is standardized before segmentation. The proposed segmentation procedure is then applied to the standardized profiles one at a time to detect aberrant regions. Any such regions that are common across two or more profiles are probably real and may play important roles in the cancer pathogenesis process. CONCLUSIONS: The main advantages of the proposed procedure are flexibility and simplicity.
format	Online Article Text
id	pubmed-3571941
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-35719412013-02-20 Simple binary segmentation frameworks for identifying variation in DNA copy number Yang, Tae Young BMC Bioinformatics Methodology Article BACKGROUND: Variation in DNA copy number, due to gains and losses of chromosome segments, is common. A first step for analyzing DNA copy number data is to identify amplified or deleted regions in individuals. To locate such regions, we propose a circular binary segmentation procedure, which is based on a sequence of nested hypothesis tests, each using the Bayesian information criterion. RESULTS: Our procedure is convenient for analyzing DNA copy number in two general situations: (1) when using data from multiple sources and (2) when using cohort analysis of multiple patients suffering from the same type of cancer. In the first case, data from multiple sources such as different platforms, labs, or preprocessing methods are used to study variation in copy number in the same individual. Combining these sources provides a higher resolution, which leads to a more detailed genome-wide survey of the individual. In this case, we provide a simple statistical framework to derive a consensus molecular signature. In the framework, the multiple sequences from various sources are integrated into a single sequence, and then the proposed segmentation procedure is applied to this sequence to detect aberrant regions. In the second case, cohort analysis of multiple patients is carried out to derive overall molecular signatures for the cohort. For this case, we provide another simple statistical framework in which data across multiple profiles is standardized before segmentation. The proposed segmentation procedure is then applied to the standardized profiles one at a time to detect aberrant regions. Any such regions that are common across two or more profiles are probably real and may play important roles in the cancer pathogenesis process. CONCLUSIONS: The main advantages of the proposed procedure are flexibility and simplicity. BioMed Central 2012-10-30 /pmc/articles/PMC3571941/ /pubmed/23107320 http://dx.doi.org/10.1186/1471-2105-13-277 Text en Copyright ©2012 Yang; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methodology Article Yang, Tae Young Simple binary segmentation frameworks for identifying variation in DNA copy number
title	Simple binary segmentation frameworks for identifying variation in DNA copy number
title_full	Simple binary segmentation frameworks for identifying variation in DNA copy number
title_fullStr	Simple binary segmentation frameworks for identifying variation in DNA copy number
title_full_unstemmed	Simple binary segmentation frameworks for identifying variation in DNA copy number
title_short	Simple binary segmentation frameworks for identifying variation in DNA copy number
title_sort	simple binary segmentation frameworks for identifying variation in dna copy number
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3571941/ https://www.ncbi.nlm.nih.gov/pubmed/23107320 http://dx.doi.org/10.1186/1471-2105-13-277
work_keys_str_mv	AT yangtaeyoung simplebinarysegmentationframeworksforidentifyingvariationindnacopynumber

Simple binary segmentation frameworks for identifying variation in DNA copy number

Ejemplares similares