Cargando…

Bayesian DNA copy number analysis

BACKGROUND: Some diseases, like tumors, can be related to chromosomal aberrations, leading to changes of DNA copy number. The copy number of an aberrant genome can be represented as a piecewise constant function, since it can exhibit regions of deletions or gains. Instead, in a healthy cell the copy...

Descripción completa

Detalles Bibliográficos
Autores principales: Rancoita, Paola MV, Hutter, Marcus, Bertoni, Francesco, Kwee, Ivo
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2674052/
https://www.ncbi.nlm.nih.gov/pubmed/19133123
http://dx.doi.org/10.1186/1471-2105-10-10
_version_ 1782166614297280512
author Rancoita, Paola MV
Hutter, Marcus
Bertoni, Francesco
Kwee, Ivo
author_facet Rancoita, Paola MV
Hutter, Marcus
Bertoni, Francesco
Kwee, Ivo
author_sort Rancoita, Paola MV
collection PubMed
description BACKGROUND: Some diseases, like tumors, can be related to chromosomal aberrations, leading to changes of DNA copy number. The copy number of an aberrant genome can be represented as a piecewise constant function, since it can exhibit regions of deletions or gains. Instead, in a healthy cell the copy number is two because we inherit one copy of each chromosome from each our parents. Bayesian Piecewise Constant Regression (BPCR) is a Bayesian regression method for data that are noisy observations of a piecewise constant function. The method estimates the unknown segment number, the endpoints of the segments and the value of the segment levels of the underlying piecewise constant function. The Bayesian Regression Curve (BRC) estimates the same data with a smoothing curve. However, in the original formulation, some estimators failed to properly determine the corresponding parameters. For example, the boundary estimator did not take into account the dependency among the boundaries and succeeded in estimating more than one breakpoint at the same position, losing segments. RESULTS: We derived an improved version of the BPCR (called mBPCR) and BRC, changing the segment number estimator and the boundary estimator to enhance the fitting procedure. We also proposed an alternative estimator of the variance of the segment levels, which is useful in case of data with high noise. Using artificial data, we compared the original and the modified version of BPCR and BRC with other regression methods, showing that our improved version of BPCR generally outperformed all the others. Similar results were also observed on real data. CONCLUSION: We propose an improved method for DNA copy number estimation, mBPCR, which performed very well compared to previously published algorithms. In particular, mBPCR was more powerful in the detection of the true position of the breakpoints and of small aberrations in very noisy data. Hence, from a biological point of view, our method can be very useful, for example, to find targets of genomic aberrations in clinical cancer samples.
format Text
id pubmed-2674052
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26740522009-04-28 Bayesian DNA copy number analysis Rancoita, Paola MV Hutter, Marcus Bertoni, Francesco Kwee, Ivo BMC Bioinformatics Research Article BACKGROUND: Some diseases, like tumors, can be related to chromosomal aberrations, leading to changes of DNA copy number. The copy number of an aberrant genome can be represented as a piecewise constant function, since it can exhibit regions of deletions or gains. Instead, in a healthy cell the copy number is two because we inherit one copy of each chromosome from each our parents. Bayesian Piecewise Constant Regression (BPCR) is a Bayesian regression method for data that are noisy observations of a piecewise constant function. The method estimates the unknown segment number, the endpoints of the segments and the value of the segment levels of the underlying piecewise constant function. The Bayesian Regression Curve (BRC) estimates the same data with a smoothing curve. However, in the original formulation, some estimators failed to properly determine the corresponding parameters. For example, the boundary estimator did not take into account the dependency among the boundaries and succeeded in estimating more than one breakpoint at the same position, losing segments. RESULTS: We derived an improved version of the BPCR (called mBPCR) and BRC, changing the segment number estimator and the boundary estimator to enhance the fitting procedure. We also proposed an alternative estimator of the variance of the segment levels, which is useful in case of data with high noise. Using artificial data, we compared the original and the modified version of BPCR and BRC with other regression methods, showing that our improved version of BPCR generally outperformed all the others. Similar results were also observed on real data. CONCLUSION: We propose an improved method for DNA copy number estimation, mBPCR, which performed very well compared to previously published algorithms. In particular, mBPCR was more powerful in the detection of the true position of the breakpoints and of small aberrations in very noisy data. Hence, from a biological point of view, our method can be very useful, for example, to find targets of genomic aberrations in clinical cancer samples. BioMed Central 2009-01-08 /pmc/articles/PMC2674052/ /pubmed/19133123 http://dx.doi.org/10.1186/1471-2105-10-10 Text en Copyright © 2009 Rancoita et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Rancoita, Paola MV
Hutter, Marcus
Bertoni, Francesco
Kwee, Ivo
Bayesian DNA copy number analysis
title Bayesian DNA copy number analysis
title_full Bayesian DNA copy number analysis
title_fullStr Bayesian DNA copy number analysis
title_full_unstemmed Bayesian DNA copy number analysis
title_short Bayesian DNA copy number analysis
title_sort bayesian dna copy number analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2674052/
https://www.ncbi.nlm.nih.gov/pubmed/19133123
http://dx.doi.org/10.1186/1471-2105-10-10
work_keys_str_mv AT rancoitapaolamv bayesiandnacopynumberanalysis
AT huttermarcus bayesiandnacopynumberanalysis
AT bertonifrancesco bayesiandnacopynumberanalysis
AT kweeivo bayesiandnacopynumberanalysis