Cargando…

A statistical approach for array CGH data analysis

BACKGROUND: Microarray-CGH experiments are used to detect and map chromosomal imbalances, by hybridizing targets of genomic DNA from a test and a reference sample to sequences immobilized on a slide. These probes are genomic DNA sequences (BACs) that are mapped on the genome. The signal has a spatia...

Descripción completa

Detalles Bibliográficos
Autores principales: Picard, Franck, Robin, Stephane, Lavielle, Marc, Vaisse, Christian, Daudin, Jean-Jacques
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC549559/
https://www.ncbi.nlm.nih.gov/pubmed/15705208
http://dx.doi.org/10.1186/1471-2105-6-27
_version_ 1782122433137868800
author Picard, Franck
Robin, Stephane
Lavielle, Marc
Vaisse, Christian
Daudin, Jean-Jacques
author_facet Picard, Franck
Robin, Stephane
Lavielle, Marc
Vaisse, Christian
Daudin, Jean-Jacques
author_sort Picard, Franck
collection PubMed
description BACKGROUND: Microarray-CGH experiments are used to detect and map chromosomal imbalances, by hybridizing targets of genomic DNA from a test and a reference sample to sequences immobilized on a slide. These probes are genomic DNA sequences (BACs) that are mapped on the genome. The signal has a spatial coherence that can be handled by specific statistical tools. Segmentation methods seem to be a natural framework for this purpose. A CGH profile can be viewed as a succession of segments that represent homogeneous regions in the genome whose BACs share the same relative copy number on average. We model a CGH profile by a random Gaussian process whose distribution parameters are affected by abrupt changes at unknown coordinates. Two major problems arise : to determine which parameters are affected by the abrupt changes (the mean and the variance, or the mean only), and the selection of the number of segments in the profile. RESULTS: We demonstrate that existing methods for estimating the number of segments are not well adapted in the case of array CGH data, and we propose an adaptive criterion that detects previously mapped chromosomal aberrations. The performances of this method are discussed based on simulations and publicly available data sets. Then we discuss the choice of modeling for array CGH data and show that the model with a homogeneous variance is adapted to this context. CONCLUSIONS: Array CGH data analysis is an emerging field that needs appropriate statistical tools. Process segmentation and model selection provide a theoretical framework that allows precise biological interpretations. Adaptive methods for model selection give promising results concerning the estimation of the number of altered regions on the genome.
format Text
id pubmed-549559
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5495592005-02-25 A statistical approach for array CGH data analysis Picard, Franck Robin, Stephane Lavielle, Marc Vaisse, Christian Daudin, Jean-Jacques BMC Bioinformatics Research Article BACKGROUND: Microarray-CGH experiments are used to detect and map chromosomal imbalances, by hybridizing targets of genomic DNA from a test and a reference sample to sequences immobilized on a slide. These probes are genomic DNA sequences (BACs) that are mapped on the genome. The signal has a spatial coherence that can be handled by specific statistical tools. Segmentation methods seem to be a natural framework for this purpose. A CGH profile can be viewed as a succession of segments that represent homogeneous regions in the genome whose BACs share the same relative copy number on average. We model a CGH profile by a random Gaussian process whose distribution parameters are affected by abrupt changes at unknown coordinates. Two major problems arise : to determine which parameters are affected by the abrupt changes (the mean and the variance, or the mean only), and the selection of the number of segments in the profile. RESULTS: We demonstrate that existing methods for estimating the number of segments are not well adapted in the case of array CGH data, and we propose an adaptive criterion that detects previously mapped chromosomal aberrations. The performances of this method are discussed based on simulations and publicly available data sets. Then we discuss the choice of modeling for array CGH data and show that the model with a homogeneous variance is adapted to this context. CONCLUSIONS: Array CGH data analysis is an emerging field that needs appropriate statistical tools. Process segmentation and model selection provide a theoretical framework that allows precise biological interpretations. Adaptive methods for model selection give promising results concerning the estimation of the number of altered regions on the genome. BioMed Central 2005-02-11 /pmc/articles/PMC549559/ /pubmed/15705208 http://dx.doi.org/10.1186/1471-2105-6-27 Text en Copyright © 2005 Picard et al; licensee BioMed Central Ltd.
spellingShingle Research Article
Picard, Franck
Robin, Stephane
Lavielle, Marc
Vaisse, Christian
Daudin, Jean-Jacques
A statistical approach for array CGH data analysis
title A statistical approach for array CGH data analysis
title_full A statistical approach for array CGH data analysis
title_fullStr A statistical approach for array CGH data analysis
title_full_unstemmed A statistical approach for array CGH data analysis
title_short A statistical approach for array CGH data analysis
title_sort statistical approach for array cgh data analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC549559/
https://www.ncbi.nlm.nih.gov/pubmed/15705208
http://dx.doi.org/10.1186/1471-2105-6-27
work_keys_str_mv AT picardfranck astatisticalapproachforarraycghdataanalysis
AT robinstephane astatisticalapproachforarraycghdataanalysis
AT laviellemarc astatisticalapproachforarraycghdataanalysis
AT vaissechristian astatisticalapproachforarraycghdataanalysis
AT daudinjeanjacques astatisticalapproachforarraycghdataanalysis
AT picardfranck statisticalapproachforarraycghdataanalysis
AT robinstephane statisticalapproachforarraycghdataanalysis
AT laviellemarc statisticalapproachforarraycghdataanalysis
AT vaissechristian statisticalapproachforarraycghdataanalysis
AT daudinjeanjacques statisticalapproachforarraycghdataanalysis