Cargando…

KC-SMARTR: An R package for detection of statistically significant aberrations in multi-experiment aCGH data

BACKGROUND: Most approaches used to find recurrent or differential DNA Copy Number Alterations (CNA) in array Comparative Genomic Hybridization (aCGH) data from groups of tumour samples depend on the discretization of the aCGH data to gain, loss or no-change states. This causes loss of valuable biol...

Descripción completa

Detalles Bibliográficos
Autores principales: de Ronde, Jorma J, Klijn, Christiaan, Velds, Arno, Holstege, Henne, Reinders, Marcel JT, Jonkers, Jos, Wessels, Lodewyk FA
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2995794/
https://www.ncbi.nlm.nih.gov/pubmed/21070656
http://dx.doi.org/10.1186/1756-0500-3-298
_version_ 1782193113335332864
author de Ronde, Jorma J
Klijn, Christiaan
Velds, Arno
Holstege, Henne
Reinders, Marcel JT
Jonkers, Jos
Wessels, Lodewyk FA
author_facet de Ronde, Jorma J
Klijn, Christiaan
Velds, Arno
Holstege, Henne
Reinders, Marcel JT
Jonkers, Jos
Wessels, Lodewyk FA
author_sort de Ronde, Jorma J
collection PubMed
description BACKGROUND: Most approaches used to find recurrent or differential DNA Copy Number Alterations (CNA) in array Comparative Genomic Hybridization (aCGH) data from groups of tumour samples depend on the discretization of the aCGH data to gain, loss or no-change states. This causes loss of valuable biological information in tumour samples, which are frequently heterogeneous. We have previously developed an algorithm, KC-SMART, that bases its estimate of the magnitude of the CNA at a given genomic location on kernel convolution (Klijn et al., 2008). This accounts for the intensity of the probe signal, its local genomic environment and the signal distribution across multiple samples. RESULTS: Here we extend the approach to allow comparative analyses of two groups of samples and introduce the R implementation of these two approaches. The comparative module allows for a supervised analysis to be performed, to enable the identification of regions that are differentially aberrated between two user-defined classes. We analyzed data from a series of B- and T-cell lymphomas and were able to retrieve all positive control regions (VDJ regions) in addition to a number of new regions. A t-test employing segmented data, that we implemented, was also able to locate all the positive control regions and a number of new regions but these regions were highly fragmented. CONCLUSIONS: KC-SMARTR offers recurrent CNA and class specific CNA detection, at different genomic scales, in a single package without the need for additional segmentation. It is memory efficient and runs on a wide range of machines. Most importantly, it does not rely on data discretization and therefore maximally exploits the biological information in the aCGH data. The program is freely available from the Bioconductor website http://www.bioconductor.org/ under the terms of the GNU General Public License.
format Text
id pubmed-2995794
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29957942011-01-05 KC-SMARTR: An R package for detection of statistically significant aberrations in multi-experiment aCGH data de Ronde, Jorma J Klijn, Christiaan Velds, Arno Holstege, Henne Reinders, Marcel JT Jonkers, Jos Wessels, Lodewyk FA BMC Res Notes Research Article BACKGROUND: Most approaches used to find recurrent or differential DNA Copy Number Alterations (CNA) in array Comparative Genomic Hybridization (aCGH) data from groups of tumour samples depend on the discretization of the aCGH data to gain, loss or no-change states. This causes loss of valuable biological information in tumour samples, which are frequently heterogeneous. We have previously developed an algorithm, KC-SMART, that bases its estimate of the magnitude of the CNA at a given genomic location on kernel convolution (Klijn et al., 2008). This accounts for the intensity of the probe signal, its local genomic environment and the signal distribution across multiple samples. RESULTS: Here we extend the approach to allow comparative analyses of two groups of samples and introduce the R implementation of these two approaches. The comparative module allows for a supervised analysis to be performed, to enable the identification of regions that are differentially aberrated between two user-defined classes. We analyzed data from a series of B- and T-cell lymphomas and were able to retrieve all positive control regions (VDJ regions) in addition to a number of new regions. A t-test employing segmented data, that we implemented, was also able to locate all the positive control regions and a number of new regions but these regions were highly fragmented. CONCLUSIONS: KC-SMARTR offers recurrent CNA and class specific CNA detection, at different genomic scales, in a single package without the need for additional segmentation. It is memory efficient and runs on a wide range of machines. Most importantly, it does not rely on data discretization and therefore maximally exploits the biological information in the aCGH data. The program is freely available from the Bioconductor website http://www.bioconductor.org/ under the terms of the GNU General Public License. BioMed Central 2010-11-11 /pmc/articles/PMC2995794/ /pubmed/21070656 http://dx.doi.org/10.1186/1756-0500-3-298 Text en Copyright ©2009 de Ronde et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
de Ronde, Jorma J
Klijn, Christiaan
Velds, Arno
Holstege, Henne
Reinders, Marcel JT
Jonkers, Jos
Wessels, Lodewyk FA
KC-SMARTR: An R package for detection of statistically significant aberrations in multi-experiment aCGH data
title KC-SMARTR: An R package for detection of statistically significant aberrations in multi-experiment aCGH data
title_full KC-SMARTR: An R package for detection of statistically significant aberrations in multi-experiment aCGH data
title_fullStr KC-SMARTR: An R package for detection of statistically significant aberrations in multi-experiment aCGH data
title_full_unstemmed KC-SMARTR: An R package for detection of statistically significant aberrations in multi-experiment aCGH data
title_short KC-SMARTR: An R package for detection of statistically significant aberrations in multi-experiment aCGH data
title_sort kc-smartr: an r package for detection of statistically significant aberrations in multi-experiment acgh data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2995794/
https://www.ncbi.nlm.nih.gov/pubmed/21070656
http://dx.doi.org/10.1186/1756-0500-3-298
work_keys_str_mv AT derondejormaj kcsmartranrpackagefordetectionofstatisticallysignificantaberrationsinmultiexperimentacghdata
AT klijnchristiaan kcsmartranrpackagefordetectionofstatisticallysignificantaberrationsinmultiexperimentacghdata
AT veldsarno kcsmartranrpackagefordetectionofstatisticallysignificantaberrationsinmultiexperimentacghdata
AT holstegehenne kcsmartranrpackagefordetectionofstatisticallysignificantaberrationsinmultiexperimentacghdata
AT reindersmarceljt kcsmartranrpackagefordetectionofstatisticallysignificantaberrationsinmultiexperimentacghdata
AT jonkersjos kcsmartranrpackagefordetectionofstatisticallysignificantaberrationsinmultiexperimentacghdata
AT wesselslodewykfa kcsmartranrpackagefordetectionofstatisticallysignificantaberrationsinmultiexperimentacghdata