Cargando…

Genomic distance entrained clustering and regression modelling highlights interacting genomic regions contributing to proliferation in breast cancer

BACKGROUND: Genomic copy number changes and regional alterations in epigenetic states have been linked to grade in breast cancer. However, the relative contribution of specific alterations to the pathology of different breast cancer subtypes remains unclear. The heterogeneity and interplay of genomi...

Descripción completa

Detalles Bibliográficos
Autores principales: Dexter, Tim J, Sims, David, Mitsopoulos, Costas, Mackay, Alan, Grigoriadis, Anita, Ahmad, Amar S, Zvelebil, Marketa
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2946304/
https://www.ncbi.nlm.nih.gov/pubmed/20825665
http://dx.doi.org/10.1186/1752-0509-4-127
_version_ 1782187285255553024
author Dexter, Tim J
Sims, David
Mitsopoulos, Costas
Mackay, Alan
Grigoriadis, Anita
Ahmad, Amar S
Zvelebil, Marketa
author_facet Dexter, Tim J
Sims, David
Mitsopoulos, Costas
Mackay, Alan
Grigoriadis, Anita
Ahmad, Amar S
Zvelebil, Marketa
author_sort Dexter, Tim J
collection PubMed
description BACKGROUND: Genomic copy number changes and regional alterations in epigenetic states have been linked to grade in breast cancer. However, the relative contribution of specific alterations to the pathology of different breast cancer subtypes remains unclear. The heterogeneity and interplay of genomic and epigenetic variations means that large datasets and statistical data mining methods are required to uncover recurrent patterns that are likely to be important in cancer progression. RESULTS: We employed ridge regression to model the relationship between regional changes in gene expression and proliferation. Regional features were extracted from tumour gene expression data using a novel clustering method, called genomic distance entrained agglomerative (GDEC) clustering. Using gene expression data in this way provides a simple means of integrating the phenotypic effects of both copy number aberrations and alterations in chromatin state. We show that regional metagenes derived from GDEC clustering are representative of recurrent regions of epigenetic regulation or copy number aberrations in breast cancer. Furthermore, detected patterns of genomic alterations are conserved across independent oestrogen receptor positive breast cancer datasets. Sequential competitive metagene selection was used to reveal the relative importance of genomic regions in predicting proliferation rate. The predictive model suggested additive interactions between the most informative regions such as 8p22-12 and 8q13-22. CONCLUSIONS: Data-mining of large-scale microarray gene expression datasets can reveal regional clusters of co-ordinate gene expression, independent of cause. By correlating these clusters with tumour proliferation we have identified a number of genomic regions that act together to promote proliferation in ER+ breast cancer. Identification of such regions should enable prioritisation of genomic regions for combinatorial functional studies to pinpoint the key genes and interactions contributing to tumourigenicity.
format Text
id pubmed-2946304
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29463042010-10-21 Genomic distance entrained clustering and regression modelling highlights interacting genomic regions contributing to proliferation in breast cancer Dexter, Tim J Sims, David Mitsopoulos, Costas Mackay, Alan Grigoriadis, Anita Ahmad, Amar S Zvelebil, Marketa BMC Syst Biol Research Article BACKGROUND: Genomic copy number changes and regional alterations in epigenetic states have been linked to grade in breast cancer. However, the relative contribution of specific alterations to the pathology of different breast cancer subtypes remains unclear. The heterogeneity and interplay of genomic and epigenetic variations means that large datasets and statistical data mining methods are required to uncover recurrent patterns that are likely to be important in cancer progression. RESULTS: We employed ridge regression to model the relationship between regional changes in gene expression and proliferation. Regional features were extracted from tumour gene expression data using a novel clustering method, called genomic distance entrained agglomerative (GDEC) clustering. Using gene expression data in this way provides a simple means of integrating the phenotypic effects of both copy number aberrations and alterations in chromatin state. We show that regional metagenes derived from GDEC clustering are representative of recurrent regions of epigenetic regulation or copy number aberrations in breast cancer. Furthermore, detected patterns of genomic alterations are conserved across independent oestrogen receptor positive breast cancer datasets. Sequential competitive metagene selection was used to reveal the relative importance of genomic regions in predicting proliferation rate. The predictive model suggested additive interactions between the most informative regions such as 8p22-12 and 8q13-22. CONCLUSIONS: Data-mining of large-scale microarray gene expression datasets can reveal regional clusters of co-ordinate gene expression, independent of cause. By correlating these clusters with tumour proliferation we have identified a number of genomic regions that act together to promote proliferation in ER+ breast cancer. Identification of such regions should enable prioritisation of genomic regions for combinatorial functional studies to pinpoint the key genes and interactions contributing to tumourigenicity. BioMed Central 2010-09-08 /pmc/articles/PMC2946304/ /pubmed/20825665 http://dx.doi.org/10.1186/1752-0509-4-127 Text en Copyright ©2010 Dexter et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Dexter, Tim J
Sims, David
Mitsopoulos, Costas
Mackay, Alan
Grigoriadis, Anita
Ahmad, Amar S
Zvelebil, Marketa
Genomic distance entrained clustering and regression modelling highlights interacting genomic regions contributing to proliferation in breast cancer
title Genomic distance entrained clustering and regression modelling highlights interacting genomic regions contributing to proliferation in breast cancer
title_full Genomic distance entrained clustering and regression modelling highlights interacting genomic regions contributing to proliferation in breast cancer
title_fullStr Genomic distance entrained clustering and regression modelling highlights interacting genomic regions contributing to proliferation in breast cancer
title_full_unstemmed Genomic distance entrained clustering and regression modelling highlights interacting genomic regions contributing to proliferation in breast cancer
title_short Genomic distance entrained clustering and regression modelling highlights interacting genomic regions contributing to proliferation in breast cancer
title_sort genomic distance entrained clustering and regression modelling highlights interacting genomic regions contributing to proliferation in breast cancer
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2946304/
https://www.ncbi.nlm.nih.gov/pubmed/20825665
http://dx.doi.org/10.1186/1752-0509-4-127
work_keys_str_mv AT dextertimj genomicdistanceentrainedclusteringandregressionmodellinghighlightsinteractinggenomicregionscontributingtoproliferationinbreastcancer
AT simsdavid genomicdistanceentrainedclusteringandregressionmodellinghighlightsinteractinggenomicregionscontributingtoproliferationinbreastcancer
AT mitsopouloscostas genomicdistanceentrainedclusteringandregressionmodellinghighlightsinteractinggenomicregionscontributingtoproliferationinbreastcancer
AT mackayalan genomicdistanceentrainedclusteringandregressionmodellinghighlightsinteractinggenomicregionscontributingtoproliferationinbreastcancer
AT grigoriadisanita genomicdistanceentrainedclusteringandregressionmodellinghighlightsinteractinggenomicregionscontributingtoproliferationinbreastcancer
AT ahmadamars genomicdistanceentrainedclusteringandregressionmodellinghighlightsinteractinggenomicregionscontributingtoproliferationinbreastcancer
AT zvelebilmarketa genomicdistanceentrainedclusteringandregressionmodellinghighlightsinteractinggenomicregionscontributingtoproliferationinbreastcancer