Cargando…

Prediction of epigenetically regulated genes in breast cancer cell lines

BACKGROUND: Methylation of CpG islands within the DNA promoter regions is one mechanism that leads to aberrant gene expression in cancer. In particular, the abnormal methylation of CpG islands may silence associated genes. Therefore, using high-throughput microarrays to measure CpG island methylatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Loss, Leandro A, Sadanandam, Anguraj, Durinck, Steffen, Nautiyal, Shivani, Flaucher, Diane, Carlton, Victoria EH, Moorhead, Martin, Lu, Yontao, Gray, Joe W, Faham, Malek, Spellman, Paul, Parvin, Bahram
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2903569/
https://www.ncbi.nlm.nih.gov/pubmed/20525369
http://dx.doi.org/10.1186/1471-2105-11-305
_version_ 1782183817244573696
author Loss, Leandro A
Sadanandam, Anguraj
Durinck, Steffen
Nautiyal, Shivani
Flaucher, Diane
Carlton, Victoria EH
Moorhead, Martin
Lu, Yontao
Gray, Joe W
Faham, Malek
Spellman, Paul
Parvin, Bahram
author_facet Loss, Leandro A
Sadanandam, Anguraj
Durinck, Steffen
Nautiyal, Shivani
Flaucher, Diane
Carlton, Victoria EH
Moorhead, Martin
Lu, Yontao
Gray, Joe W
Faham, Malek
Spellman, Paul
Parvin, Bahram
author_sort Loss, Leandro A
collection PubMed
description BACKGROUND: Methylation of CpG islands within the DNA promoter regions is one mechanism that leads to aberrant gene expression in cancer. In particular, the abnormal methylation of CpG islands may silence associated genes. Therefore, using high-throughput microarrays to measure CpG island methylation will lead to better understanding of tumor pathobiology and progression, while revealing potentially new biomarkers. We have examined a recently developed high-throughput technology for measuring genome-wide methylation patterns called mTACL. Here, we propose a computational pipeline for integrating gene expression and CpG island methylation profles to identify epigenetically regulated genes for a panel of 45 breast cancer cell lines, which is widely used in the Integrative Cancer Biology Program (ICBP). The pipeline (i) reduces the dimensionality of the methylation data, (ii) associates the reduced methylation data with gene expression data, and (iii) ranks methylation-expression associations according to their epigenetic regulation. Dimensionality reduction is performed in two steps: (i) methylation sites are grouped across the genome to identify regions of interest, and (ii) methylation profles are clustered within each region. Associations between the clustered methylation and the gene expression data sets generate candidate matches within a fxed neighborhood around each gene. Finally, the methylation-expression associations are ranked through a logistic regression, and their significance is quantified through permutation analysis. RESULTS: Our two-step dimensionality reduction compressed 90% of the original data, reducing 137,688 methylation sites to 14,505 clusters. Methylation-expression associations produced 18,312 correspondences, which were used to further analyze epigenetic regulation. Logistic regression was used to identify 58 genes from these correspondences that showed a statistically signifcant negative correlation between methylation profles and gene expression in the panel of breast cancer cell lines. Subnetwork enrichment of these genes has identifed 35 common regulators with 6 or more predicted markers. In addition to identifying epigenetically regulated genes, we show evidence of differentially expressed methylation patterns between the basal and luminal subtypes. CONCLUSIONS: Our results indicate that the proposed computational protocol is a viable platform for identifying epigenetically regulated genes. Our protocol has generated a list of predictors including COL1A2, TOP2A, TFF1, and VAV3, genes whose key roles in epigenetic regulation is documented in the literature. Subnetwork enrichment of these predicted markers further suggests that epigenetic regulation of individual genes occurs in a coordinated fashion and through common regulators.
format Text
id pubmed-2903569
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29035692010-07-14 Prediction of epigenetically regulated genes in breast cancer cell lines Loss, Leandro A Sadanandam, Anguraj Durinck, Steffen Nautiyal, Shivani Flaucher, Diane Carlton, Victoria EH Moorhead, Martin Lu, Yontao Gray, Joe W Faham, Malek Spellman, Paul Parvin, Bahram BMC Bioinformatics Methodology article BACKGROUND: Methylation of CpG islands within the DNA promoter regions is one mechanism that leads to aberrant gene expression in cancer. In particular, the abnormal methylation of CpG islands may silence associated genes. Therefore, using high-throughput microarrays to measure CpG island methylation will lead to better understanding of tumor pathobiology and progression, while revealing potentially new biomarkers. We have examined a recently developed high-throughput technology for measuring genome-wide methylation patterns called mTACL. Here, we propose a computational pipeline for integrating gene expression and CpG island methylation profles to identify epigenetically regulated genes for a panel of 45 breast cancer cell lines, which is widely used in the Integrative Cancer Biology Program (ICBP). The pipeline (i) reduces the dimensionality of the methylation data, (ii) associates the reduced methylation data with gene expression data, and (iii) ranks methylation-expression associations according to their epigenetic regulation. Dimensionality reduction is performed in two steps: (i) methylation sites are grouped across the genome to identify regions of interest, and (ii) methylation profles are clustered within each region. Associations between the clustered methylation and the gene expression data sets generate candidate matches within a fxed neighborhood around each gene. Finally, the methylation-expression associations are ranked through a logistic regression, and their significance is quantified through permutation analysis. RESULTS: Our two-step dimensionality reduction compressed 90% of the original data, reducing 137,688 methylation sites to 14,505 clusters. Methylation-expression associations produced 18,312 correspondences, which were used to further analyze epigenetic regulation. Logistic regression was used to identify 58 genes from these correspondences that showed a statistically signifcant negative correlation between methylation profles and gene expression in the panel of breast cancer cell lines. Subnetwork enrichment of these genes has identifed 35 common regulators with 6 or more predicted markers. In addition to identifying epigenetically regulated genes, we show evidence of differentially expressed methylation patterns between the basal and luminal subtypes. CONCLUSIONS: Our results indicate that the proposed computational protocol is a viable platform for identifying epigenetically regulated genes. Our protocol has generated a list of predictors including COL1A2, TOP2A, TFF1, and VAV3, genes whose key roles in epigenetic regulation is documented in the literature. Subnetwork enrichment of these predicted markers further suggests that epigenetic regulation of individual genes occurs in a coordinated fashion and through common regulators. BioMed Central 2010-06-04 /pmc/articles/PMC2903569/ /pubmed/20525369 http://dx.doi.org/10.1186/1471-2105-11-305 Text en Copyright ©2010 Loss et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology article
Loss, Leandro A
Sadanandam, Anguraj
Durinck, Steffen
Nautiyal, Shivani
Flaucher, Diane
Carlton, Victoria EH
Moorhead, Martin
Lu, Yontao
Gray, Joe W
Faham, Malek
Spellman, Paul
Parvin, Bahram
Prediction of epigenetically regulated genes in breast cancer cell lines
title Prediction of epigenetically regulated genes in breast cancer cell lines
title_full Prediction of epigenetically regulated genes in breast cancer cell lines
title_fullStr Prediction of epigenetically regulated genes in breast cancer cell lines
title_full_unstemmed Prediction of epigenetically regulated genes in breast cancer cell lines
title_short Prediction of epigenetically regulated genes in breast cancer cell lines
title_sort prediction of epigenetically regulated genes in breast cancer cell lines
topic Methodology article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2903569/
https://www.ncbi.nlm.nih.gov/pubmed/20525369
http://dx.doi.org/10.1186/1471-2105-11-305
work_keys_str_mv AT lossleandroa predictionofepigeneticallyregulatedgenesinbreastcancercelllines
AT sadanandamanguraj predictionofepigeneticallyregulatedgenesinbreastcancercelllines
AT durincksteffen predictionofepigeneticallyregulatedgenesinbreastcancercelllines
AT nautiyalshivani predictionofepigeneticallyregulatedgenesinbreastcancercelllines
AT flaucherdiane predictionofepigeneticallyregulatedgenesinbreastcancercelllines
AT carltonvictoriaeh predictionofepigeneticallyregulatedgenesinbreastcancercelllines
AT moorheadmartin predictionofepigeneticallyregulatedgenesinbreastcancercelllines
AT luyontao predictionofepigeneticallyregulatedgenesinbreastcancercelllines
AT grayjoew predictionofepigeneticallyregulatedgenesinbreastcancercelllines
AT fahammalek predictionofepigeneticallyregulatedgenesinbreastcancercelllines
AT spellmanpaul predictionofepigeneticallyregulatedgenesinbreastcancercelllines
AT parvinbahram predictionofepigeneticallyregulatedgenesinbreastcancercelllines