Cargando…

Identification of candidate cancer drivers by integrative Epi-DNA and Gene Expression (iEDGE) data analysis

The emergence of large-scale multi-omics data warrants method development for data integration. Genomic studies from cancer patients have identified epigenetic and genetic regulators – such as methylation marks, somatic mutations, and somatic copy number alterations (SCNAs), among others – as predic...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Amy, Chapuy, Bjoern, Varelas, Xaralabos, Sebastiani, Paola, Monti, Stefano
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6858347/
https://www.ncbi.nlm.nih.gov/pubmed/31729402
http://dx.doi.org/10.1038/s41598-019-52886-z
_version_ 1783470938855047168
author Li, Amy
Chapuy, Bjoern
Varelas, Xaralabos
Sebastiani, Paola
Monti, Stefano
author_facet Li, Amy
Chapuy, Bjoern
Varelas, Xaralabos
Sebastiani, Paola
Monti, Stefano
author_sort Li, Amy
collection PubMed
description The emergence of large-scale multi-omics data warrants method development for data integration. Genomic studies from cancer patients have identified epigenetic and genetic regulators – such as methylation marks, somatic mutations, and somatic copy number alterations (SCNAs), among others – as predictive features of cancer outcome. However, identification of “driver genes” associated with a given alteration remains a challenge. To this end, we developed a computational tool, iEDGE, to model cis and trans effects of (epi-)DNA alterations and identify potential cis driver genes, where cis and trans genes denote those genes falling within and outside the genomic boundaries of a given (epi-)genetic alteration, respectively. iEDGE first identifies the cis and trans gene expression signatures associated with the presence/absence of a particular epi-DNA alteration across samples. It then applies tests of statistical mediation to determine the cis genes predictive of the trans gene expression. Finally, cis and trans effects are annotated by pathway enrichment analysis to gain insights into the underlying regulatory networks. We used iEDGE to perform integrative analysis of SCNAs and gene expression data from breast cancer and 18 additional cancer types included in The Cancer Genome Atlas (TCGA). Notably, cis gene drivers identified by iEDGE were found to be significantly enriched for known driver genes from multiple compendia of validated oncogenes and tumor suppressors, suggesting that the remainder are of equal importance. Furthermore, predicted drivers were enriched for functionally relevant cancer genes with amplification-driven dependencies, which are of potential prognostic and therapeutic value. All the analyses results are accessible at https://montilab.bu.edu/iEDGE. In summary, integrative analysis of SCNAs and gene expression using iEDGE successfully identified known cancer driver genes and putative cancer therapeutic targets across 19 cancer types in the TCGA. The proposed method can easily be applied to the integration of gene expression profiles with other epi-DNA assays in a variety of disease contexts.
format Online
Article
Text
id pubmed-6858347
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-68583472019-11-27 Identification of candidate cancer drivers by integrative Epi-DNA and Gene Expression (iEDGE) data analysis Li, Amy Chapuy, Bjoern Varelas, Xaralabos Sebastiani, Paola Monti, Stefano Sci Rep Article The emergence of large-scale multi-omics data warrants method development for data integration. Genomic studies from cancer patients have identified epigenetic and genetic regulators – such as methylation marks, somatic mutations, and somatic copy number alterations (SCNAs), among others – as predictive features of cancer outcome. However, identification of “driver genes” associated with a given alteration remains a challenge. To this end, we developed a computational tool, iEDGE, to model cis and trans effects of (epi-)DNA alterations and identify potential cis driver genes, where cis and trans genes denote those genes falling within and outside the genomic boundaries of a given (epi-)genetic alteration, respectively. iEDGE first identifies the cis and trans gene expression signatures associated with the presence/absence of a particular epi-DNA alteration across samples. It then applies tests of statistical mediation to determine the cis genes predictive of the trans gene expression. Finally, cis and trans effects are annotated by pathway enrichment analysis to gain insights into the underlying regulatory networks. We used iEDGE to perform integrative analysis of SCNAs and gene expression data from breast cancer and 18 additional cancer types included in The Cancer Genome Atlas (TCGA). Notably, cis gene drivers identified by iEDGE were found to be significantly enriched for known driver genes from multiple compendia of validated oncogenes and tumor suppressors, suggesting that the remainder are of equal importance. Furthermore, predicted drivers were enriched for functionally relevant cancer genes with amplification-driven dependencies, which are of potential prognostic and therapeutic value. All the analyses results are accessible at https://montilab.bu.edu/iEDGE. In summary, integrative analysis of SCNAs and gene expression using iEDGE successfully identified known cancer driver genes and putative cancer therapeutic targets across 19 cancer types in the TCGA. The proposed method can easily be applied to the integration of gene expression profiles with other epi-DNA assays in a variety of disease contexts. Nature Publishing Group UK 2019-11-15 /pmc/articles/PMC6858347/ /pubmed/31729402 http://dx.doi.org/10.1038/s41598-019-52886-z Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Li, Amy
Chapuy, Bjoern
Varelas, Xaralabos
Sebastiani, Paola
Monti, Stefano
Identification of candidate cancer drivers by integrative Epi-DNA and Gene Expression (iEDGE) data analysis
title Identification of candidate cancer drivers by integrative Epi-DNA and Gene Expression (iEDGE) data analysis
title_full Identification of candidate cancer drivers by integrative Epi-DNA and Gene Expression (iEDGE) data analysis
title_fullStr Identification of candidate cancer drivers by integrative Epi-DNA and Gene Expression (iEDGE) data analysis
title_full_unstemmed Identification of candidate cancer drivers by integrative Epi-DNA and Gene Expression (iEDGE) data analysis
title_short Identification of candidate cancer drivers by integrative Epi-DNA and Gene Expression (iEDGE) data analysis
title_sort identification of candidate cancer drivers by integrative epi-dna and gene expression (iedge) data analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6858347/
https://www.ncbi.nlm.nih.gov/pubmed/31729402
http://dx.doi.org/10.1038/s41598-019-52886-z
work_keys_str_mv AT liamy identificationofcandidatecancerdriversbyintegrativeepidnaandgeneexpressioniedgedataanalysis
AT chapuybjoern identificationofcandidatecancerdriversbyintegrativeepidnaandgeneexpressioniedgedataanalysis
AT varelasxaralabos identificationofcandidatecancerdriversbyintegrativeepidnaandgeneexpressioniedgedataanalysis
AT sebastianipaola identificationofcandidatecancerdriversbyintegrativeepidnaandgeneexpressioniedgedataanalysis
AT montistefano identificationofcandidatecancerdriversbyintegrativeepidnaandgeneexpressioniedgedataanalysis