Cargando…
Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer
DNA methylation of various genomic regions has been found to be associated with gene expression in diverse biological contexts. However, most genome-wide studies have focused on the effect of (1) methylation in cis, not in trans and (2) a single CpG, not the collective effects of multiple CpGs, on g...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7054398/ https://www.ncbi.nlm.nih.gov/pubmed/32127627 http://dx.doi.org/10.1038/s41598-020-60845-2 |
_version_ | 1783503189338750976 |
---|---|
author | Kim, Soyeon Park, Hyun Jung Cui, Xiangqin Zhi, Degui |
author_facet | Kim, Soyeon Park, Hyun Jung Cui, Xiangqin Zhi, Degui |
author_sort | Kim, Soyeon |
collection | PubMed |
description | DNA methylation of various genomic regions has been found to be associated with gene expression in diverse biological contexts. However, most genome-wide studies have focused on the effect of (1) methylation in cis, not in trans and (2) a single CpG, not the collective effects of multiple CpGs, on gene expression. In this study, we developed a statistical machine learning model, geneEXPLORE (gene expression prediction by long-range epigenetics), that quantifies the collective effects of both cis- and trans- methylations on gene expression. By applying geneEXPLORE to The Cancer Genome Atlas (TCGA) breast and 10 other types of cancer data, we found that most genes are associated with methylations of as much as 10 Mb from the promoters or more, and the long-range methylation explains 50% of the variation in gene expression on average, far greater than cis-methylation. geneEXPLORE outperforms competing methods such as BioMethyl and MethylXcan. Further, the predicted gene expressions could predict clinical phenotypes such as breast tumor status and estrogen receptor status (AUC = 0.999, 0.94 respectively) as accurately as the measured gene expression levels. These results suggest that geneEXPLORE provides a means for accurate imputation of gene expression, which can be further used to predict clinical phenotypes. |
format | Online Article Text |
id | pubmed-7054398 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-70543982020-03-11 Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer Kim, Soyeon Park, Hyun Jung Cui, Xiangqin Zhi, Degui Sci Rep Article DNA methylation of various genomic regions has been found to be associated with gene expression in diverse biological contexts. However, most genome-wide studies have focused on the effect of (1) methylation in cis, not in trans and (2) a single CpG, not the collective effects of multiple CpGs, on gene expression. In this study, we developed a statistical machine learning model, geneEXPLORE (gene expression prediction by long-range epigenetics), that quantifies the collective effects of both cis- and trans- methylations on gene expression. By applying geneEXPLORE to The Cancer Genome Atlas (TCGA) breast and 10 other types of cancer data, we found that most genes are associated with methylations of as much as 10 Mb from the promoters or more, and the long-range methylation explains 50% of the variation in gene expression on average, far greater than cis-methylation. geneEXPLORE outperforms competing methods such as BioMethyl and MethylXcan. Further, the predicted gene expressions could predict clinical phenotypes such as breast tumor status and estrogen receptor status (AUC = 0.999, 0.94 respectively) as accurately as the measured gene expression levels. These results suggest that geneEXPLORE provides a means for accurate imputation of gene expression, which can be further used to predict clinical phenotypes. Nature Publishing Group UK 2020-03-03 /pmc/articles/PMC7054398/ /pubmed/32127627 http://dx.doi.org/10.1038/s41598-020-60845-2 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Kim, Soyeon Park, Hyun Jung Cui, Xiangqin Zhi, Degui Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer |
title | Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer |
title_full | Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer |
title_fullStr | Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer |
title_full_unstemmed | Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer |
title_short | Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer |
title_sort | collective effects of long-range dna methylations predict gene expressions and estimate phenotypes in cancer |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7054398/ https://www.ncbi.nlm.nih.gov/pubmed/32127627 http://dx.doi.org/10.1038/s41598-020-60845-2 |
work_keys_str_mv | AT kimsoyeon collectiveeffectsoflongrangednamethylationspredictgeneexpressionsandestimatephenotypesincancer AT parkhyunjung collectiveeffectsoflongrangednamethylationspredictgeneexpressionsandestimatephenotypesincancer AT cuixiangqin collectiveeffectsoflongrangednamethylationspredictgeneexpressionsandestimatephenotypesincancer AT zhidegui collectiveeffectsoflongrangednamethylationspredictgeneexpressionsandestimatephenotypesincancer |