Cargando…

Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer

DNA methylation of various genomic regions has been found to be associated with gene expression in diverse biological contexts. However, most genome-wide studies have focused on the effect of (1) methylation in cis, not in trans and (2) a single CpG, not the collective effects of multiple CpGs, on g...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Soyeon, Park, Hyun Jung, Cui, Xiangqin, Zhi, Degui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7054398/
https://www.ncbi.nlm.nih.gov/pubmed/32127627
http://dx.doi.org/10.1038/s41598-020-60845-2
_version_ 1783503189338750976
author Kim, Soyeon
Park, Hyun Jung
Cui, Xiangqin
Zhi, Degui
author_facet Kim, Soyeon
Park, Hyun Jung
Cui, Xiangqin
Zhi, Degui
author_sort Kim, Soyeon
collection PubMed
description DNA methylation of various genomic regions has been found to be associated with gene expression in diverse biological contexts. However, most genome-wide studies have focused on the effect of (1) methylation in cis, not in trans and (2) a single CpG, not the collective effects of multiple CpGs, on gene expression. In this study, we developed a statistical machine learning model, geneEXPLORE (gene expression prediction by long-range epigenetics), that quantifies the collective effects of both cis- and trans- methylations on gene expression. By applying geneEXPLORE to The Cancer Genome Atlas (TCGA) breast and 10 other types of cancer data, we found that most genes are associated with methylations of as much as 10 Mb from the promoters or more, and the long-range methylation explains 50% of the variation in gene expression on average, far greater than cis-methylation. geneEXPLORE outperforms competing methods such as BioMethyl and MethylXcan. Further, the predicted gene expressions could predict clinical phenotypes such as breast tumor status and estrogen receptor status (AUC = 0.999, 0.94 respectively) as accurately as the measured gene expression levels. These results suggest that geneEXPLORE provides a means for accurate imputation of gene expression, which can be further used to predict clinical phenotypes.
format Online
Article
Text
id pubmed-7054398
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-70543982020-03-11 Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer Kim, Soyeon Park, Hyun Jung Cui, Xiangqin Zhi, Degui Sci Rep Article DNA methylation of various genomic regions has been found to be associated with gene expression in diverse biological contexts. However, most genome-wide studies have focused on the effect of (1) methylation in cis, not in trans and (2) a single CpG, not the collective effects of multiple CpGs, on gene expression. In this study, we developed a statistical machine learning model, geneEXPLORE (gene expression prediction by long-range epigenetics), that quantifies the collective effects of both cis- and trans- methylations on gene expression. By applying geneEXPLORE to The Cancer Genome Atlas (TCGA) breast and 10 other types of cancer data, we found that most genes are associated with methylations of as much as 10 Mb from the promoters or more, and the long-range methylation explains 50% of the variation in gene expression on average, far greater than cis-methylation. geneEXPLORE outperforms competing methods such as BioMethyl and MethylXcan. Further, the predicted gene expressions could predict clinical phenotypes such as breast tumor status and estrogen receptor status (AUC = 0.999, 0.94 respectively) as accurately as the measured gene expression levels. These results suggest that geneEXPLORE provides a means for accurate imputation of gene expression, which can be further used to predict clinical phenotypes. Nature Publishing Group UK 2020-03-03 /pmc/articles/PMC7054398/ /pubmed/32127627 http://dx.doi.org/10.1038/s41598-020-60845-2 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Kim, Soyeon
Park, Hyun Jung
Cui, Xiangqin
Zhi, Degui
Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer
title Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer
title_full Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer
title_fullStr Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer
title_full_unstemmed Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer
title_short Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer
title_sort collective effects of long-range dna methylations predict gene expressions and estimate phenotypes in cancer
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7054398/
https://www.ncbi.nlm.nih.gov/pubmed/32127627
http://dx.doi.org/10.1038/s41598-020-60845-2
work_keys_str_mv AT kimsoyeon collectiveeffectsoflongrangednamethylationspredictgeneexpressionsandestimatephenotypesincancer
AT parkhyunjung collectiveeffectsoflongrangednamethylationspredictgeneexpressionsandestimatephenotypesincancer
AT cuixiangqin collectiveeffectsoflongrangednamethylationspredictgeneexpressionsandestimatephenotypesincancer
AT zhidegui collectiveeffectsoflongrangednamethylationspredictgeneexpressionsandestimatephenotypesincancer