Cargando…
BioMethyl: an R package for biological interpretation of DNA methylation data
MOTIVATION: The accumulation of publicly available DNA methylation datasets has resulted in the need for tools to interpret the specific cellular phenotypes in bulk tissue data. Current approaches use either single differentially methylated CpG sites or differentially methylated regions that map to...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6761945/ https://www.ncbi.nlm.nih.gov/pubmed/30799505 http://dx.doi.org/10.1093/bioinformatics/btz137 |
_version_ | 1783454129233854464 |
---|---|
author | Wang, Yue Franks, Jennifer M Whitfield, Michael L Cheng, Chao |
author_facet | Wang, Yue Franks, Jennifer M Whitfield, Michael L Cheng, Chao |
author_sort | Wang, Yue |
collection | PubMed |
description | MOTIVATION: The accumulation of publicly available DNA methylation datasets has resulted in the need for tools to interpret the specific cellular phenotypes in bulk tissue data. Current approaches use either single differentially methylated CpG sites or differentially methylated regions that map to genes. However, these approaches may introduce biases in downstream analyses of biological interpretation, because of the variability in gene length. There is a lack of approaches to interpret DNA methylation effectively. Therefore, we have developed computational models to provide biological interpretation of relevant gene sets using DNA methylation data in the context of The Cancer Genome Atlas. RESULTS: We illustrate that Biological interpretation of DNA Methylation (BioMethyl) utilizes the complete DNA methylation data for a given cancer type to reflect corresponding gene expression profiles and performs pathway enrichment analyses, providing unique biological insight. Using breast cancer as an example, BioMethyl shows high consistency in the identification of enriched biological pathways from DNA methylation data compared to the results calculated from RNA sequencing data. We find that 12 out of 14 pathways identified by BioMethyl are shared with those by using RNA-seq data, with a Jaccard score 0.8 for estrogen receptor (ER) positive samples. For ER negative samples, three pathways are shared in the two enrichments with a slight lower similarity (Jaccard score = 0.6). Using BioMethyl, we can successfully identify those hidden biological pathways in DNA methylation data when gene expression profile is lacking. AVAILABILITY AND IMPLEMENTATION: BioMethyl R package is freely available in the GitHub repository (https://github.com/yuewangpanda/BioMethyl). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6761945 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-67619452019-10-02 BioMethyl: an R package for biological interpretation of DNA methylation data Wang, Yue Franks, Jennifer M Whitfield, Michael L Cheng, Chao Bioinformatics Original Papers MOTIVATION: The accumulation of publicly available DNA methylation datasets has resulted in the need for tools to interpret the specific cellular phenotypes in bulk tissue data. Current approaches use either single differentially methylated CpG sites or differentially methylated regions that map to genes. However, these approaches may introduce biases in downstream analyses of biological interpretation, because of the variability in gene length. There is a lack of approaches to interpret DNA methylation effectively. Therefore, we have developed computational models to provide biological interpretation of relevant gene sets using DNA methylation data in the context of The Cancer Genome Atlas. RESULTS: We illustrate that Biological interpretation of DNA Methylation (BioMethyl) utilizes the complete DNA methylation data for a given cancer type to reflect corresponding gene expression profiles and performs pathway enrichment analyses, providing unique biological insight. Using breast cancer as an example, BioMethyl shows high consistency in the identification of enriched biological pathways from DNA methylation data compared to the results calculated from RNA sequencing data. We find that 12 out of 14 pathways identified by BioMethyl are shared with those by using RNA-seq data, with a Jaccard score 0.8 for estrogen receptor (ER) positive samples. For ER negative samples, three pathways are shared in the two enrichments with a slight lower similarity (Jaccard score = 0.6). Using BioMethyl, we can successfully identify those hidden biological pathways in DNA methylation data when gene expression profile is lacking. AVAILABILITY AND IMPLEMENTATION: BioMethyl R package is freely available in the GitHub repository (https://github.com/yuewangpanda/BioMethyl). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-10-01 2019-02-25 /pmc/articles/PMC6761945/ /pubmed/30799505 http://dx.doi.org/10.1093/bioinformatics/btz137 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Wang, Yue Franks, Jennifer M Whitfield, Michael L Cheng, Chao BioMethyl: an R package for biological interpretation of DNA methylation data |
title | BioMethyl: an R package for biological interpretation of DNA methylation data |
title_full | BioMethyl: an R package for biological interpretation of DNA methylation data |
title_fullStr | BioMethyl: an R package for biological interpretation of DNA methylation data |
title_full_unstemmed | BioMethyl: an R package for biological interpretation of DNA methylation data |
title_short | BioMethyl: an R package for biological interpretation of DNA methylation data |
title_sort | biomethyl: an r package for biological interpretation of dna methylation data |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6761945/ https://www.ncbi.nlm.nih.gov/pubmed/30799505 http://dx.doi.org/10.1093/bioinformatics/btz137 |
work_keys_str_mv | AT wangyue biomethylanrpackageforbiologicalinterpretationofdnamethylationdata AT franksjenniferm biomethylanrpackageforbiologicalinterpretationofdnamethylationdata AT whitfieldmichaell biomethylanrpackageforbiologicalinterpretationofdnamethylationdata AT chengchao biomethylanrpackageforbiologicalinterpretationofdnamethylationdata |