Cargando…

BioMethyl: an R package for biological interpretation of DNA methylation data

MOTIVATION: The accumulation of publicly available DNA methylation datasets has resulted in the need for tools to interpret the specific cellular phenotypes in bulk tissue data. Current approaches use either single differentially methylated CpG sites or differentially methylated regions that map to...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yue, Franks, Jennifer M, Whitfield, Michael L, Cheng, Chao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6761945/
https://www.ncbi.nlm.nih.gov/pubmed/30799505
http://dx.doi.org/10.1093/bioinformatics/btz137
_version_ 1783454129233854464
author Wang, Yue
Franks, Jennifer M
Whitfield, Michael L
Cheng, Chao
author_facet Wang, Yue
Franks, Jennifer M
Whitfield, Michael L
Cheng, Chao
author_sort Wang, Yue
collection PubMed
description MOTIVATION: The accumulation of publicly available DNA methylation datasets has resulted in the need for tools to interpret the specific cellular phenotypes in bulk tissue data. Current approaches use either single differentially methylated CpG sites or differentially methylated regions that map to genes. However, these approaches may introduce biases in downstream analyses of biological interpretation, because of the variability in gene length. There is a lack of approaches to interpret DNA methylation effectively. Therefore, we have developed computational models to provide biological interpretation of relevant gene sets using DNA methylation data in the context of The Cancer Genome Atlas. RESULTS: We illustrate that Biological interpretation of DNA Methylation (BioMethyl) utilizes the complete DNA methylation data for a given cancer type to reflect corresponding gene expression profiles and performs pathway enrichment analyses, providing unique biological insight. Using breast cancer as an example, BioMethyl shows high consistency in the identification of enriched biological pathways from DNA methylation data compared to the results calculated from RNA sequencing data. We find that 12 out of 14 pathways identified by BioMethyl are shared with those by using RNA-seq data, with a Jaccard score 0.8 for estrogen receptor (ER) positive samples. For ER negative samples, three pathways are shared in the two enrichments with a slight lower similarity (Jaccard score = 0.6). Using BioMethyl, we can successfully identify those hidden biological pathways in DNA methylation data when gene expression profile is lacking. AVAILABILITY AND IMPLEMENTATION: BioMethyl R package is freely available in the GitHub repository (https://github.com/yuewangpanda/BioMethyl). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6761945
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-67619452019-10-02 BioMethyl: an R package for biological interpretation of DNA methylation data Wang, Yue Franks, Jennifer M Whitfield, Michael L Cheng, Chao Bioinformatics Original Papers MOTIVATION: The accumulation of publicly available DNA methylation datasets has resulted in the need for tools to interpret the specific cellular phenotypes in bulk tissue data. Current approaches use either single differentially methylated CpG sites or differentially methylated regions that map to genes. However, these approaches may introduce biases in downstream analyses of biological interpretation, because of the variability in gene length. There is a lack of approaches to interpret DNA methylation effectively. Therefore, we have developed computational models to provide biological interpretation of relevant gene sets using DNA methylation data in the context of The Cancer Genome Atlas. RESULTS: We illustrate that Biological interpretation of DNA Methylation (BioMethyl) utilizes the complete DNA methylation data for a given cancer type to reflect corresponding gene expression profiles and performs pathway enrichment analyses, providing unique biological insight. Using breast cancer as an example, BioMethyl shows high consistency in the identification of enriched biological pathways from DNA methylation data compared to the results calculated from RNA sequencing data. We find that 12 out of 14 pathways identified by BioMethyl are shared with those by using RNA-seq data, with a Jaccard score 0.8 for estrogen receptor (ER) positive samples. For ER negative samples, three pathways are shared in the two enrichments with a slight lower similarity (Jaccard score = 0.6). Using BioMethyl, we can successfully identify those hidden biological pathways in DNA methylation data when gene expression profile is lacking. AVAILABILITY AND IMPLEMENTATION: BioMethyl R package is freely available in the GitHub repository (https://github.com/yuewangpanda/BioMethyl). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-10-01 2019-02-25 /pmc/articles/PMC6761945/ /pubmed/30799505 http://dx.doi.org/10.1093/bioinformatics/btz137 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Wang, Yue
Franks, Jennifer M
Whitfield, Michael L
Cheng, Chao
BioMethyl: an R package for biological interpretation of DNA methylation data
title BioMethyl: an R package for biological interpretation of DNA methylation data
title_full BioMethyl: an R package for biological interpretation of DNA methylation data
title_fullStr BioMethyl: an R package for biological interpretation of DNA methylation data
title_full_unstemmed BioMethyl: an R package for biological interpretation of DNA methylation data
title_short BioMethyl: an R package for biological interpretation of DNA methylation data
title_sort biomethyl: an r package for biological interpretation of dna methylation data
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6761945/
https://www.ncbi.nlm.nih.gov/pubmed/30799505
http://dx.doi.org/10.1093/bioinformatics/btz137
work_keys_str_mv AT wangyue biomethylanrpackageforbiologicalinterpretationofdnamethylationdata
AT franksjenniferm biomethylanrpackageforbiologicalinterpretationofdnamethylationdata
AT whitfieldmichaell biomethylanrpackageforbiologicalinterpretationofdnamethylationdata
AT chengchao biomethylanrpackageforbiologicalinterpretationofdnamethylationdata