Cargando…
Redundancy analysis allows improved detection of methylation changes in large genomic regions
BACKGROUND: DNA methylation is an epigenetic process that regulates gene expression. Methylation can be modified by environmental exposures and changes in the methylation patterns have been associated with diseases. Methylation microarrays measure methylation levels at more than 450,000 CpGs in a si...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5729265/ https://www.ncbi.nlm.nih.gov/pubmed/29237399 http://dx.doi.org/10.1186/s12859-017-1986-0 |
_version_ | 1783286158554300416 |
---|---|
author | Ruiz-Arenas, Carlos González, Juan R. |
author_facet | Ruiz-Arenas, Carlos González, Juan R. |
author_sort | Ruiz-Arenas, Carlos |
collection | PubMed |
description | BACKGROUND: DNA methylation is an epigenetic process that regulates gene expression. Methylation can be modified by environmental exposures and changes in the methylation patterns have been associated with diseases. Methylation microarrays measure methylation levels at more than 450,000 CpGs in a single experiment, and the most common analysis strategy is to perform a single probe analysis to find methylation probes associated with the outcome of interest. However, methylation changes usually occur at the regional level: for example, genomic structural variants can affect methylation patterns in regions up to several megabases in length. Existing DMR methods provide lists of Differentially Methylated Regions (DMRs) of up to only few kilobases in length, and cannot check if a target region is differentially methylated. Therefore, these methods are not suitable to evaluate methylation changes in large regions. To address these limitations, we developed a new DMR approach based on redundancy analysis (RDA) that assesses whether a target region is differentially methylated. RESULTS: Using simulated and real datasets, we compared our approach to three common DMR detection methods (Bumphunter, blockFinder, and DMRcate). We found that Bumphunter underestimated methylation changes and blockFinder showed poor performance. DMRcate showed poor power in the simulated datasets and low specificity in the real data analysis. Our method showed very high performance in all simulation settings, even with small sample sizes and subtle methylation changes, while controlling type I error. Other advantages of our method are: 1) it estimates the degree of association between the DMR and the outcome; 2) it can analyze a targeted or region of interest; and 3) it can evaluate the simultaneous effects of different variables. The proposed methodology is implemented in MEAL, a Bioconductor package designed to facilitate the analysis of methylation data. CONCLUSIONS: We propose a multivariate approach to decipher whether an outcome of interest alters the methylation pattern of a region of interest. The method is designed to analyze large target genomic regions and outperforms the three most popular methods for detecting DMRs. Our method can evaluate factors with more than two levels or the simultaneous effect of more than one continuous variable, which is not possible with the state-of-the-art methods. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1986-0) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5729265 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-57292652017-12-18 Redundancy analysis allows improved detection of methylation changes in large genomic regions Ruiz-Arenas, Carlos González, Juan R. BMC Bioinformatics Software BACKGROUND: DNA methylation is an epigenetic process that regulates gene expression. Methylation can be modified by environmental exposures and changes in the methylation patterns have been associated with diseases. Methylation microarrays measure methylation levels at more than 450,000 CpGs in a single experiment, and the most common analysis strategy is to perform a single probe analysis to find methylation probes associated with the outcome of interest. However, methylation changes usually occur at the regional level: for example, genomic structural variants can affect methylation patterns in regions up to several megabases in length. Existing DMR methods provide lists of Differentially Methylated Regions (DMRs) of up to only few kilobases in length, and cannot check if a target region is differentially methylated. Therefore, these methods are not suitable to evaluate methylation changes in large regions. To address these limitations, we developed a new DMR approach based on redundancy analysis (RDA) that assesses whether a target region is differentially methylated. RESULTS: Using simulated and real datasets, we compared our approach to three common DMR detection methods (Bumphunter, blockFinder, and DMRcate). We found that Bumphunter underestimated methylation changes and blockFinder showed poor performance. DMRcate showed poor power in the simulated datasets and low specificity in the real data analysis. Our method showed very high performance in all simulation settings, even with small sample sizes and subtle methylation changes, while controlling type I error. Other advantages of our method are: 1) it estimates the degree of association between the DMR and the outcome; 2) it can analyze a targeted or region of interest; and 3) it can evaluate the simultaneous effects of different variables. The proposed methodology is implemented in MEAL, a Bioconductor package designed to facilitate the analysis of methylation data. CONCLUSIONS: We propose a multivariate approach to decipher whether an outcome of interest alters the methylation pattern of a region of interest. The method is designed to analyze large target genomic regions and outperforms the three most popular methods for detecting DMRs. Our method can evaluate factors with more than two levels or the simultaneous effect of more than one continuous variable, which is not possible with the state-of-the-art methods. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1986-0) contains supplementary material, which is available to authorized users. BioMed Central 2017-12-14 /pmc/articles/PMC5729265/ /pubmed/29237399 http://dx.doi.org/10.1186/s12859-017-1986-0 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Ruiz-Arenas, Carlos González, Juan R. Redundancy analysis allows improved detection of methylation changes in large genomic regions |
title | Redundancy analysis allows improved detection of methylation changes in large genomic regions |
title_full | Redundancy analysis allows improved detection of methylation changes in large genomic regions |
title_fullStr | Redundancy analysis allows improved detection of methylation changes in large genomic regions |
title_full_unstemmed | Redundancy analysis allows improved detection of methylation changes in large genomic regions |
title_short | Redundancy analysis allows improved detection of methylation changes in large genomic regions |
title_sort | redundancy analysis allows improved detection of methylation changes in large genomic regions |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5729265/ https://www.ncbi.nlm.nih.gov/pubmed/29237399 http://dx.doi.org/10.1186/s12859-017-1986-0 |
work_keys_str_mv | AT ruizarenascarlos redundancyanalysisallowsimproveddetectionofmethylationchangesinlargegenomicregions AT gonzalezjuanr redundancyanalysisallowsimproveddetectionofmethylationchangesinlargegenomicregions |