Cargando…

Controlling for conservation in genome-wide DNA methylation studies

BACKGROUND: A commonplace analysis in high-throughput DNA methylation studies is the comparison of methylation extent between different functional regions, computed by averaging methylation states within region types and then comparing averages between regions. For example, it has been reported that...

Descripción completa

Detalles Bibliográficos
Autores principales: Singer, Meromit, Pachter, Lior
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4448855/
https://www.ncbi.nlm.nih.gov/pubmed/26024968
http://dx.doi.org/10.1186/s12864-015-1604-3
_version_ 1782373775650586624
author Singer, Meromit
Pachter, Lior
author_facet Singer, Meromit
Pachter, Lior
author_sort Singer, Meromit
collection PubMed
description BACKGROUND: A commonplace analysis in high-throughput DNA methylation studies is the comparison of methylation extent between different functional regions, computed by averaging methylation states within region types and then comparing averages between regions. For example, it has been reported that methylation is more prevalent in coding regions as compared to their neighboring introns or UTRs, leading to hypotheses about novel forms of epigenetic regulation. RESULTS: We have identified and characterized a bias present in these seemingly straightforward comparisons that results in the false detection of differences in methylation intensities across region types. This bias arises due to differences in conservation rates, rather than methylation rates, and is broadly present in the published literature. When controlling for conservation at coding start sites the differences in DNA methylation rates disappear. Moreover, a re-evaluation of methylation rates at intronexon junctions reveals that the magnitude of previously reported differences is greatly exaggerated. We introduce two correction methods to address this bias, an inferencebased matrix completion algorithm and an averaging approach, tailored to address different underlying biological questions. We evaluate how analysis using these corrections affects the detection of differences in DNA methylation across functional boundaries. CONCLUSIONS: We report here on a bias in DNA methylation comparative studies that originates in conservation rate differences and manifests itself in the false discovery of differences in DNA methylation intensities and their extents. We have characterized this bias and its broad implications, and show how to control for it so as to enable the study of a variety of biological questions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1604-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4448855
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44488552015-05-30 Controlling for conservation in genome-wide DNA methylation studies Singer, Meromit Pachter, Lior BMC Genomics Methodology Article BACKGROUND: A commonplace analysis in high-throughput DNA methylation studies is the comparison of methylation extent between different functional regions, computed by averaging methylation states within region types and then comparing averages between regions. For example, it has been reported that methylation is more prevalent in coding regions as compared to their neighboring introns or UTRs, leading to hypotheses about novel forms of epigenetic regulation. RESULTS: We have identified and characterized a bias present in these seemingly straightforward comparisons that results in the false detection of differences in methylation intensities across region types. This bias arises due to differences in conservation rates, rather than methylation rates, and is broadly present in the published literature. When controlling for conservation at coding start sites the differences in DNA methylation rates disappear. Moreover, a re-evaluation of methylation rates at intronexon junctions reveals that the magnitude of previously reported differences is greatly exaggerated. We introduce two correction methods to address this bias, an inferencebased matrix completion algorithm and an averaging approach, tailored to address different underlying biological questions. We evaluate how analysis using these corrections affects the detection of differences in DNA methylation across functional boundaries. CONCLUSIONS: We report here on a bias in DNA methylation comparative studies that originates in conservation rate differences and manifests itself in the false discovery of differences in DNA methylation intensities and their extents. We have characterized this bias and its broad implications, and show how to control for it so as to enable the study of a variety of biological questions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1604-3) contains supplementary material, which is available to authorized users. BioMed Central 2015-05-30 /pmc/articles/PMC4448855/ /pubmed/26024968 http://dx.doi.org/10.1186/s12864-015-1604-3 Text en © Singer and Pachter; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Singer, Meromit
Pachter, Lior
Controlling for conservation in genome-wide DNA methylation studies
title Controlling for conservation in genome-wide DNA methylation studies
title_full Controlling for conservation in genome-wide DNA methylation studies
title_fullStr Controlling for conservation in genome-wide DNA methylation studies
title_full_unstemmed Controlling for conservation in genome-wide DNA methylation studies
title_short Controlling for conservation in genome-wide DNA methylation studies
title_sort controlling for conservation in genome-wide dna methylation studies
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4448855/
https://www.ncbi.nlm.nih.gov/pubmed/26024968
http://dx.doi.org/10.1186/s12864-015-1604-3
work_keys_str_mv AT singermeromit controllingforconservationingenomewidednamethylationstudies
AT pachterlior controllingforconservationingenomewidednamethylationstudies