Cargando…

Methods for discovering genomic loci exhibiting complex patterns of differential methylation

BACKGROUND: Cytosine methylation is widespread in most eukaryotic genomes and is known to play a substantial role in various regulatory pathways. Unmethylated cytosines may be converted to uracil through the addition of sodium bisulphite, allowing genome-wide quantification of cytosine methylation v...

Descripción completa

Detalles Bibliográficos
Autor principal: Hardcastle, Thomas J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5604413/
https://www.ncbi.nlm.nih.gov/pubmed/28923005
http://dx.doi.org/10.1186/s12859-017-1836-0
_version_ 1783264862088986624
author Hardcastle, Thomas J.
author_facet Hardcastle, Thomas J.
author_sort Hardcastle, Thomas J.
collection PubMed
description BACKGROUND: Cytosine methylation is widespread in most eukaryotic genomes and is known to play a substantial role in various regulatory pathways. Unmethylated cytosines may be converted to uracil through the addition of sodium bisulphite, allowing genome-wide quantification of cytosine methylation via high-throughput sequencing. The data thus acquired allows the discovery of methylation ‘loci’; contiguous regions of methylation consistently methylated across biological replicates. The mapping of these loci allows for associations with other genomic factors to be identified, and for analyses of differential methylation to take place. RESULTS: The segmentSeq R package is extended to identify methylation loci from high-throughput sequencing data from multiple experimental conditions. A statistical model is then developed that accounts for biological replication and variable rates of non-conversion of cytosines in each sample to compute posterior likelihoods of methylation at each locus within an empirical Bayesian framework. The same model is used as a basis for analysis of differential methylation between multiple experimental conditions with the baySeq R package. We demonstrate the capability of this method to analyse complex data sets in an analysis of data derived from multiple Dicer-like mutants in Arabidopsis. This reveals several novel behaviours at distinct sets of loci in response to loss of one or more of the Dicer-like proteins that indicate an antagonistic relationship between the Dicer-like proteins at at least some methylation loci. Finally, we show in simulation studies that this approach can be significantly more powerful in the detection of differential methylation than many existing methods in data derived from both mammalian and plant systems. CONCLUSIONS: The methods developed here make it possible to analyse high-throughput sequencing of the methylome of any given organism under a diverse set of experimental conditions. The methods are able to identify methylation loci and evaluate the likelihood that a region is truly methylated under any given experimental condition, allowing for downstream analyses that characterise differences between methylated and non-methylated regions of the genome. Futhermore, diverse patterns of differential methylation may also be characterised from these data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1836-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5604413
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-56044132017-09-21 Methods for discovering genomic loci exhibiting complex patterns of differential methylation Hardcastle, Thomas J. BMC Bioinformatics Research Article BACKGROUND: Cytosine methylation is widespread in most eukaryotic genomes and is known to play a substantial role in various regulatory pathways. Unmethylated cytosines may be converted to uracil through the addition of sodium bisulphite, allowing genome-wide quantification of cytosine methylation via high-throughput sequencing. The data thus acquired allows the discovery of methylation ‘loci’; contiguous regions of methylation consistently methylated across biological replicates. The mapping of these loci allows for associations with other genomic factors to be identified, and for analyses of differential methylation to take place. RESULTS: The segmentSeq R package is extended to identify methylation loci from high-throughput sequencing data from multiple experimental conditions. A statistical model is then developed that accounts for biological replication and variable rates of non-conversion of cytosines in each sample to compute posterior likelihoods of methylation at each locus within an empirical Bayesian framework. The same model is used as a basis for analysis of differential methylation between multiple experimental conditions with the baySeq R package. We demonstrate the capability of this method to analyse complex data sets in an analysis of data derived from multiple Dicer-like mutants in Arabidopsis. This reveals several novel behaviours at distinct sets of loci in response to loss of one or more of the Dicer-like proteins that indicate an antagonistic relationship between the Dicer-like proteins at at least some methylation loci. Finally, we show in simulation studies that this approach can be significantly more powerful in the detection of differential methylation than many existing methods in data derived from both mammalian and plant systems. CONCLUSIONS: The methods developed here make it possible to analyse high-throughput sequencing of the methylome of any given organism under a diverse set of experimental conditions. The methods are able to identify methylation loci and evaluate the likelihood that a region is truly methylated under any given experimental condition, allowing for downstream analyses that characterise differences between methylated and non-methylated regions of the genome. Futhermore, diverse patterns of differential methylation may also be characterised from these data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1836-0) contains supplementary material, which is available to authorized users. BioMed Central 2017-09-18 /pmc/articles/PMC5604413/ /pubmed/28923005 http://dx.doi.org/10.1186/s12859-017-1836-0 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Hardcastle, Thomas J.
Methods for discovering genomic loci exhibiting complex patterns of differential methylation
title Methods for discovering genomic loci exhibiting complex patterns of differential methylation
title_full Methods for discovering genomic loci exhibiting complex patterns of differential methylation
title_fullStr Methods for discovering genomic loci exhibiting complex patterns of differential methylation
title_full_unstemmed Methods for discovering genomic loci exhibiting complex patterns of differential methylation
title_short Methods for discovering genomic loci exhibiting complex patterns of differential methylation
title_sort methods for discovering genomic loci exhibiting complex patterns of differential methylation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5604413/
https://www.ncbi.nlm.nih.gov/pubmed/28923005
http://dx.doi.org/10.1186/s12859-017-1836-0
work_keys_str_mv AT hardcastlethomasj methodsfordiscoveringgenomiclociexhibitingcomplexpatternsofdifferentialmethylation