Cargando…

MethylScore, a pipeline for accurate and context-aware identification of differentially methylated regions from population-scale plant whole-genome bisulfite sequencing data

Whole-genome bisulfite sequencing (WGBS) is the standard method for profiling DNA methylation at single-nucleotide resolution. Different tools have been developed to extract differentially methylated regions (DMRs), often built upon assumptions from mammalian data. Here, we present MethylScore, a pi...

Descripción completa

Detalles Bibliográficos
Autores principales: Hüther, Patrick, Hagmann, Jörg, Nunn, Adam, Kakoulidou, Ioanna, Pisupati, Rahul, Langenberger, David, Weigel, Detlef, Johannes, Frank, Schultheiss, Sebastian J., Becker, Claude
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cambridge University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10095865/
https://www.ncbi.nlm.nih.gov/pubmed/37077980
http://dx.doi.org/10.1017/qpb.2022.14
_version_ 1785024185018351616
author Hüther, Patrick
Hagmann, Jörg
Nunn, Adam
Kakoulidou, Ioanna
Pisupati, Rahul
Langenberger, David
Weigel, Detlef
Johannes, Frank
Schultheiss, Sebastian J.
Becker, Claude
author_facet Hüther, Patrick
Hagmann, Jörg
Nunn, Adam
Kakoulidou, Ioanna
Pisupati, Rahul
Langenberger, David
Weigel, Detlef
Johannes, Frank
Schultheiss, Sebastian J.
Becker, Claude
author_sort Hüther, Patrick
collection PubMed
description Whole-genome bisulfite sequencing (WGBS) is the standard method for profiling DNA methylation at single-nucleotide resolution. Different tools have been developed to extract differentially methylated regions (DMRs), often built upon assumptions from mammalian data. Here, we present MethylScore, a pipeline to analyse WGBS data and to account for the substantially more complex and variable nature of plant DNA methylation. MethylScore uses an unsupervised machine learning approach to segment the genome by classification into states of high and low methylation. It processes data from genomic alignments to DMR output and is designed to be usable by novice and expert users alike. We show how MethylScore can identify DMRs from hundreds of samples and how its data-driven approach can stratify associated samples without prior information. We identify DMRs in the A. thaliana 1,001 Genomes dataset to unveil known and unknown genotype–epigenotype associations .
format Online
Article
Text
id pubmed-10095865
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Cambridge University Press
record_format MEDLINE/PubMed
spelling pubmed-100958652023-04-18 MethylScore, a pipeline for accurate and context-aware identification of differentially methylated regions from population-scale plant whole-genome bisulfite sequencing data Hüther, Patrick Hagmann, Jörg Nunn, Adam Kakoulidou, Ioanna Pisupati, Rahul Langenberger, David Weigel, Detlef Johannes, Frank Schultheiss, Sebastian J. Becker, Claude Quant Plant Biol Original Research Article Whole-genome bisulfite sequencing (WGBS) is the standard method for profiling DNA methylation at single-nucleotide resolution. Different tools have been developed to extract differentially methylated regions (DMRs), often built upon assumptions from mammalian data. Here, we present MethylScore, a pipeline to analyse WGBS data and to account for the substantially more complex and variable nature of plant DNA methylation. MethylScore uses an unsupervised machine learning approach to segment the genome by classification into states of high and low methylation. It processes data from genomic alignments to DMR output and is designed to be usable by novice and expert users alike. We show how MethylScore can identify DMRs from hundreds of samples and how its data-driven approach can stratify associated samples without prior information. We identify DMRs in the A. thaliana 1,001 Genomes dataset to unveil known and unknown genotype–epigenotype associations . Cambridge University Press 2022-09-26 /pmc/articles/PMC10095865/ /pubmed/37077980 http://dx.doi.org/10.1017/qpb.2022.14 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research Article
Hüther, Patrick
Hagmann, Jörg
Nunn, Adam
Kakoulidou, Ioanna
Pisupati, Rahul
Langenberger, David
Weigel, Detlef
Johannes, Frank
Schultheiss, Sebastian J.
Becker, Claude
MethylScore, a pipeline for accurate and context-aware identification of differentially methylated regions from population-scale plant whole-genome bisulfite sequencing data
title MethylScore, a pipeline for accurate and context-aware identification of differentially methylated regions from population-scale plant whole-genome bisulfite sequencing data
title_full MethylScore, a pipeline for accurate and context-aware identification of differentially methylated regions from population-scale plant whole-genome bisulfite sequencing data
title_fullStr MethylScore, a pipeline for accurate and context-aware identification of differentially methylated regions from population-scale plant whole-genome bisulfite sequencing data
title_full_unstemmed MethylScore, a pipeline for accurate and context-aware identification of differentially methylated regions from population-scale plant whole-genome bisulfite sequencing data
title_short MethylScore, a pipeline for accurate and context-aware identification of differentially methylated regions from population-scale plant whole-genome bisulfite sequencing data
title_sort methylscore, a pipeline for accurate and context-aware identification of differentially methylated regions from population-scale plant whole-genome bisulfite sequencing data
topic Original Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10095865/
https://www.ncbi.nlm.nih.gov/pubmed/37077980
http://dx.doi.org/10.1017/qpb.2022.14
work_keys_str_mv AT hutherpatrick methylscoreapipelineforaccurateandcontextawareidentificationofdifferentiallymethylatedregionsfrompopulationscaleplantwholegenomebisulfitesequencingdata
AT hagmannjorg methylscoreapipelineforaccurateandcontextawareidentificationofdifferentiallymethylatedregionsfrompopulationscaleplantwholegenomebisulfitesequencingdata
AT nunnadam methylscoreapipelineforaccurateandcontextawareidentificationofdifferentiallymethylatedregionsfrompopulationscaleplantwholegenomebisulfitesequencingdata
AT kakoulidouioanna methylscoreapipelineforaccurateandcontextawareidentificationofdifferentiallymethylatedregionsfrompopulationscaleplantwholegenomebisulfitesequencingdata
AT pisupatirahul methylscoreapipelineforaccurateandcontextawareidentificationofdifferentiallymethylatedregionsfrompopulationscaleplantwholegenomebisulfitesequencingdata
AT langenbergerdavid methylscoreapipelineforaccurateandcontextawareidentificationofdifferentiallymethylatedregionsfrompopulationscaleplantwholegenomebisulfitesequencingdata
AT weigeldetlef methylscoreapipelineforaccurateandcontextawareidentificationofdifferentiallymethylatedregionsfrompopulationscaleplantwholegenomebisulfitesequencingdata
AT johannesfrank methylscoreapipelineforaccurateandcontextawareidentificationofdifferentiallymethylatedregionsfrompopulationscaleplantwholegenomebisulfitesequencingdata
AT schultheisssebastianj methylscoreapipelineforaccurateandcontextawareidentificationofdifferentiallymethylatedregionsfrompopulationscaleplantwholegenomebisulfitesequencingdata
AT beckerclaude methylscoreapipelineforaccurateandcontextawareidentificationofdifferentiallymethylatedregionsfrompopulationscaleplantwholegenomebisulfitesequencingdata