Cargando…

METHimpute: imputation-guided construction of complete methylomes from WGBS data

BACKGROUND: Whole-genome bisulfite sequencing (WGBS) has become the standard method for interrogating plant methylomes at base resolution. However, deep WGBS measurements remain cost prohibitive for large, complex genomes and for population-level studies. As a result, most published plant methylomes...

Descripción completa

Detalles Bibliográficos
Autores principales: Taudt, Aaron, Roquis, David, Vidalis, Amaryllis, Wardenaar, René, Johannes, Frank, Colomé-Tatché, Maria
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5992726/
https://www.ncbi.nlm.nih.gov/pubmed/29879918
http://dx.doi.org/10.1186/s12864-018-4641-x
_version_ 1783330087582564352
author Taudt, Aaron
Roquis, David
Vidalis, Amaryllis
Wardenaar, René
Johannes, Frank
Colomé-Tatché, Maria
author_facet Taudt, Aaron
Roquis, David
Vidalis, Amaryllis
Wardenaar, René
Johannes, Frank
Colomé-Tatché, Maria
author_sort Taudt, Aaron
collection PubMed
description BACKGROUND: Whole-genome bisulfite sequencing (WGBS) has become the standard method for interrogating plant methylomes at base resolution. However, deep WGBS measurements remain cost prohibitive for large, complex genomes and for population-level studies. As a result, most published plant methylomes are sequenced far below saturation, with a large proportion of cytosines having either missing data or insufficient coverage. RESULTS: Here we present METHimpute, a Hidden Markov Model (HMM) based imputation algorithm for the analysis of WGBS data. Unlike existing methods, METHimpute enables the construction of complete methylomes by inferring the methylation status and level of all cytosines in the genome regardless of coverage. Application of METHimpute to maize, rice and Arabidopsis shows that the algorithm infers cytosine-resolution methylomes with high accuracy from data as low as 6X, compared to data with 60X, thus making it a cost-effective solution for large-scale studies. CONCLUSIONS: METHimpute provides methylation status calls and levels for all cytosines in the genome regardless of coverage, thus yielding complete methylomes even with low-coverage WGBS datasets. The method has been extensively tested in plants, but should also be applicable to other species. An implementation is available on Bioconductor. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4641-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5992726
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-59927262018-06-21 METHimpute: imputation-guided construction of complete methylomes from WGBS data Taudt, Aaron Roquis, David Vidalis, Amaryllis Wardenaar, René Johannes, Frank Colomé-Tatché, Maria BMC Genomics Methodology Article BACKGROUND: Whole-genome bisulfite sequencing (WGBS) has become the standard method for interrogating plant methylomes at base resolution. However, deep WGBS measurements remain cost prohibitive for large, complex genomes and for population-level studies. As a result, most published plant methylomes are sequenced far below saturation, with a large proportion of cytosines having either missing data or insufficient coverage. RESULTS: Here we present METHimpute, a Hidden Markov Model (HMM) based imputation algorithm for the analysis of WGBS data. Unlike existing methods, METHimpute enables the construction of complete methylomes by inferring the methylation status and level of all cytosines in the genome regardless of coverage. Application of METHimpute to maize, rice and Arabidopsis shows that the algorithm infers cytosine-resolution methylomes with high accuracy from data as low as 6X, compared to data with 60X, thus making it a cost-effective solution for large-scale studies. CONCLUSIONS: METHimpute provides methylation status calls and levels for all cytosines in the genome regardless of coverage, thus yielding complete methylomes even with low-coverage WGBS datasets. The method has been extensively tested in plants, but should also be applicable to other species. An implementation is available on Bioconductor. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4641-x) contains supplementary material, which is available to authorized users. BioMed Central 2018-06-07 /pmc/articles/PMC5992726/ /pubmed/29879918 http://dx.doi.org/10.1186/s12864-018-4641-x Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Taudt, Aaron
Roquis, David
Vidalis, Amaryllis
Wardenaar, René
Johannes, Frank
Colomé-Tatché, Maria
METHimpute: imputation-guided construction of complete methylomes from WGBS data
title METHimpute: imputation-guided construction of complete methylomes from WGBS data
title_full METHimpute: imputation-guided construction of complete methylomes from WGBS data
title_fullStr METHimpute: imputation-guided construction of complete methylomes from WGBS data
title_full_unstemmed METHimpute: imputation-guided construction of complete methylomes from WGBS data
title_short METHimpute: imputation-guided construction of complete methylomes from WGBS data
title_sort methimpute: imputation-guided construction of complete methylomes from wgbs data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5992726/
https://www.ncbi.nlm.nih.gov/pubmed/29879918
http://dx.doi.org/10.1186/s12864-018-4641-x
work_keys_str_mv AT taudtaaron methimputeimputationguidedconstructionofcompletemethylomesfromwgbsdata
AT roquisdavid methimputeimputationguidedconstructionofcompletemethylomesfromwgbsdata
AT vidalisamaryllis methimputeimputationguidedconstructionofcompletemethylomesfromwgbsdata
AT wardenaarrene methimputeimputationguidedconstructionofcompletemethylomesfromwgbsdata
AT johannesfrank methimputeimputationguidedconstructionofcompletemethylomesfromwgbsdata
AT colometatchemaria methimputeimputationguidedconstructionofcompletemethylomesfromwgbsdata