Cargando…

Information recovery from low coverage whole-genome bisulfite sequencing

The cost of whole-genome bisulfite sequencing (WGBS) remains a bottleneck for many studies and it is therefore imperative to extract as much information as possible from a given dataset. This is particularly important because even at the recommend 30X coverage for reference methylomes, up to 50% of...

Descripción completa

Detalles Bibliográficos
Autores principales: Libertini, Emanuele, Heath, Simon C., Hamoudi, Rifat A., Gut, Marta, Ziller, Michael J., Czyz, Agata, Ruotti, Victor, Stunnenberg, Hendrik G., Frontini, Mattia, Ouwehand, Willem H., Meissner, Alexander, Gut, Ivo G., Beck, Stephan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4931220/
https://www.ncbi.nlm.nih.gov/pubmed/27346250
http://dx.doi.org/10.1038/ncomms11306
_version_ 1782440850705350656
author Libertini, Emanuele
Heath, Simon C.
Hamoudi, Rifat A.
Gut, Marta
Ziller, Michael J.
Czyz, Agata
Ruotti, Victor
Stunnenberg, Hendrik G.
Frontini, Mattia
Ouwehand, Willem H.
Meissner, Alexander
Gut, Ivo G.
Beck, Stephan
author_facet Libertini, Emanuele
Heath, Simon C.
Hamoudi, Rifat A.
Gut, Marta
Ziller, Michael J.
Czyz, Agata
Ruotti, Victor
Stunnenberg, Hendrik G.
Frontini, Mattia
Ouwehand, Willem H.
Meissner, Alexander
Gut, Ivo G.
Beck, Stephan
author_sort Libertini, Emanuele
collection PubMed
description The cost of whole-genome bisulfite sequencing (WGBS) remains a bottleneck for many studies and it is therefore imperative to extract as much information as possible from a given dataset. This is particularly important because even at the recommend 30X coverage for reference methylomes, up to 50% of high-resolution features such as differentially methylated positions (DMPs) cannot be called with current methods as determined by saturation analysis. To address this limitation, we have developed a tool that dynamically segments WGBS methylomes into blocks of comethylation (COMETs) from which lost information can be recovered in the form of differentially methylated COMETs (DMCs). Using this tool, we demonstrate recovery of ∼30% of the lost DMP information content as DMCs even at very low (5X) coverage. This constitutes twice the amount that can be recovered using an existing method based on differentially methylated regions (DMRs). In addition, we explored the relationship between COMETs and haplotypes in lymphoblastoid cell lines of African and European origin. Using best fit analysis, we show COMETs to be correlated in a population-specific manner, suggesting that this type of dynamic segmentation may be useful for integrated (epi)genome-wide association studies in the future.
format Online
Article
Text
id pubmed-4931220
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-49312202016-07-12 Information recovery from low coverage whole-genome bisulfite sequencing Libertini, Emanuele Heath, Simon C. Hamoudi, Rifat A. Gut, Marta Ziller, Michael J. Czyz, Agata Ruotti, Victor Stunnenberg, Hendrik G. Frontini, Mattia Ouwehand, Willem H. Meissner, Alexander Gut, Ivo G. Beck, Stephan Nat Commun Article The cost of whole-genome bisulfite sequencing (WGBS) remains a bottleneck for many studies and it is therefore imperative to extract as much information as possible from a given dataset. This is particularly important because even at the recommend 30X coverage for reference methylomes, up to 50% of high-resolution features such as differentially methylated positions (DMPs) cannot be called with current methods as determined by saturation analysis. To address this limitation, we have developed a tool that dynamically segments WGBS methylomes into blocks of comethylation (COMETs) from which lost information can be recovered in the form of differentially methylated COMETs (DMCs). Using this tool, we demonstrate recovery of ∼30% of the lost DMP information content as DMCs even at very low (5X) coverage. This constitutes twice the amount that can be recovered using an existing method based on differentially methylated regions (DMRs). In addition, we explored the relationship between COMETs and haplotypes in lymphoblastoid cell lines of African and European origin. Using best fit analysis, we show COMETs to be correlated in a population-specific manner, suggesting that this type of dynamic segmentation may be useful for integrated (epi)genome-wide association studies in the future. Nature Publishing Group 2016-06-27 /pmc/articles/PMC4931220/ /pubmed/27346250 http://dx.doi.org/10.1038/ncomms11306 Text en Copyright © 2016, Nature Publishing Group, a division of Macmillan Publishers Limited. All Rights Reserved. http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Libertini, Emanuele
Heath, Simon C.
Hamoudi, Rifat A.
Gut, Marta
Ziller, Michael J.
Czyz, Agata
Ruotti, Victor
Stunnenberg, Hendrik G.
Frontini, Mattia
Ouwehand, Willem H.
Meissner, Alexander
Gut, Ivo G.
Beck, Stephan
Information recovery from low coverage whole-genome bisulfite sequencing
title Information recovery from low coverage whole-genome bisulfite sequencing
title_full Information recovery from low coverage whole-genome bisulfite sequencing
title_fullStr Information recovery from low coverage whole-genome bisulfite sequencing
title_full_unstemmed Information recovery from low coverage whole-genome bisulfite sequencing
title_short Information recovery from low coverage whole-genome bisulfite sequencing
title_sort information recovery from low coverage whole-genome bisulfite sequencing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4931220/
https://www.ncbi.nlm.nih.gov/pubmed/27346250
http://dx.doi.org/10.1038/ncomms11306
work_keys_str_mv AT libertiniemanuele informationrecoveryfromlowcoveragewholegenomebisulfitesequencing
AT heathsimonc informationrecoveryfromlowcoveragewholegenomebisulfitesequencing
AT hamoudirifata informationrecoveryfromlowcoveragewholegenomebisulfitesequencing
AT gutmarta informationrecoveryfromlowcoveragewholegenomebisulfitesequencing
AT zillermichaelj informationrecoveryfromlowcoveragewholegenomebisulfitesequencing
AT czyzagata informationrecoveryfromlowcoveragewholegenomebisulfitesequencing
AT ruottivictor informationrecoveryfromlowcoveragewholegenomebisulfitesequencing
AT stunnenberghendrikg informationrecoveryfromlowcoveragewholegenomebisulfitesequencing
AT frontinimattia informationrecoveryfromlowcoveragewholegenomebisulfitesequencing
AT ouwehandwillemh informationrecoveryfromlowcoveragewholegenomebisulfitesequencing
AT meissneralexander informationrecoveryfromlowcoveragewholegenomebisulfitesequencing
AT gutivog informationrecoveryfromlowcoveragewholegenomebisulfitesequencing
AT beckstephan informationrecoveryfromlowcoveragewholegenomebisulfitesequencing