Cargando…

MeSiC: A Model-Based Method for Estimating 5 mC Levels at Single-CpG Resolution from MeDIP-seq

As the fifth base in mammalian genome, 5-methylcytosine (5 mC) is essential for many biological processes including normal development and disease. Methylated DNA immunoprecipitation sequencing (MeDIP-seq), which uses anti-5 mC antibodies to enrich for methylated fraction of the genome, is widely us...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiao, Yun, Yu, Fulong, Pang, Lin, Zhao, Hongying, Liu, Ling, Zhang, Guanxiong, Liu, Tingting, Zhang, Hongyi, Fan, Huihui, Zhang, Yan, Pang, Bo, Li, Xia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4589794/
https://www.ncbi.nlm.nih.gov/pubmed/26424089
http://dx.doi.org/10.1038/srep14699
_version_ 1782392851484639232
author Xiao, Yun
Yu, Fulong
Pang, Lin
Zhao, Hongying
Liu, Ling
Zhang, Guanxiong
Liu, Tingting
Zhang, Hongyi
Fan, Huihui
Zhang, Yan
Pang, Bo
Li, Xia
author_facet Xiao, Yun
Yu, Fulong
Pang, Lin
Zhao, Hongying
Liu, Ling
Zhang, Guanxiong
Liu, Tingting
Zhang, Hongyi
Fan, Huihui
Zhang, Yan
Pang, Bo
Li, Xia
author_sort Xiao, Yun
collection PubMed
description As the fifth base in mammalian genome, 5-methylcytosine (5 mC) is essential for many biological processes including normal development and disease. Methylated DNA immunoprecipitation sequencing (MeDIP-seq), which uses anti-5 mC antibodies to enrich for methylated fraction of the genome, is widely used to investigate methylome at a resolution of 100–500 bp. Considering the CpG density-dependent bias and limited resolution of MeDIP-seq, we developed a Random Forest Regression (RFR) model method, MeSiC, to estimate DNA methylation levels at single-base resolution. MeSiC integrated MeDIP-seq signals of CpG sites and their surrounding neighbors as well as genomic features to construct genomic element-dependent RFR models. In the H1 cell line, a high correlation was observed between MeSiC predictions and actual 5 mC levels. Meanwhile, MeSiC enabled to calibrate CpG density-dependent bias of MeDIP-seq signals. Importantly, we found that MeSiC models constructed in the H1 cell line could be used to accurately predict DNA methylation levels for other cell types. Comparisons with methylCRF and MEDIPS showed that MeSiC achieved comparable and even better performance. These demonstrate that MeSiC can provide accurate estimations of 5 mC levels at single-CpG resolution using MeDIP-seq data alone.
format Online
Article
Text
id pubmed-4589794
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-45897942015-10-13 MeSiC: A Model-Based Method for Estimating 5 mC Levels at Single-CpG Resolution from MeDIP-seq Xiao, Yun Yu, Fulong Pang, Lin Zhao, Hongying Liu, Ling Zhang, Guanxiong Liu, Tingting Zhang, Hongyi Fan, Huihui Zhang, Yan Pang, Bo Li, Xia Sci Rep Article As the fifth base in mammalian genome, 5-methylcytosine (5 mC) is essential for many biological processes including normal development and disease. Methylated DNA immunoprecipitation sequencing (MeDIP-seq), which uses anti-5 mC antibodies to enrich for methylated fraction of the genome, is widely used to investigate methylome at a resolution of 100–500 bp. Considering the CpG density-dependent bias and limited resolution of MeDIP-seq, we developed a Random Forest Regression (RFR) model method, MeSiC, to estimate DNA methylation levels at single-base resolution. MeSiC integrated MeDIP-seq signals of CpG sites and their surrounding neighbors as well as genomic features to construct genomic element-dependent RFR models. In the H1 cell line, a high correlation was observed between MeSiC predictions and actual 5 mC levels. Meanwhile, MeSiC enabled to calibrate CpG density-dependent bias of MeDIP-seq signals. Importantly, we found that MeSiC models constructed in the H1 cell line could be used to accurately predict DNA methylation levels for other cell types. Comparisons with methylCRF and MEDIPS showed that MeSiC achieved comparable and even better performance. These demonstrate that MeSiC can provide accurate estimations of 5 mC levels at single-CpG resolution using MeDIP-seq data alone. Nature Publishing Group 2015-10-01 /pmc/articles/PMC4589794/ /pubmed/26424089 http://dx.doi.org/10.1038/srep14699 Text en Copyright © 2015, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Xiao, Yun
Yu, Fulong
Pang, Lin
Zhao, Hongying
Liu, Ling
Zhang, Guanxiong
Liu, Tingting
Zhang, Hongyi
Fan, Huihui
Zhang, Yan
Pang, Bo
Li, Xia
MeSiC: A Model-Based Method for Estimating 5 mC Levels at Single-CpG Resolution from MeDIP-seq
title MeSiC: A Model-Based Method for Estimating 5 mC Levels at Single-CpG Resolution from MeDIP-seq
title_full MeSiC: A Model-Based Method for Estimating 5 mC Levels at Single-CpG Resolution from MeDIP-seq
title_fullStr MeSiC: A Model-Based Method for Estimating 5 mC Levels at Single-CpG Resolution from MeDIP-seq
title_full_unstemmed MeSiC: A Model-Based Method for Estimating 5 mC Levels at Single-CpG Resolution from MeDIP-seq
title_short MeSiC: A Model-Based Method for Estimating 5 mC Levels at Single-CpG Resolution from MeDIP-seq
title_sort mesic: a model-based method for estimating 5 mc levels at single-cpg resolution from medip-seq
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4589794/
https://www.ncbi.nlm.nih.gov/pubmed/26424089
http://dx.doi.org/10.1038/srep14699
work_keys_str_mv AT xiaoyun mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq
AT yufulong mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq
AT panglin mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq
AT zhaohongying mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq
AT liuling mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq
AT zhangguanxiong mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq
AT liutingting mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq
AT zhanghongyi mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq
AT fanhuihui mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq
AT zhangyan mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq
AT pangbo mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq
AT lixia mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq