Cargando…
MeSiC: A Model-Based Method for Estimating 5 mC Levels at Single-CpG Resolution from MeDIP-seq
As the fifth base in mammalian genome, 5-methylcytosine (5 mC) is essential for many biological processes including normal development and disease. Methylated DNA immunoprecipitation sequencing (MeDIP-seq), which uses anti-5 mC antibodies to enrich for methylated fraction of the genome, is widely us...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4589794/ https://www.ncbi.nlm.nih.gov/pubmed/26424089 http://dx.doi.org/10.1038/srep14699 |
_version_ | 1782392851484639232 |
---|---|
author | Xiao, Yun Yu, Fulong Pang, Lin Zhao, Hongying Liu, Ling Zhang, Guanxiong Liu, Tingting Zhang, Hongyi Fan, Huihui Zhang, Yan Pang, Bo Li, Xia |
author_facet | Xiao, Yun Yu, Fulong Pang, Lin Zhao, Hongying Liu, Ling Zhang, Guanxiong Liu, Tingting Zhang, Hongyi Fan, Huihui Zhang, Yan Pang, Bo Li, Xia |
author_sort | Xiao, Yun |
collection | PubMed |
description | As the fifth base in mammalian genome, 5-methylcytosine (5 mC) is essential for many biological processes including normal development and disease. Methylated DNA immunoprecipitation sequencing (MeDIP-seq), which uses anti-5 mC antibodies to enrich for methylated fraction of the genome, is widely used to investigate methylome at a resolution of 100–500 bp. Considering the CpG density-dependent bias and limited resolution of MeDIP-seq, we developed a Random Forest Regression (RFR) model method, MeSiC, to estimate DNA methylation levels at single-base resolution. MeSiC integrated MeDIP-seq signals of CpG sites and their surrounding neighbors as well as genomic features to construct genomic element-dependent RFR models. In the H1 cell line, a high correlation was observed between MeSiC predictions and actual 5 mC levels. Meanwhile, MeSiC enabled to calibrate CpG density-dependent bias of MeDIP-seq signals. Importantly, we found that MeSiC models constructed in the H1 cell line could be used to accurately predict DNA methylation levels for other cell types. Comparisons with methylCRF and MEDIPS showed that MeSiC achieved comparable and even better performance. These demonstrate that MeSiC can provide accurate estimations of 5 mC levels at single-CpG resolution using MeDIP-seq data alone. |
format | Online Article Text |
id | pubmed-4589794 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-45897942015-10-13 MeSiC: A Model-Based Method for Estimating 5 mC Levels at Single-CpG Resolution from MeDIP-seq Xiao, Yun Yu, Fulong Pang, Lin Zhao, Hongying Liu, Ling Zhang, Guanxiong Liu, Tingting Zhang, Hongyi Fan, Huihui Zhang, Yan Pang, Bo Li, Xia Sci Rep Article As the fifth base in mammalian genome, 5-methylcytosine (5 mC) is essential for many biological processes including normal development and disease. Methylated DNA immunoprecipitation sequencing (MeDIP-seq), which uses anti-5 mC antibodies to enrich for methylated fraction of the genome, is widely used to investigate methylome at a resolution of 100–500 bp. Considering the CpG density-dependent bias and limited resolution of MeDIP-seq, we developed a Random Forest Regression (RFR) model method, MeSiC, to estimate DNA methylation levels at single-base resolution. MeSiC integrated MeDIP-seq signals of CpG sites and their surrounding neighbors as well as genomic features to construct genomic element-dependent RFR models. In the H1 cell line, a high correlation was observed between MeSiC predictions and actual 5 mC levels. Meanwhile, MeSiC enabled to calibrate CpG density-dependent bias of MeDIP-seq signals. Importantly, we found that MeSiC models constructed in the H1 cell line could be used to accurately predict DNA methylation levels for other cell types. Comparisons with methylCRF and MEDIPS showed that MeSiC achieved comparable and even better performance. These demonstrate that MeSiC can provide accurate estimations of 5 mC levels at single-CpG resolution using MeDIP-seq data alone. Nature Publishing Group 2015-10-01 /pmc/articles/PMC4589794/ /pubmed/26424089 http://dx.doi.org/10.1038/srep14699 Text en Copyright © 2015, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Xiao, Yun Yu, Fulong Pang, Lin Zhao, Hongying Liu, Ling Zhang, Guanxiong Liu, Tingting Zhang, Hongyi Fan, Huihui Zhang, Yan Pang, Bo Li, Xia MeSiC: A Model-Based Method for Estimating 5 mC Levels at Single-CpG Resolution from MeDIP-seq |
title | MeSiC: A Model-Based Method for Estimating 5 mC Levels at Single-CpG Resolution from MeDIP-seq |
title_full | MeSiC: A Model-Based Method for Estimating 5 mC Levels at Single-CpG Resolution from MeDIP-seq |
title_fullStr | MeSiC: A Model-Based Method for Estimating 5 mC Levels at Single-CpG Resolution from MeDIP-seq |
title_full_unstemmed | MeSiC: A Model-Based Method for Estimating 5 mC Levels at Single-CpG Resolution from MeDIP-seq |
title_short | MeSiC: A Model-Based Method for Estimating 5 mC Levels at Single-CpG Resolution from MeDIP-seq |
title_sort | mesic: a model-based method for estimating 5 mc levels at single-cpg resolution from medip-seq |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4589794/ https://www.ncbi.nlm.nih.gov/pubmed/26424089 http://dx.doi.org/10.1038/srep14699 |
work_keys_str_mv | AT xiaoyun mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq AT yufulong mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq AT panglin mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq AT zhaohongying mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq AT liuling mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq AT zhangguanxiong mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq AT liutingting mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq AT zhanghongyi mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq AT fanhuihui mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq AT zhangyan mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq AT pangbo mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq AT lixia mesicamodelbasedmethodforestimating5mclevelsatsinglecpgresolutionfrommedipseq |