Cargando…

PretiMeth: precise prediction models for DNA methylation based on single methylation mark

BACKGROUND: The computational prediction of methylation levels at single CpG resolution is promising to explore the methylation levels of CpGs uncovered by existing array techniques, especially for the 450 K beadchip array data with huge reserves. General prediction models concentrate on improving t...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Jianxiong, Zou, Jianxiao, Zhang, Xiaoran, Fan, Mei, Tian, Qi, Fu, Shuyao, Gao, Shihong, Fan, Shicai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7227319/
https://www.ncbi.nlm.nih.gov/pubmed/32414326
http://dx.doi.org/10.1186/s12864-020-6768-9
_version_ 1783534479525019648
author Tang, Jianxiong
Zou, Jianxiao
Zhang, Xiaoran
Fan, Mei
Tian, Qi
Fu, Shuyao
Gao, Shihong
Fan, Shicai
author_facet Tang, Jianxiong
Zou, Jianxiao
Zhang, Xiaoran
Fan, Mei
Tian, Qi
Fu, Shuyao
Gao, Shihong
Fan, Shicai
author_sort Tang, Jianxiong
collection PubMed
description BACKGROUND: The computational prediction of methylation levels at single CpG resolution is promising to explore the methylation levels of CpGs uncovered by existing array techniques, especially for the 450 K beadchip array data with huge reserves. General prediction models concentrate on improving the overall prediction accuracy for the bulk of CpG loci while neglecting whether each locus is precisely predicted. This leads to the limited application of the prediction results, especially when performing downstream analysis with high precision requirements. RESULTS: Here we reported PretiMeth, a method for constructing precise prediction models for each single CpG locus. PretiMeth used a logistic regression algorithm to build a prediction model for each interested locus. Only one DNA methylation feature that shared the most similar methylation pattern with the CpG locus to be predicted was applied in the model. We found that PretiMeth outperformed other algorithms in the prediction accuracy, and kept robust across platforms and cell types. Furthermore, PretiMeth was applied to The Cancer Genome Atlas data (TCGA), the intensive analysis based on precise prediction results showed that several CpG loci and genes (differentially methylated between the tumor and normal samples) were worthy for further biological validation. CONCLUSION: The precise prediction of single CpG locus is important for both methylation array data expansion and downstream analysis of prediction results. PretiMeth achieved precise modeling for each CpG locus by using only one significant feature, which also suggested that our precise prediction models could be probably used for reference in the probe set design when the DNA methylation beadchip update. PretiMeth is provided as an open source tool via https://github.com/JxTang-bioinformatics/PretiMeth.
format Online
Article
Text
id pubmed-7227319
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-72273192020-05-27 PretiMeth: precise prediction models for DNA methylation based on single methylation mark Tang, Jianxiong Zou, Jianxiao Zhang, Xiaoran Fan, Mei Tian, Qi Fu, Shuyao Gao, Shihong Fan, Shicai BMC Genomics Methodology Article BACKGROUND: The computational prediction of methylation levels at single CpG resolution is promising to explore the methylation levels of CpGs uncovered by existing array techniques, especially for the 450 K beadchip array data with huge reserves. General prediction models concentrate on improving the overall prediction accuracy for the bulk of CpG loci while neglecting whether each locus is precisely predicted. This leads to the limited application of the prediction results, especially when performing downstream analysis with high precision requirements. RESULTS: Here we reported PretiMeth, a method for constructing precise prediction models for each single CpG locus. PretiMeth used a logistic regression algorithm to build a prediction model for each interested locus. Only one DNA methylation feature that shared the most similar methylation pattern with the CpG locus to be predicted was applied in the model. We found that PretiMeth outperformed other algorithms in the prediction accuracy, and kept robust across platforms and cell types. Furthermore, PretiMeth was applied to The Cancer Genome Atlas data (TCGA), the intensive analysis based on precise prediction results showed that several CpG loci and genes (differentially methylated between the tumor and normal samples) were worthy for further biological validation. CONCLUSION: The precise prediction of single CpG locus is important for both methylation array data expansion and downstream analysis of prediction results. PretiMeth achieved precise modeling for each CpG locus by using only one significant feature, which also suggested that our precise prediction models could be probably used for reference in the probe set design when the DNA methylation beadchip update. PretiMeth is provided as an open source tool via https://github.com/JxTang-bioinformatics/PretiMeth. BioMed Central 2020-05-15 /pmc/articles/PMC7227319/ /pubmed/32414326 http://dx.doi.org/10.1186/s12864-020-6768-9 Text en © The Author(s). 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Tang, Jianxiong
Zou, Jianxiao
Zhang, Xiaoran
Fan, Mei
Tian, Qi
Fu, Shuyao
Gao, Shihong
Fan, Shicai
PretiMeth: precise prediction models for DNA methylation based on single methylation mark
title PretiMeth: precise prediction models for DNA methylation based on single methylation mark
title_full PretiMeth: precise prediction models for DNA methylation based on single methylation mark
title_fullStr PretiMeth: precise prediction models for DNA methylation based on single methylation mark
title_full_unstemmed PretiMeth: precise prediction models for DNA methylation based on single methylation mark
title_short PretiMeth: precise prediction models for DNA methylation based on single methylation mark
title_sort pretimeth: precise prediction models for dna methylation based on single methylation mark
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7227319/
https://www.ncbi.nlm.nih.gov/pubmed/32414326
http://dx.doi.org/10.1186/s12864-020-6768-9
work_keys_str_mv AT tangjianxiong pretimethprecisepredictionmodelsfordnamethylationbasedonsinglemethylationmark
AT zoujianxiao pretimethprecisepredictionmodelsfordnamethylationbasedonsinglemethylationmark
AT zhangxiaoran pretimethprecisepredictionmodelsfordnamethylationbasedonsinglemethylationmark
AT fanmei pretimethprecisepredictionmodelsfordnamethylationbasedonsinglemethylationmark
AT tianqi pretimethprecisepredictionmodelsfordnamethylationbasedonsinglemethylationmark
AT fushuyao pretimethprecisepredictionmodelsfordnamethylationbasedonsinglemethylationmark
AT gaoshihong pretimethprecisepredictionmodelsfordnamethylationbasedonsinglemethylationmark
AT fanshicai pretimethprecisepredictionmodelsfordnamethylationbasedonsinglemethylationmark