Cargando…

Prediction of DNA Methylation based on Multi-dimensional feature encoding and double convolutional fully connected convolutional neural network

DNA methylation takes on critical significance to the regulation of gene expression by affecting the stability of DNA and changing the structure of chromosomes. DNA methylation modification sites should be identified, which lays a solid basis for gaining more insights into their biological functions...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Wenxing, Guan, Lixin, Li, Mengshan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10461834/
https://www.ncbi.nlm.nih.gov/pubmed/37639434
http://dx.doi.org/10.1371/journal.pcbi.1011370
_version_ 1785097919881281536
author Hu, Wenxing
Guan, Lixin
Li, Mengshan
author_facet Hu, Wenxing
Guan, Lixin
Li, Mengshan
author_sort Hu, Wenxing
collection PubMed
description DNA methylation takes on critical significance to the regulation of gene expression by affecting the stability of DNA and changing the structure of chromosomes. DNA methylation modification sites should be identified, which lays a solid basis for gaining more insights into their biological functions. Existing machine learning-based methods of predicting DNA methylation have not fully exploited the hidden multidimensional information in DNA gene sequences, such that the prediction accuracy of models is significantly limited. Besides, most models have been built in terms of a single methylation type. To address the above-mentioned issues, a deep learning-based method was proposed in this study for DNA methylation site prediction, termed the MEDCNN model. The MEDCNN model is capable of extracting feature information from gene sequences in three dimensions (i.e., positional information, biological information, and chemical information). Moreover, the proposed method employs a convolutional neural network model with double convolutional layers and double fully connected layers while iteratively updating the gradient descent algorithm using the cross-entropy loss function to increase the prediction accuracy of the model. Besides, the MEDCNN model can predict different types of DNA methylation sites. As indicated by the experimental results,the deep learning method based on coding from multiple dimensions outperformed single coding methods, and the MEDCNN model was highly applicable and outperformed existing models in predicting DNA methylation between different species. As revealed by the above-described findings, the MEDCNN model can be effective in predicting DNA methylation sites.
format Online
Article
Text
id pubmed-10461834
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-104618342023-08-29 Prediction of DNA Methylation based on Multi-dimensional feature encoding and double convolutional fully connected convolutional neural network Hu, Wenxing Guan, Lixin Li, Mengshan PLoS Comput Biol Research Article DNA methylation takes on critical significance to the regulation of gene expression by affecting the stability of DNA and changing the structure of chromosomes. DNA methylation modification sites should be identified, which lays a solid basis for gaining more insights into their biological functions. Existing machine learning-based methods of predicting DNA methylation have not fully exploited the hidden multidimensional information in DNA gene sequences, such that the prediction accuracy of models is significantly limited. Besides, most models have been built in terms of a single methylation type. To address the above-mentioned issues, a deep learning-based method was proposed in this study for DNA methylation site prediction, termed the MEDCNN model. The MEDCNN model is capable of extracting feature information from gene sequences in three dimensions (i.e., positional information, biological information, and chemical information). Moreover, the proposed method employs a convolutional neural network model with double convolutional layers and double fully connected layers while iteratively updating the gradient descent algorithm using the cross-entropy loss function to increase the prediction accuracy of the model. Besides, the MEDCNN model can predict different types of DNA methylation sites. As indicated by the experimental results,the deep learning method based on coding from multiple dimensions outperformed single coding methods, and the MEDCNN model was highly applicable and outperformed existing models in predicting DNA methylation between different species. As revealed by the above-described findings, the MEDCNN model can be effective in predicting DNA methylation sites. Public Library of Science 2023-08-28 /pmc/articles/PMC10461834/ /pubmed/37639434 http://dx.doi.org/10.1371/journal.pcbi.1011370 Text en © 2023 Hu et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Hu, Wenxing
Guan, Lixin
Li, Mengshan
Prediction of DNA Methylation based on Multi-dimensional feature encoding and double convolutional fully connected convolutional neural network
title Prediction of DNA Methylation based on Multi-dimensional feature encoding and double convolutional fully connected convolutional neural network
title_full Prediction of DNA Methylation based on Multi-dimensional feature encoding and double convolutional fully connected convolutional neural network
title_fullStr Prediction of DNA Methylation based on Multi-dimensional feature encoding and double convolutional fully connected convolutional neural network
title_full_unstemmed Prediction of DNA Methylation based on Multi-dimensional feature encoding and double convolutional fully connected convolutional neural network
title_short Prediction of DNA Methylation based on Multi-dimensional feature encoding and double convolutional fully connected convolutional neural network
title_sort prediction of dna methylation based on multi-dimensional feature encoding and double convolutional fully connected convolutional neural network
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10461834/
https://www.ncbi.nlm.nih.gov/pubmed/37639434
http://dx.doi.org/10.1371/journal.pcbi.1011370
work_keys_str_mv AT huwenxing predictionofdnamethylationbasedonmultidimensionalfeatureencodinganddoubleconvolutionalfullyconnectedconvolutionalneuralnetwork
AT guanlixin predictionofdnamethylationbasedonmultidimensionalfeatureencodinganddoubleconvolutionalfullyconnectedconvolutionalneuralnetwork
AT limengshan predictionofdnamethylationbasedonmultidimensionalfeatureencodinganddoubleconvolutionalfullyconnectedconvolutionalneuralnetwork