Deep-4mCGP: A Deep Learning Approach to Predict 4mC Sites in Geobacter pickeringii by Using Correlation-Based Feature Selection Technique

4mC is a type of DNA modification that regulates multiple biological processes, for example, DNA replication, gene expression, and transcriptional regulation. Accurate prediction of 4mC sites can provide precise information about their genetic functions. The purpose of this study...

Bibliographic Details
Main authors: Zulfiqar, Hasan; Huang, Qin-Lai; Lv, Hao; Sun, Zi-Jie; Dao, Fu-Ying; Lin, Hao
Format: Online Article Text
Language: English
Published: MDPI 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8836036/
https://www.ncbi.nlm.nih.gov/pubmed/35163174
http://dx.doi.org/10.3390/ijms23031251
author Zulfiqar, Hasan
Huang, Qin-Lai
Lv, Hao
Sun, Zi-Jie
Dao, Fu-Ying
Lin, Hao
collection PubMed
description 4mC is a type of DNA modification that regulates multiple biological processes, for example, DNA replication, gene expression, and transcriptional regulation. Accurate prediction of 4mC sites can provide precise information about their genetic functions. The purpose of this study was to establish a robust deep learning model to recognize 4mC sites in Geobacter pickeringii. In the proposed model, two kinds of feature descriptors, namely binary encoding and k-mer composition, were used to encode the DNA sequences of Geobacter pickeringii. The features obtained from their fusion were optimized using a correlation and gradient-boosting decision tree (GBDT)-based algorithm with the incremental feature selection (IFS) method. These optimized features were then fed into a 1D convolutional neural network (CNN) to distinguish 4mC sites from non-4mC sites in Geobacter pickeringii. On independent data, the proposed model achieved an accuracy of 0.868, which was 4.2% higher than the existing model.
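The two feature descriptors named in the description can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`binary_encode`, `kmer_composition`, `fuse_features`), the one-hot bit order, the choice of k = 2, and the example sequence are all assumptions made for illustration, since this record does not state those details.

```python
from itertools import product

NUCLEOTIDES = "ACGT"  # assumed ordering for the one-hot bits


def binary_encode(seq):
    """One-hot encode each base as 4 bits (A=1000, C=0100, G=0010, T=0001).

    Characters outside ACGT are encoded as 0000.
    """
    features = []
    for base in seq.upper():
        features.extend(1 if base == n else 0 for n in NUCLEOTIDES)
    return features


def kmer_composition(seq, k=2):
    """Return the normalized frequency of each of the 4**k possible k-mers."""
    seq = seq.upper()
    kmers = ["".join(p) for p in product(NUCLEOTIDES, repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = len(seq) - k + 1  # number of sliding windows
    for i in range(total):
        window = seq[i:i + k]
        if window in counts:  # skip windows with ambiguous bases
            counts[window] += 1
    return [counts[km] / total if total > 0 else 0.0 for km in kmers]


def fuse_features(seq, k=2):
    """Concatenate the binary and k-mer descriptors into one feature vector."""
    return binary_encode(seq) + kmer_composition(seq, k)


# Hypothetical 6-nt sequence: 6 * 4 binary features + 4**2 k-mer features.
vector = fuse_features("ACGTAC", k=2)
print(len(vector))  # 40
```

In a pipeline like the one described, vectors of this kind would then be ranked and pruned by a feature selection step before being passed to the classifier; that step is not shown here.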
format Online
Article
Text
id pubmed-8836036
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-8836036 2022-02-12 Int J Mol Sci Communication MDPI 2022-01-23 /pmc/articles/PMC8836036/ /pubmed/35163174 http://dx.doi.org/10.3390/ijms23031251 Text en © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title Deep-4mCGP: A Deep Learning Approach to Predict 4mC Sites in Geobacter pickeringii by Using Correlation-Based Feature Selection Technique
topic Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8836036/
https://www.ncbi.nlm.nih.gov/pubmed/35163174
http://dx.doi.org/10.3390/ijms23031251