Cargando…

Research on DNA-Binding Protein Identification Method Based on LSTM-CNN Feature Fusion

Protein is closely related to life activities. As a kind of protein, DNA-binding protein plays an irreplaceable role in life activities. Therefore, it is very important to study DNA-binding protein, which is a subject worthy of study. Although traditional biotechnology has high precision, its cost a...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Weizhong, Chen, Xiaoyi, Zhang, Yu, Wu, Hongjie, Ding, Yijie, Shen, Jiawei, Guan, Shixuan, Li, Haiou
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9184165/
https://www.ncbi.nlm.nih.gov/pubmed/35693256
http://dx.doi.org/10.1155/2022/9705275
_version_ 1784724450421243904
author Lu, Weizhong
Chen, Xiaoyi
Zhang, Yu
Wu, Hongjie
Ding, Yijie
Shen, Jiawei
Guan, Shixuan
Li, Haiou
author_facet Lu, Weizhong
Chen, Xiaoyi
Zhang, Yu
Wu, Hongjie
Ding, Yijie
Shen, Jiawei
Guan, Shixuan
Li, Haiou
author_sort Lu, Weizhong
collection PubMed
description Protein is closely related to life activities. As a kind of protein, DNA-binding protein plays an irreplaceable role in life activities. Therefore, it is very important to study DNA-binding protein, which is a subject worthy of study. Although traditional biotechnology has high precision, its cost and efficiency are increasingly unable to meet the needs of modern society. Machine learning methods can make up for the deficiencies of biological experimental techniques to a certain extent, but they are not as simple and fast as deep learning for data processing. In this paper, a deep learning framework based on parallel long and short-term memory(LSTM) and convolutional neural networks(CNN) was proposed to identify DNA-binding protein. This model can not only further extract the information and features of protein sequences, but also the features of evolutionary information. Finally, the two features are combined for training and testing. On the PDB2272 dataset, compared with PDBP_Fusion model, Accuracy(ACC) and Matthew's Correlation Coefficient (MCC) increased by 3.82% and 7.98% respectively. The experimental results of this model have certain advantages.
format Online
Article
Text
id pubmed-9184165
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-91841652022-06-10 Research on DNA-Binding Protein Identification Method Based on LSTM-CNN Feature Fusion Lu, Weizhong Chen, Xiaoyi Zhang, Yu Wu, Hongjie Ding, Yijie Shen, Jiawei Guan, Shixuan Li, Haiou Comput Math Methods Med Research Article Protein is closely related to life activities. As a kind of protein, DNA-binding protein plays an irreplaceable role in life activities. Therefore, it is very important to study DNA-binding protein, which is a subject worthy of study. Although traditional biotechnology has high precision, its cost and efficiency are increasingly unable to meet the needs of modern society. Machine learning methods can make up for the deficiencies of biological experimental techniques to a certain extent, but they are not as simple and fast as deep learning for data processing. In this paper, a deep learning framework based on parallel long and short-term memory(LSTM) and convolutional neural networks(CNN) was proposed to identify DNA-binding protein. This model can not only further extract the information and features of protein sequences, but also the features of evolutionary information. Finally, the two features are combined for training and testing. On the PDB2272 dataset, compared with PDBP_Fusion model, Accuracy(ACC) and Matthew's Correlation Coefficient (MCC) increased by 3.82% and 7.98% respectively. The experimental results of this model have certain advantages. Hindawi 2022-06-02 /pmc/articles/PMC9184165/ /pubmed/35693256 http://dx.doi.org/10.1155/2022/9705275 Text en Copyright © 2022 Weizhong Lu et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Lu, Weizhong
Chen, Xiaoyi
Zhang, Yu
Wu, Hongjie
Ding, Yijie
Shen, Jiawei
Guan, Shixuan
Li, Haiou
Research on DNA-Binding Protein Identification Method Based on LSTM-CNN Feature Fusion
title Research on DNA-Binding Protein Identification Method Based on LSTM-CNN Feature Fusion
title_full Research on DNA-Binding Protein Identification Method Based on LSTM-CNN Feature Fusion
title_fullStr Research on DNA-Binding Protein Identification Method Based on LSTM-CNN Feature Fusion
title_full_unstemmed Research on DNA-Binding Protein Identification Method Based on LSTM-CNN Feature Fusion
title_short Research on DNA-Binding Protein Identification Method Based on LSTM-CNN Feature Fusion
title_sort research on dna-binding protein identification method based on lstm-cnn feature fusion
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9184165/
https://www.ncbi.nlm.nih.gov/pubmed/35693256
http://dx.doi.org/10.1155/2022/9705275
work_keys_str_mv AT luweizhong researchondnabindingproteinidentificationmethodbasedonlstmcnnfeaturefusion
AT chenxiaoyi researchondnabindingproteinidentificationmethodbasedonlstmcnnfeaturefusion
AT zhangyu researchondnabindingproteinidentificationmethodbasedonlstmcnnfeaturefusion
AT wuhongjie researchondnabindingproteinidentificationmethodbasedonlstmcnnfeaturefusion
AT dingyijie researchondnabindingproteinidentificationmethodbasedonlstmcnnfeaturefusion
AT shenjiawei researchondnabindingproteinidentificationmethodbasedonlstmcnnfeaturefusion
AT guanshixuan researchondnabindingproteinidentificationmethodbasedonlstmcnnfeaturefusion
AT lihaiou researchondnabindingproteinidentificationmethodbasedonlstmcnnfeaturefusion