Cargando…

SecProCT: In Silico Prediction of Human Secretory Proteins Based on Capsule Network and Transformer

Identifying secretory proteins from blood, saliva or other body fluids has become an effective method of diagnosing diseases. Existing secretory protein prediction methods are mainly based on conventional machine learning algorithms and are highly dependent on the feature set from the protein. In th...

Descripción completa

Detalles Bibliográficos
Autores principales: Du, Wei, Zhao, Xuan, Sun, Yu, Zheng, Lei, Li, Ying, Zhang, Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8396571/
https://www.ncbi.nlm.nih.gov/pubmed/34445760
http://dx.doi.org/10.3390/ijms22169054
_version_ 1783744403927465984
author Du, Wei
Zhao, Xuan
Sun, Yu
Zheng, Lei
Li, Ying
Zhang, Yu
author_facet Du, Wei
Zhao, Xuan
Sun, Yu
Zheng, Lei
Li, Ying
Zhang, Yu
author_sort Du, Wei
collection PubMed
description Identifying secretory proteins from blood, saliva or other body fluids has become an effective method of diagnosing diseases. Existing secretory protein prediction methods are mainly based on conventional machine learning algorithms and are highly dependent on the feature set from the protein. In this article, we propose a deep learning model based on the capsule network and transformer architecture, SecProCT, to predict secretory proteins using only amino acid sequences. The proposed model was validated using cross-validation and achieved 0.921 and 0.892 accuracy for predicting blood-secretory proteins and saliva-secretory proteins, respectively. Meanwhile, the proposed model was validated on an independent test set and achieved 0.917 and 0.905 accuracy for predicting blood-secretory proteins and saliva-secretory proteins, respectively, which are better than conventional machine learning methods and other deep learning methods for biological sequence analysis. The main contributions of this article are as follows: (1) a deep learning model based on a capsule network and transformer architecture is proposed for predicting secretory proteins. The results of this model are better than the those of existing conventional machine learning methods and deep learning methods for biological sequence analysis; (2) only amino acid sequences are used in the proposed model, which overcomes the high dependence of existing methods on the annotated protein features; (3) the proposed model can accurately predict most experimentally verified secretory proteins and cancer protein biomarkers in blood and saliva.
format Online
Article
Text
id pubmed-8396571
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83965712021-08-28 SecProCT: In Silico Prediction of Human Secretory Proteins Based on Capsule Network and Transformer Du, Wei Zhao, Xuan Sun, Yu Zheng, Lei Li, Ying Zhang, Yu Int J Mol Sci Article Identifying secretory proteins from blood, saliva or other body fluids has become an effective method of diagnosing diseases. Existing secretory protein prediction methods are mainly based on conventional machine learning algorithms and are highly dependent on the feature set from the protein. In this article, we propose a deep learning model based on the capsule network and transformer architecture, SecProCT, to predict secretory proteins using only amino acid sequences. The proposed model was validated using cross-validation and achieved 0.921 and 0.892 accuracy for predicting blood-secretory proteins and saliva-secretory proteins, respectively. Meanwhile, the proposed model was validated on an independent test set and achieved 0.917 and 0.905 accuracy for predicting blood-secretory proteins and saliva-secretory proteins, respectively, which are better than conventional machine learning methods and other deep learning methods for biological sequence analysis. The main contributions of this article are as follows: (1) a deep learning model based on a capsule network and transformer architecture is proposed for predicting secretory proteins. The results of this model are better than the those of existing conventional machine learning methods and deep learning methods for biological sequence analysis; (2) only amino acid sequences are used in the proposed model, which overcomes the high dependence of existing methods on the annotated protein features; (3) the proposed model can accurately predict most experimentally verified secretory proteins and cancer protein biomarkers in blood and saliva. MDPI 2021-08-22 /pmc/articles/PMC8396571/ /pubmed/34445760 http://dx.doi.org/10.3390/ijms22169054 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Du, Wei
Zhao, Xuan
Sun, Yu
Zheng, Lei
Li, Ying
Zhang, Yu
SecProCT: In Silico Prediction of Human Secretory Proteins Based on Capsule Network and Transformer
title SecProCT: In Silico Prediction of Human Secretory Proteins Based on Capsule Network and Transformer
title_full SecProCT: In Silico Prediction of Human Secretory Proteins Based on Capsule Network and Transformer
title_fullStr SecProCT: In Silico Prediction of Human Secretory Proteins Based on Capsule Network and Transformer
title_full_unstemmed SecProCT: In Silico Prediction of Human Secretory Proteins Based on Capsule Network and Transformer
title_short SecProCT: In Silico Prediction of Human Secretory Proteins Based on Capsule Network and Transformer
title_sort secproct: in silico prediction of human secretory proteins based on capsule network and transformer
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8396571/
https://www.ncbi.nlm.nih.gov/pubmed/34445760
http://dx.doi.org/10.3390/ijms22169054
work_keys_str_mv AT duwei secproctinsilicopredictionofhumansecretoryproteinsbasedoncapsulenetworkandtransformer
AT zhaoxuan secproctinsilicopredictionofhumansecretoryproteinsbasedoncapsulenetworkandtransformer
AT sunyu secproctinsilicopredictionofhumansecretoryproteinsbasedoncapsulenetworkandtransformer
AT zhenglei secproctinsilicopredictionofhumansecretoryproteinsbasedoncapsulenetworkandtransformer
AT liying secproctinsilicopredictionofhumansecretoryproteinsbasedoncapsulenetworkandtransformer
AT zhangyu secproctinsilicopredictionofhumansecretoryproteinsbasedoncapsulenetworkandtransformer