Cargando…

PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability

Peptide detectability is defined as the probability of identifying a peptide from a mixture of standard samples, which is a key step in protein identification and analysis. Exploring effective methods for predicting peptide detectability is helpful for disease treatment and clinical research. Howeve...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Huiqing, Wang, Juan, Feng, Zhipeng, Li, Ying, Zhao, Hong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9604182/
https://www.ncbi.nlm.nih.gov/pubmed/36293242
http://dx.doi.org/10.3390/ijms232012385
_version_ 1784817749130739712
author Wang, Huiqing
Wang, Juan
Feng, Zhipeng
Li, Ying
Zhao, Hong
author_facet Wang, Huiqing
Wang, Juan
Feng, Zhipeng
Li, Ying
Zhao, Hong
author_sort Wang, Huiqing
collection PubMed
description Peptide detectability is defined as the probability of identifying a peptide from a mixture of standard samples, which is a key step in protein identification and analysis. Exploring effective methods for predicting peptide detectability is helpful for disease treatment and clinical research. However, most existing computational methods for predicting peptide detectability rely on a single information. With the increasing complexity of feature representation, it is necessary to explore the influence of multivariate information on peptide detectability. Thus, we propose an ensemble deep learning method, PD-BertEDL. Bidirectional encoder representations from transformers (BERT) is introduced to capture the context information of peptides. Context information, sequence information, and physicochemical information of peptides were combined to construct the multivariate feature space of peptides. We use different deep learning methods to capture the high-quality features of different categories of peptides information and use the average fusion strategy to integrate three model prediction results to solve the heterogeneity problem and to enhance the robustness and adaptability of the model. The experimental results show that PD-BertEDL is superior to the existing prediction methods, which can effectively predict peptide detectability and provide strong support for protein identification and quantitative analysis, as well as disease treatment.
format Online
Article
Text
id pubmed-9604182
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96041822022-10-27 PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability Wang, Huiqing Wang, Juan Feng, Zhipeng Li, Ying Zhao, Hong Int J Mol Sci Article Peptide detectability is defined as the probability of identifying a peptide from a mixture of standard samples, which is a key step in protein identification and analysis. Exploring effective methods for predicting peptide detectability is helpful for disease treatment and clinical research. However, most existing computational methods for predicting peptide detectability rely on a single information. With the increasing complexity of feature representation, it is necessary to explore the influence of multivariate information on peptide detectability. Thus, we propose an ensemble deep learning method, PD-BertEDL. Bidirectional encoder representations from transformers (BERT) is introduced to capture the context information of peptides. Context information, sequence information, and physicochemical information of peptides were combined to construct the multivariate feature space of peptides. We use different deep learning methods to capture the high-quality features of different categories of peptides information and use the average fusion strategy to integrate three model prediction results to solve the heterogeneity problem and to enhance the robustness and adaptability of the model. The experimental results show that PD-BertEDL is superior to the existing prediction methods, which can effectively predict peptide detectability and provide strong support for protein identification and quantitative analysis, as well as disease treatment. MDPI 2022-10-16 /pmc/articles/PMC9604182/ /pubmed/36293242 http://dx.doi.org/10.3390/ijms232012385 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Huiqing
Wang, Juan
Feng, Zhipeng
Li, Ying
Zhao, Hong
PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability
title PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability
title_full PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability
title_fullStr PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability
title_full_unstemmed PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability
title_short PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability
title_sort pd-bertedl: an ensemble deep learning method using bert and multivariate representation to predict peptide detectability
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9604182/
https://www.ncbi.nlm.nih.gov/pubmed/36293242
http://dx.doi.org/10.3390/ijms232012385
work_keys_str_mv AT wanghuiqing pdbertedlanensembledeeplearningmethodusingbertandmultivariaterepresentationtopredictpeptidedetectability
AT wangjuan pdbertedlanensembledeeplearningmethodusingbertandmultivariaterepresentationtopredictpeptidedetectability
AT fengzhipeng pdbertedlanensembledeeplearningmethodusingbertandmultivariaterepresentationtopredictpeptidedetectability
AT liying pdbertedlanensembledeeplearningmethodusingbertandmultivariaterepresentationtopredictpeptidedetectability
AT zhaohong pdbertedlanensembledeeplearningmethodusingbertandmultivariaterepresentationtopredictpeptidedetectability