Cargando…
PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability
Peptide detectability is defined as the probability of identifying a peptide from a mixture of standard samples, which is a key step in protein identification and analysis. Exploring effective methods for predicting peptide detectability is helpful for disease treatment and clinical research. Howeve...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9604182/ https://www.ncbi.nlm.nih.gov/pubmed/36293242 http://dx.doi.org/10.3390/ijms232012385 |
_version_ | 1784817749130739712 |
---|---|
author | Wang, Huiqing Wang, Juan Feng, Zhipeng Li, Ying Zhao, Hong |
author_facet | Wang, Huiqing Wang, Juan Feng, Zhipeng Li, Ying Zhao, Hong |
author_sort | Wang, Huiqing |
collection | PubMed |
description | Peptide detectability is defined as the probability of identifying a peptide from a mixture of standard samples, which is a key step in protein identification and analysis. Exploring effective methods for predicting peptide detectability is helpful for disease treatment and clinical research. However, most existing computational methods for predicting peptide detectability rely on a single information. With the increasing complexity of feature representation, it is necessary to explore the influence of multivariate information on peptide detectability. Thus, we propose an ensemble deep learning method, PD-BertEDL. Bidirectional encoder representations from transformers (BERT) is introduced to capture the context information of peptides. Context information, sequence information, and physicochemical information of peptides were combined to construct the multivariate feature space of peptides. We use different deep learning methods to capture the high-quality features of different categories of peptides information and use the average fusion strategy to integrate three model prediction results to solve the heterogeneity problem and to enhance the robustness and adaptability of the model. The experimental results show that PD-BertEDL is superior to the existing prediction methods, which can effectively predict peptide detectability and provide strong support for protein identification and quantitative analysis, as well as disease treatment. |
format | Online Article Text |
id | pubmed-9604182 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96041822022-10-27 PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability Wang, Huiqing Wang, Juan Feng, Zhipeng Li, Ying Zhao, Hong Int J Mol Sci Article Peptide detectability is defined as the probability of identifying a peptide from a mixture of standard samples, which is a key step in protein identification and analysis. Exploring effective methods for predicting peptide detectability is helpful for disease treatment and clinical research. However, most existing computational methods for predicting peptide detectability rely on a single information. With the increasing complexity of feature representation, it is necessary to explore the influence of multivariate information on peptide detectability. Thus, we propose an ensemble deep learning method, PD-BertEDL. Bidirectional encoder representations from transformers (BERT) is introduced to capture the context information of peptides. Context information, sequence information, and physicochemical information of peptides were combined to construct the multivariate feature space of peptides. We use different deep learning methods to capture the high-quality features of different categories of peptides information and use the average fusion strategy to integrate three model prediction results to solve the heterogeneity problem and to enhance the robustness and adaptability of the model. The experimental results show that PD-BertEDL is superior to the existing prediction methods, which can effectively predict peptide detectability and provide strong support for protein identification and quantitative analysis, as well as disease treatment. MDPI 2022-10-16 /pmc/articles/PMC9604182/ /pubmed/36293242 http://dx.doi.org/10.3390/ijms232012385 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Huiqing Wang, Juan Feng, Zhipeng Li, Ying Zhao, Hong PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability |
title | PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability |
title_full | PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability |
title_fullStr | PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability |
title_full_unstemmed | PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability |
title_short | PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability |
title_sort | pd-bertedl: an ensemble deep learning method using bert and multivariate representation to predict peptide detectability |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9604182/ https://www.ncbi.nlm.nih.gov/pubmed/36293242 http://dx.doi.org/10.3390/ijms232012385 |
work_keys_str_mv | AT wanghuiqing pdbertedlanensembledeeplearningmethodusingbertandmultivariaterepresentationtopredictpeptidedetectability AT wangjuan pdbertedlanensembledeeplearningmethodusingbertandmultivariaterepresentationtopredictpeptidedetectability AT fengzhipeng pdbertedlanensembledeeplearningmethodusingbertandmultivariaterepresentationtopredictpeptidedetectability AT liying pdbertedlanensembledeeplearningmethodusingbertandmultivariaterepresentationtopredictpeptidedetectability AT zhaohong pdbertedlanensembledeeplearningmethodusingbertandmultivariaterepresentationtopredictpeptidedetectability |