Cargando…
Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model
MOTIVATION: Identifying the B-cell epitopes is an essential step for guiding rational vaccine development and immunotherapies. Since experimental approaches are expensive and time-consuming, many computational methods have been designed to assist B-cell epitope prediction. However, existing sequence...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10126322/ https://www.ncbi.nlm.nih.gov/pubmed/37039829 http://dx.doi.org/10.1093/bioinformatics/btad187 |
_version_ | 1785030216941305856 |
---|---|
author | Zeng, Yuansong Wei, Zhuoyi Yuan, Qianmu Chen, Sheng Yu, Weijiang Lu, Yutong Gao, Jianzhao Yang, Yuedong |
author_facet | Zeng, Yuansong Wei, Zhuoyi Yuan, Qianmu Chen, Sheng Yu, Weijiang Lu, Yutong Gao, Jianzhao Yang, Yuedong |
author_sort | Zeng, Yuansong |
collection | PubMed |
description | MOTIVATION: Identifying the B-cell epitopes is an essential step for guiding rational vaccine development and immunotherapies. Since experimental approaches are expensive and time-consuming, many computational methods have been designed to assist B-cell epitope prediction. However, existing sequence-based methods have limited performance since they only use contextual features of the sequential neighbors while neglecting structural information. RESULTS: Based on the recent breakthrough of AlphaFold2 in protein structure prediction, we propose GraphBepi, a novel graph-based model for accurate B-cell epitope prediction. For one protein, the predicted structure from AlphaFold2 is used to construct the protein graph, where the nodes/residues are encoded by ESM-2 learning representations. The graph is input into the edge-enhanced deep graph neural network (EGNN) to capture the spatial information in the predicted 3D structures. In parallel, a bidirectional long short-term memory neural networks (BiLSTM) are employed to capture long-range dependencies in the sequence. The learned low-dimensional representations by EGNN and BiLSTM are then combined into a multilayer perceptron for predicting B-cell epitopes. Through comprehensive tests on the curated epitope dataset, GraphBepi was shown to outperform the state-of-the-art methods by more than 5.5% and 44.0% in terms of AUC and AUPR, respectively. A web server is freely available at http://bio-web1.nscc-gz.cn/app/graphbepi. AVAILABILITY AND IMPLEMENTATION: The datasets, pre-computed features, source codes, and the trained model are available at https://github.com/biomed-AI/GraphBepi. |
format | Online Article Text |
id | pubmed-10126322 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-101263222023-04-26 Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model Zeng, Yuansong Wei, Zhuoyi Yuan, Qianmu Chen, Sheng Yu, Weijiang Lu, Yutong Gao, Jianzhao Yang, Yuedong Bioinformatics Original Paper MOTIVATION: Identifying the B-cell epitopes is an essential step for guiding rational vaccine development and immunotherapies. Since experimental approaches are expensive and time-consuming, many computational methods have been designed to assist B-cell epitope prediction. However, existing sequence-based methods have limited performance since they only use contextual features of the sequential neighbors while neglecting structural information. RESULTS: Based on the recent breakthrough of AlphaFold2 in protein structure prediction, we propose GraphBepi, a novel graph-based model for accurate B-cell epitope prediction. For one protein, the predicted structure from AlphaFold2 is used to construct the protein graph, where the nodes/residues are encoded by ESM-2 learning representations. The graph is input into the edge-enhanced deep graph neural network (EGNN) to capture the spatial information in the predicted 3D structures. In parallel, a bidirectional long short-term memory neural networks (BiLSTM) are employed to capture long-range dependencies in the sequence. The learned low-dimensional representations by EGNN and BiLSTM are then combined into a multilayer perceptron for predicting B-cell epitopes. Through comprehensive tests on the curated epitope dataset, GraphBepi was shown to outperform the state-of-the-art methods by more than 5.5% and 44.0% in terms of AUC and AUPR, respectively. A web server is freely available at http://bio-web1.nscc-gz.cn/app/graphbepi. AVAILABILITY AND IMPLEMENTATION: The datasets, pre-computed features, source codes, and the trained model are available at https://github.com/biomed-AI/GraphBepi. Oxford University Press 2023-04-11 /pmc/articles/PMC10126322/ /pubmed/37039829 http://dx.doi.org/10.1093/bioinformatics/btad187 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Zeng, Yuansong Wei, Zhuoyi Yuan, Qianmu Chen, Sheng Yu, Weijiang Lu, Yutong Gao, Jianzhao Yang, Yuedong Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model |
title | Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model |
title_full | Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model |
title_fullStr | Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model |
title_full_unstemmed | Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model |
title_short | Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model |
title_sort | identifying b-cell epitopes using alphafold2 predicted structures and pretrained language model |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10126322/ https://www.ncbi.nlm.nih.gov/pubmed/37039829 http://dx.doi.org/10.1093/bioinformatics/btad187 |
work_keys_str_mv | AT zengyuansong identifyingbcellepitopesusingalphafold2predictedstructuresandpretrainedlanguagemodel AT weizhuoyi identifyingbcellepitopesusingalphafold2predictedstructuresandpretrainedlanguagemodel AT yuanqianmu identifyingbcellepitopesusingalphafold2predictedstructuresandpretrainedlanguagemodel AT chensheng identifyingbcellepitopesusingalphafold2predictedstructuresandpretrainedlanguagemodel AT yuweijiang identifyingbcellepitopesusingalphafold2predictedstructuresandpretrainedlanguagemodel AT luyutong identifyingbcellepitopesusingalphafold2predictedstructuresandpretrainedlanguagemodel AT gaojianzhao identifyingbcellepitopesusingalphafold2predictedstructuresandpretrainedlanguagemodel AT yangyuedong identifyingbcellepitopesusingalphafold2predictedstructuresandpretrainedlanguagemodel |