Cargando…

Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model

MOTIVATION: Identifying the B-cell epitopes is an essential step for guiding rational vaccine development and immunotherapies. Since experimental approaches are expensive and time-consuming, many computational methods have been designed to assist B-cell epitope prediction. However, existing sequence...

Descripción completa

Detalles Bibliográficos
Autores principales: Zeng, Yuansong, Wei, Zhuoyi, Yuan, Qianmu, Chen, Sheng, Yu, Weijiang, Lu, Yutong, Gao, Jianzhao, Yang, Yuedong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10126322/
https://www.ncbi.nlm.nih.gov/pubmed/37039829
http://dx.doi.org/10.1093/bioinformatics/btad187
_version_ 1785030216941305856
author Zeng, Yuansong
Wei, Zhuoyi
Yuan, Qianmu
Chen, Sheng
Yu, Weijiang
Lu, Yutong
Gao, Jianzhao
Yang, Yuedong
author_facet Zeng, Yuansong
Wei, Zhuoyi
Yuan, Qianmu
Chen, Sheng
Yu, Weijiang
Lu, Yutong
Gao, Jianzhao
Yang, Yuedong
author_sort Zeng, Yuansong
collection PubMed
description MOTIVATION: Identifying the B-cell epitopes is an essential step for guiding rational vaccine development and immunotherapies. Since experimental approaches are expensive and time-consuming, many computational methods have been designed to assist B-cell epitope prediction. However, existing sequence-based methods have limited performance since they only use contextual features of the sequential neighbors while neglecting structural information. RESULTS: Based on the recent breakthrough of AlphaFold2 in protein structure prediction, we propose GraphBepi, a novel graph-based model for accurate B-cell epitope prediction. For one protein, the predicted structure from AlphaFold2 is used to construct the protein graph, where the nodes/residues are encoded by ESM-2 learning representations. The graph is input into the edge-enhanced deep graph neural network (EGNN) to capture the spatial information in the predicted 3D structures. In parallel, a bidirectional long short-term memory neural networks (BiLSTM) are employed to capture long-range dependencies in the sequence. The learned low-dimensional representations by EGNN and BiLSTM are then combined into a multilayer perceptron for predicting B-cell epitopes. Through comprehensive tests on the curated epitope dataset, GraphBepi was shown to outperform the state-of-the-art methods by more than 5.5% and 44.0% in terms of AUC and AUPR, respectively. A web server is freely available at http://bio-web1.nscc-gz.cn/app/graphbepi. AVAILABILITY AND IMPLEMENTATION: The datasets, pre-computed features, source codes, and the trained model are available at https://github.com/biomed-AI/GraphBepi.
format Online
Article
Text
id pubmed-10126322
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101263222023-04-26 Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model Zeng, Yuansong Wei, Zhuoyi Yuan, Qianmu Chen, Sheng Yu, Weijiang Lu, Yutong Gao, Jianzhao Yang, Yuedong Bioinformatics Original Paper MOTIVATION: Identifying the B-cell epitopes is an essential step for guiding rational vaccine development and immunotherapies. Since experimental approaches are expensive and time-consuming, many computational methods have been designed to assist B-cell epitope prediction. However, existing sequence-based methods have limited performance since they only use contextual features of the sequential neighbors while neglecting structural information. RESULTS: Based on the recent breakthrough of AlphaFold2 in protein structure prediction, we propose GraphBepi, a novel graph-based model for accurate B-cell epitope prediction. For one protein, the predicted structure from AlphaFold2 is used to construct the protein graph, where the nodes/residues are encoded by ESM-2 learning representations. The graph is input into the edge-enhanced deep graph neural network (EGNN) to capture the spatial information in the predicted 3D structures. In parallel, a bidirectional long short-term memory neural networks (BiLSTM) are employed to capture long-range dependencies in the sequence. The learned low-dimensional representations by EGNN and BiLSTM are then combined into a multilayer perceptron for predicting B-cell epitopes. Through comprehensive tests on the curated epitope dataset, GraphBepi was shown to outperform the state-of-the-art methods by more than 5.5% and 44.0% in terms of AUC and AUPR, respectively. A web server is freely available at http://bio-web1.nscc-gz.cn/app/graphbepi. AVAILABILITY AND IMPLEMENTATION: The datasets, pre-computed features, source codes, and the trained model are available at https://github.com/biomed-AI/GraphBepi. Oxford University Press 2023-04-11 /pmc/articles/PMC10126322/ /pubmed/37039829 http://dx.doi.org/10.1093/bioinformatics/btad187 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Zeng, Yuansong
Wei, Zhuoyi
Yuan, Qianmu
Chen, Sheng
Yu, Weijiang
Lu, Yutong
Gao, Jianzhao
Yang, Yuedong
Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model
title Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model
title_full Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model
title_fullStr Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model
title_full_unstemmed Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model
title_short Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model
title_sort identifying b-cell epitopes using alphafold2 predicted structures and pretrained language model
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10126322/
https://www.ncbi.nlm.nih.gov/pubmed/37039829
http://dx.doi.org/10.1093/bioinformatics/btad187
work_keys_str_mv AT zengyuansong identifyingbcellepitopesusingalphafold2predictedstructuresandpretrainedlanguagemodel
AT weizhuoyi identifyingbcellepitopesusingalphafold2predictedstructuresandpretrainedlanguagemodel
AT yuanqianmu identifyingbcellepitopesusingalphafold2predictedstructuresandpretrainedlanguagemodel
AT chensheng identifyingbcellepitopesusingalphafold2predictedstructuresandpretrainedlanguagemodel
AT yuweijiang identifyingbcellepitopesusingalphafold2predictedstructuresandpretrainedlanguagemodel
AT luyutong identifyingbcellepitopesusingalphafold2predictedstructuresandpretrainedlanguagemodel
AT gaojianzhao identifyingbcellepitopesusingalphafold2predictedstructuresandpretrainedlanguagemodel
AT yangyuedong identifyingbcellepitopesusingalphafold2predictedstructuresandpretrainedlanguagemodel