Cargando…

Determining epitope specificity of T-cell receptors with transformers

SUMMARY: T-cell receptors (TCRs) on T cells recognize and bind to epitopes presented by the major histocompatibility complex in case of an infection or cancer. However, the high diversity of TCRs, as well as their unique and complex binding mechanisms underlying epitope recognition, make it difficul...

Descripción completa

Detalles Bibliográficos
Autores principales: Khan, Abdul Rehman, Reinders, Marcel J T, Khatri, Indu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10636277/
https://www.ncbi.nlm.nih.gov/pubmed/37847663
http://dx.doi.org/10.1093/bioinformatics/btad632
_version_ 1785133177869697024
author Khan, Abdul Rehman
Reinders, Marcel J T
Khatri, Indu
author_facet Khan, Abdul Rehman
Reinders, Marcel J T
Khatri, Indu
author_sort Khan, Abdul Rehman
collection PubMed
description SUMMARY: T-cell receptors (TCRs) on T cells recognize and bind to epitopes presented by the major histocompatibility complex in case of an infection or cancer. However, the high diversity of TCRs, as well as their unique and complex binding mechanisms underlying epitope recognition, make it difficult to predict the binding between TCRs and epitopes. Here, we present the utility of transformers, a deep learning strategy that incorporates an attention mechanism that learns the informative features, and show that these models pre-trained on a large set of protein sequences outperform current strategies. We compared three pre-trained auto-encoder transformer models (ProtBERT, ProtAlbert, and ProtElectra) and one pre-trained auto-regressive transformer model (ProtXLNet) to predict the binding specificity of TCRs to 25 epitopes from the VDJdb database (human and murine). Two additional modifications were performed to incorporate gene usage of the TCRs in the four transformer models. Of all 12 transformer implementations (four models with three different modifications), a modified version of the ProtXLNet model could predict TCR–epitope pairs with the highest accuracy (weighted F1 score 0.55 simultaneously considering all 25 epitopes). The modification included additional features representing the gene names for the TCRs. We also showed that the basic implementation of transformers outperformed the previously available methods, i.e. TCRGP, TCRdist, and DeepTCR, developed for the same biological problem, especially for the hard-to-classify labels. We show that the proficiency of transformers in attention learning can be made operational in a complex biological setting like TCR binding prediction. Further ingenuity in utilizing the full potential of transformers, either through attention head visualization or introducing additional features, can extend T-cell research avenues. AVAILABILITY AND IMPLEMENTATION: Data and code are available on https://github.com/InduKhatri/tcrformer.
format Online
Article
Text
id pubmed-10636277
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106362772023-11-11 Determining epitope specificity of T-cell receptors with transformers Khan, Abdul Rehman Reinders, Marcel J T Khatri, Indu Bioinformatics Original Paper SUMMARY: T-cell receptors (TCRs) on T cells recognize and bind to epitopes presented by the major histocompatibility complex in case of an infection or cancer. However, the high diversity of TCRs, as well as their unique and complex binding mechanisms underlying epitope recognition, make it difficult to predict the binding between TCRs and epitopes. Here, we present the utility of transformers, a deep learning strategy that incorporates an attention mechanism that learns the informative features, and show that these models pre-trained on a large set of protein sequences outperform current strategies. We compared three pre-trained auto-encoder transformer models (ProtBERT, ProtAlbert, and ProtElectra) and one pre-trained auto-regressive transformer model (ProtXLNet) to predict the binding specificity of TCRs to 25 epitopes from the VDJdb database (human and murine). Two additional modifications were performed to incorporate gene usage of the TCRs in the four transformer models. Of all 12 transformer implementations (four models with three different modifications), a modified version of the ProtXLNet model could predict TCR–epitope pairs with the highest accuracy (weighted F1 score 0.55 simultaneously considering all 25 epitopes). The modification included additional features representing the gene names for the TCRs. We also showed that the basic implementation of transformers outperformed the previously available methods, i.e. TCRGP, TCRdist, and DeepTCR, developed for the same biological problem, especially for the hard-to-classify labels. We show that the proficiency of transformers in attention learning can be made operational in a complex biological setting like TCR binding prediction. Further ingenuity in utilizing the full potential of transformers, either through attention head visualization or introducing additional features, can extend T-cell research avenues. AVAILABILITY AND IMPLEMENTATION: Data and code are available on https://github.com/InduKhatri/tcrformer. Oxford University Press 2023-10-17 /pmc/articles/PMC10636277/ /pubmed/37847663 http://dx.doi.org/10.1093/bioinformatics/btad632 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Khan, Abdul Rehman
Reinders, Marcel J T
Khatri, Indu
Determining epitope specificity of T-cell receptors with transformers
title Determining epitope specificity of T-cell receptors with transformers
title_full Determining epitope specificity of T-cell receptors with transformers
title_fullStr Determining epitope specificity of T-cell receptors with transformers
title_full_unstemmed Determining epitope specificity of T-cell receptors with transformers
title_short Determining epitope specificity of T-cell receptors with transformers
title_sort determining epitope specificity of t-cell receptors with transformers
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10636277/
https://www.ncbi.nlm.nih.gov/pubmed/37847663
http://dx.doi.org/10.1093/bioinformatics/btad632
work_keys_str_mv AT khanabdulrehman determiningepitopespecificityoftcellreceptorswithtransformers
AT reindersmarceljt determiningepitopespecificityoftcellreceptorswithtransformers
AT khatriindu determiningepitopespecificityoftcellreceptorswithtransformers