Cargando…
TAPIR: a T-cell receptor language model for predicting rare and novel targets
T-cell receptors (TCRs) are involved in most human diseases, but linking their sequences with their targets remains an unsolved grand challenge in the field. In this study, we present TAPIR (T-cell receptor and Peptide Interaction Recognizer), a T-cell receptor (TCR) language model that predicts TCR...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10515850/ https://www.ncbi.nlm.nih.gov/pubmed/37745475 http://dx.doi.org/10.1101/2023.09.12.557285 |
_version_ | 1785109032000815104 |
---|---|
author | Fast, Ethan Dhar, Manjima Chen, Binbin |
author_facet | Fast, Ethan Dhar, Manjima Chen, Binbin |
author_sort | Fast, Ethan |
collection | PubMed |
description | T-cell receptors (TCRs) are involved in most human diseases, but linking their sequences with their targets remains an unsolved grand challenge in the field. In this study, we present TAPIR (T-cell receptor and Peptide Interaction Recognizer), a T-cell receptor (TCR) language model that predicts TCR-target interactions, with a focus on novel and rare targets. TAPIR employs deep convolutional neural network (CNN) encoders to process TCR and target sequences across flexible representations (e.g., beta-chain only, unknown MHC allele, etc.) and learns patterns of interactivity via several training tasks. This flexibility allows TAPIR to train on more than 50k either paired (alpha and beta chain) or unpaired TCRs (just alpha or beta chain) from public and proprietary databases against 1933 unique targets. TAPIR demonstrates state-of-the-art performance when predicting TCR interactivity against common benchmark targets and is the first method to demonstrate strong performance when predicting TCR interactivity against novel targets, where no examples are provided in training. TAPIR is also capable of predicting TCR interaction against MHC alleles in the absence of target information. Leveraging these capabilities, we apply TAPIR to cancer patient TCR repertoires and identify and validate a novel and potent anti-cancer T-cell receptor against a shared cancer neoantigen target (PIK3CA H1047L). We further show how TAPIR, when extended with a generative neural network, is capable of directly designing T-cell receptor sequences that interact with a target of interest. |
format | Online Article Text |
id | pubmed-10515850 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-105158502023-09-23 TAPIR: a T-cell receptor language model for predicting rare and novel targets Fast, Ethan Dhar, Manjima Chen, Binbin bioRxiv Article T-cell receptors (TCRs) are involved in most human diseases, but linking their sequences with their targets remains an unsolved grand challenge in the field. In this study, we present TAPIR (T-cell receptor and Peptide Interaction Recognizer), a T-cell receptor (TCR) language model that predicts TCR-target interactions, with a focus on novel and rare targets. TAPIR employs deep convolutional neural network (CNN) encoders to process TCR and target sequences across flexible representations (e.g., beta-chain only, unknown MHC allele, etc.) and learns patterns of interactivity via several training tasks. This flexibility allows TAPIR to train on more than 50k either paired (alpha and beta chain) or unpaired TCRs (just alpha or beta chain) from public and proprietary databases against 1933 unique targets. TAPIR demonstrates state-of-the-art performance when predicting TCR interactivity against common benchmark targets and is the first method to demonstrate strong performance when predicting TCR interactivity against novel targets, where no examples are provided in training. TAPIR is also capable of predicting TCR interaction against MHC alleles in the absence of target information. Leveraging these capabilities, we apply TAPIR to cancer patient TCR repertoires and identify and validate a novel and potent anti-cancer T-cell receptor against a shared cancer neoantigen target (PIK3CA H1047L). We further show how TAPIR, when extended with a generative neural network, is capable of directly designing T-cell receptor sequences that interact with a target of interest. Cold Spring Harbor Laboratory 2023-09-15 /pmc/articles/PMC10515850/ /pubmed/37745475 http://dx.doi.org/10.1101/2023.09.12.557285 Text en https://creativecommons.org/licenses/by-nd/4.0/This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, and only so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Fast, Ethan Dhar, Manjima Chen, Binbin TAPIR: a T-cell receptor language model for predicting rare and novel targets |
title | TAPIR: a T-cell receptor language model for predicting rare and novel targets |
title_full | TAPIR: a T-cell receptor language model for predicting rare and novel targets |
title_fullStr | TAPIR: a T-cell receptor language model for predicting rare and novel targets |
title_full_unstemmed | TAPIR: a T-cell receptor language model for predicting rare and novel targets |
title_short | TAPIR: a T-cell receptor language model for predicting rare and novel targets |
title_sort | tapir: a t-cell receptor language model for predicting rare and novel targets |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10515850/ https://www.ncbi.nlm.nih.gov/pubmed/37745475 http://dx.doi.org/10.1101/2023.09.12.557285 |
work_keys_str_mv | AT fastethan tapiratcellreceptorlanguagemodelforpredictingrareandnoveltargets AT dharmanjima tapiratcellreceptorlanguagemodelforpredictingrareandnoveltargets AT chenbinbin tapiratcellreceptorlanguagemodelforpredictingrareandnoveltargets |