Cargando…

NetTCR-2.1: Lessons and guidance on how to develop models for TCR specificity predictions

T cell receptors (TCR) define the specificity of T cells and are responsible for their interaction with peptide antigen targets presented in complex with major histocompatibility complex (MHC) molecules. Understanding the rules underlying this interaction hence forms the foundation for our understan...

Descripción completa

Detalles Bibliográficos
Autores principales:	Montemurro, Alessandro, Jessen, Leon Eyrich, Nielsen, Morten
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2022
Materias:	Immunology
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9763291/ https://www.ncbi.nlm.nih.gov/pubmed/36561755 http://dx.doi.org/10.3389/fimmu.2022.1055151

_version_	1784853024362987520
author	Montemurro, Alessandro Jessen, Leon Eyrich Nielsen, Morten
author_facet	Montemurro, Alessandro Jessen, Leon Eyrich Nielsen, Morten
author_sort	Montemurro, Alessandro
collection	PubMed
description	T cell receptors (TCR) define the specificity of T cells and are responsible for their interaction with peptide antigen targets presented in complex with major histocompatibility complex (MHC) molecules. Understanding the rules underlying this interaction hence forms the foundation for our understanding of basic adaptive immunology. Over the last decade, efforts have been dedicated to developing assays for high throughput identification of peptide-specific TCRs. Based on such data, several computational methods have been proposed for predicting the TCR-pMHC interaction. The general conclusion from these studies is that the prediction of TCR interactions with MHC-peptide complexes remains highly challenging. Several reasons form the basis for this including scarcity and quality of data, and ill-defined modeling objectives imposed by the high redundancy of the available data. In this work, we propose a framework for dealing with this redundancy, allowing us to address essential questions related to the modeling of TCR specificity including the use of peptide- versus pan-specific models, how to best define negative data, and the performance impact of integrating of CDR1 and 2 loops. Further, we illustrate how and why it is strongly recommended to include simple similarity-based modeling approaches when validating an improved predictive power of machine learning models, and that such validation should include a performance evaluation as a function of “distance” to the training data, to quantify the potential for generalization of the proposed model. The conclusion of the work is that, given current data, TCR specificity is best modeled using peptide-specific approaches, integrating information from all 6 CDR loops, and with negative data constructed from a combination of true and mislabeled negatives. Comparing such machine learning models to similarity-based approaches demonstrated an increased performance gain of the former as the “distance” to the training data was increased; thus demonstrating an improved generalization ability of the machine learning-based approaches. We believe these results demonstrate that the outlined modeling framework and proposed evaluation strategy form a solid basis for investigating the modeling of TCR specificities and that adhering to such a framework will allow for faster progress within the field. The final devolved model, NetTCR-2.1, is available at https://services.healthtech.dtu.dk/service.php?NetTCR-2.1.
format	Online Article Text
id	pubmed-9763291
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-97632912022-12-21 NetTCR-2.1: Lessons and guidance on how to develop models for TCR specificity predictions Montemurro, Alessandro Jessen, Leon Eyrich Nielsen, Morten Front Immunol Immunology T cell receptors (TCR) define the specificity of T cells and are responsible for their interaction with peptide antigen targets presented in complex with major histocompatibility complex (MHC) molecules. Understanding the rules underlying this interaction hence forms the foundation for our understanding of basic adaptive immunology. Over the last decade, efforts have been dedicated to developing assays for high throughput identification of peptide-specific TCRs. Based on such data, several computational methods have been proposed for predicting the TCR-pMHC interaction. The general conclusion from these studies is that the prediction of TCR interactions with MHC-peptide complexes remains highly challenging. Several reasons form the basis for this including scarcity and quality of data, and ill-defined modeling objectives imposed by the high redundancy of the available data. In this work, we propose a framework for dealing with this redundancy, allowing us to address essential questions related to the modeling of TCR specificity including the use of peptide- versus pan-specific models, how to best define negative data, and the performance impact of integrating of CDR1 and 2 loops. Further, we illustrate how and why it is strongly recommended to include simple similarity-based modeling approaches when validating an improved predictive power of machine learning models, and that such validation should include a performance evaluation as a function of “distance” to the training data, to quantify the potential for generalization of the proposed model. The conclusion of the work is that, given current data, TCR specificity is best modeled using peptide-specific approaches, integrating information from all 6 CDR loops, and with negative data constructed from a combination of true and mislabeled negatives. Comparing such machine learning models to similarity-based approaches demonstrated an increased performance gain of the former as the “distance” to the training data was increased; thus demonstrating an improved generalization ability of the machine learning-based approaches. We believe these results demonstrate that the outlined modeling framework and proposed evaluation strategy form a solid basis for investigating the modeling of TCR specificities and that adhering to such a framework will allow for faster progress within the field. The final devolved model, NetTCR-2.1, is available at https://services.healthtech.dtu.dk/service.php?NetTCR-2.1. Frontiers Media S.A. 2022-12-06 /pmc/articles/PMC9763291/ /pubmed/36561755 http://dx.doi.org/10.3389/fimmu.2022.1055151 Text en Copyright © 2022 Montemurro, Jessen and Nielsen https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Immunology Montemurro, Alessandro Jessen, Leon Eyrich Nielsen, Morten NetTCR-2.1: Lessons and guidance on how to develop models for TCR specificity predictions
title	NetTCR-2.1: Lessons and guidance on how to develop models for TCR specificity predictions
title_full	NetTCR-2.1: Lessons and guidance on how to develop models for TCR specificity predictions
title_fullStr	NetTCR-2.1: Lessons and guidance on how to develop models for TCR specificity predictions
title_full_unstemmed	NetTCR-2.1: Lessons and guidance on how to develop models for TCR specificity predictions
title_short	NetTCR-2.1: Lessons and guidance on how to develop models for TCR specificity predictions
title_sort	nettcr-2.1: lessons and guidance on how to develop models for tcr specificity predictions
topic	Immunology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9763291/ https://www.ncbi.nlm.nih.gov/pubmed/36561755 http://dx.doi.org/10.3389/fimmu.2022.1055151
work_keys_str_mv	AT montemurroalessandro nettcr21lessonsandguidanceonhowtodevelopmodelsfortcrspecificitypredictions AT jessenleoneyrich nettcr21lessonsandguidanceonhowtodevelopmodelsfortcrspecificitypredictions AT nielsenmorten nettcr21lessonsandguidanceonhowtodevelopmodelsfortcrspecificitypredictions

NetTCR-2.1: Lessons and guidance on how to develop models for TCR specificity predictions

Ejemplares similares