Cargando…

Evaluating performance of existing computational models in predicting CD8+ T cell pathogenic epitopes and cancer neoantigens

T cell recognition of a cognate peptide–major histocompatibility complex (pMHC) presented on the surface of infected or malignant cells is of the utmost importance for mediating robust and long-term immune responses. Accurate predictions of cognate pMHC targets for T cell receptors would greatly fac...

Descripción completa

Detalles Bibliográficos
Autores principales: Buckley, Paul R, Lee, Chloe H, Ma, Ruichong, Woodhouse, Isaac, Woo, Jeongmin, Tsvetkov, Vasily O, Shcherbinin, Dmitrii S, Antanaviciute, Agne, Shughay, Mikhail, Rei, Margarida, Simmons, Alison, Koohy, Hashem
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9116217/
https://www.ncbi.nlm.nih.gov/pubmed/35471658
http://dx.doi.org/10.1093/bib/bbac141
_version_ 1784710071827038208
author Buckley, Paul R
Lee, Chloe H
Ma, Ruichong
Woodhouse, Isaac
Woo, Jeongmin
Tsvetkov, Vasily O
Shcherbinin, Dmitrii S
Antanaviciute, Agne
Shughay, Mikhail
Rei, Margarida
Simmons, Alison
Koohy, Hashem
author_facet Buckley, Paul R
Lee, Chloe H
Ma, Ruichong
Woodhouse, Isaac
Woo, Jeongmin
Tsvetkov, Vasily O
Shcherbinin, Dmitrii S
Antanaviciute, Agne
Shughay, Mikhail
Rei, Margarida
Simmons, Alison
Koohy, Hashem
author_sort Buckley, Paul R
collection PubMed
description T cell recognition of a cognate peptide–major histocompatibility complex (pMHC) presented on the surface of infected or malignant cells is of the utmost importance for mediating robust and long-term immune responses. Accurate predictions of cognate pMHC targets for T cell receptors would greatly facilitate identification of vaccine targets for both pathogenic diseases and personalized cancer immunotherapies. Predicting immunogenic peptides therefore has been at the center of intensive research for the past decades but has proven challenging. Although numerous models have been proposed, performance of these models has not been systematically evaluated and their success rate in predicting epitopes in the context of human pathology has not been measured and compared. In this study, we evaluated the performance of several publicly available models, in identifying immunogenic CD8+ T cell targets in the context of pathogens and cancers. We found that for predicting immunogenic peptides from an emerging virus such as severe acute respiratory syndrome coronavirus 2, none of the models perform substantially better than random or offer considerable improvement beyond HLA ligand prediction. We also observed suboptimal performance for predicting cancer neoantigens. Through investigation of potential factors associated with ill performance of models, we highlight several data- and model-associated issues. In particular, we observed that cross-HLA variation in the distribution of immunogenic and non-immunogenic peptides in the training data of the models seems to substantially confound the predictions. We additionally compared key parameters associated with immunogenicity between pathogenic peptides and cancer neoantigens and observed evidence for differences in the thresholds of binding affinity and stability, which suggested the need to modulate different features in identifying immunogenic pathogen versus cancer peptides. Overall, we demonstrate that accurate and reliable predictions of immunogenic CD8+ T cell targets remain unsolved; thus, we hope our work will guide users and model developers regarding potential pitfalls and unsettled questions in existing immunogenicity predictors.
format Online
Article
Text
id pubmed-9116217
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-91162172022-05-19 Evaluating performance of existing computational models in predicting CD8+ T cell pathogenic epitopes and cancer neoantigens Buckley, Paul R Lee, Chloe H Ma, Ruichong Woodhouse, Isaac Woo, Jeongmin Tsvetkov, Vasily O Shcherbinin, Dmitrii S Antanaviciute, Agne Shughay, Mikhail Rei, Margarida Simmons, Alison Koohy, Hashem Brief Bioinform Problem Solving Protocol T cell recognition of a cognate peptide–major histocompatibility complex (pMHC) presented on the surface of infected or malignant cells is of the utmost importance for mediating robust and long-term immune responses. Accurate predictions of cognate pMHC targets for T cell receptors would greatly facilitate identification of vaccine targets for both pathogenic diseases and personalized cancer immunotherapies. Predicting immunogenic peptides therefore has been at the center of intensive research for the past decades but has proven challenging. Although numerous models have been proposed, performance of these models has not been systematically evaluated and their success rate in predicting epitopes in the context of human pathology has not been measured and compared. In this study, we evaluated the performance of several publicly available models, in identifying immunogenic CD8+ T cell targets in the context of pathogens and cancers. We found that for predicting immunogenic peptides from an emerging virus such as severe acute respiratory syndrome coronavirus 2, none of the models perform substantially better than random or offer considerable improvement beyond HLA ligand prediction. We also observed suboptimal performance for predicting cancer neoantigens. Through investigation of potential factors associated with ill performance of models, we highlight several data- and model-associated issues. In particular, we observed that cross-HLA variation in the distribution of immunogenic and non-immunogenic peptides in the training data of the models seems to substantially confound the predictions. We additionally compared key parameters associated with immunogenicity between pathogenic peptides and cancer neoantigens and observed evidence for differences in the thresholds of binding affinity and stability, which suggested the need to modulate different features in identifying immunogenic pathogen versus cancer peptides. Overall, we demonstrate that accurate and reliable predictions of immunogenic CD8+ T cell targets remain unsolved; thus, we hope our work will guide users and model developers regarding potential pitfalls and unsettled questions in existing immunogenicity predictors. Oxford University Press 2022-04-25 /pmc/articles/PMC9116217/ /pubmed/35471658 http://dx.doi.org/10.1093/bib/bbac141 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Problem Solving Protocol
Buckley, Paul R
Lee, Chloe H
Ma, Ruichong
Woodhouse, Isaac
Woo, Jeongmin
Tsvetkov, Vasily O
Shcherbinin, Dmitrii S
Antanaviciute, Agne
Shughay, Mikhail
Rei, Margarida
Simmons, Alison
Koohy, Hashem
Evaluating performance of existing computational models in predicting CD8+ T cell pathogenic epitopes and cancer neoantigens
title Evaluating performance of existing computational models in predicting CD8+ T cell pathogenic epitopes and cancer neoantigens
title_full Evaluating performance of existing computational models in predicting CD8+ T cell pathogenic epitopes and cancer neoantigens
title_fullStr Evaluating performance of existing computational models in predicting CD8+ T cell pathogenic epitopes and cancer neoantigens
title_full_unstemmed Evaluating performance of existing computational models in predicting CD8+ T cell pathogenic epitopes and cancer neoantigens
title_short Evaluating performance of existing computational models in predicting CD8+ T cell pathogenic epitopes and cancer neoantigens
title_sort evaluating performance of existing computational models in predicting cd8+ t cell pathogenic epitopes and cancer neoantigens
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9116217/
https://www.ncbi.nlm.nih.gov/pubmed/35471658
http://dx.doi.org/10.1093/bib/bbac141
work_keys_str_mv AT buckleypaulr evaluatingperformanceofexistingcomputationalmodelsinpredictingcd8tcellpathogenicepitopesandcancerneoantigens
AT leechloeh evaluatingperformanceofexistingcomputationalmodelsinpredictingcd8tcellpathogenicepitopesandcancerneoantigens
AT maruichong evaluatingperformanceofexistingcomputationalmodelsinpredictingcd8tcellpathogenicepitopesandcancerneoantigens
AT woodhouseisaac evaluatingperformanceofexistingcomputationalmodelsinpredictingcd8tcellpathogenicepitopesandcancerneoantigens
AT woojeongmin evaluatingperformanceofexistingcomputationalmodelsinpredictingcd8tcellpathogenicepitopesandcancerneoantigens
AT tsvetkovvasilyo evaluatingperformanceofexistingcomputationalmodelsinpredictingcd8tcellpathogenicepitopesandcancerneoantigens
AT shcherbinindmitriis evaluatingperformanceofexistingcomputationalmodelsinpredictingcd8tcellpathogenicepitopesandcancerneoantigens
AT antanaviciuteagne evaluatingperformanceofexistingcomputationalmodelsinpredictingcd8tcellpathogenicepitopesandcancerneoantigens
AT shughaymikhail evaluatingperformanceofexistingcomputationalmodelsinpredictingcd8tcellpathogenicepitopesandcancerneoantigens
AT reimargarida evaluatingperformanceofexistingcomputationalmodelsinpredictingcd8tcellpathogenicepitopesandcancerneoantigens
AT simmonsalison evaluatingperformanceofexistingcomputationalmodelsinpredictingcd8tcellpathogenicepitopesandcancerneoantigens
AT koohyhashem evaluatingperformanceofexistingcomputationalmodelsinpredictingcd8tcellpathogenicepitopesandcancerneoantigens