Cargando…

Improved prediction of MHC-peptide binding using protein language models

Major histocompatibility complex Class I (MHC-I) molecules bind to peptides derived from intracellular antigens and present them on the surface of cells, allowing the immune system (T cells) to detect them. Elucidating the process of this presentation is essential for regulation and potential manipu...

Descripción completa

Detalles Bibliográficos
Autores principales: Hashemi, Nasser, Hao, Boran, Ignatov, Mikhail, Paschalidis, Ioannis Ch., Vakili, Pirooz, Vajda, Sandor, Kozakov, Dima
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10469926/
https://www.ncbi.nlm.nih.gov/pubmed/37663788
http://dx.doi.org/10.3389/fbinf.2023.1207380
_version_ 1785099556978950144
author Hashemi, Nasser
Hao, Boran
Ignatov, Mikhail
Paschalidis, Ioannis Ch.
Vakili, Pirooz
Vajda, Sandor
Kozakov, Dima
author_facet Hashemi, Nasser
Hao, Boran
Ignatov, Mikhail
Paschalidis, Ioannis Ch.
Vakili, Pirooz
Vajda, Sandor
Kozakov, Dima
author_sort Hashemi, Nasser
collection PubMed
description Major histocompatibility complex Class I (MHC-I) molecules bind to peptides derived from intracellular antigens and present them on the surface of cells, allowing the immune system (T cells) to detect them. Elucidating the process of this presentation is essential for regulation and potential manipulation of the cellular immune system. Predicting whether a given peptide binds to an MHC molecule is an important step in the above process and has motivated the introduction of many computational approaches to address this problem. NetMHCPan, a pan-specific model for predicting binding of peptides to any MHC molecule, is one of the most widely used methods which focuses on solving this binary classification problem using shallow neural networks. The recent successful results of Deep Learning (DL) methods, especially Natural Language Processing (NLP-based) pretrained models in various applications, including protein structure determination, motivated us to explore their use in this problem. Specifically, we consider the application of deep learning models pretrained on large datasets of protein sequences to predict MHC Class I-peptide binding. Using the standard performance metrics in this area, and the same training and test sets, we show that our models outperform NetMHCpan4.1, currently considered as the-state-of-the-art.
format Online
Article
Text
id pubmed-10469926
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-104699262023-09-01 Improved prediction of MHC-peptide binding using protein language models Hashemi, Nasser Hao, Boran Ignatov, Mikhail Paschalidis, Ioannis Ch. Vakili, Pirooz Vajda, Sandor Kozakov, Dima Front Bioinform Bioinformatics Major histocompatibility complex Class I (MHC-I) molecules bind to peptides derived from intracellular antigens and present them on the surface of cells, allowing the immune system (T cells) to detect them. Elucidating the process of this presentation is essential for regulation and potential manipulation of the cellular immune system. Predicting whether a given peptide binds to an MHC molecule is an important step in the above process and has motivated the introduction of many computational approaches to address this problem. NetMHCPan, a pan-specific model for predicting binding of peptides to any MHC molecule, is one of the most widely used methods which focuses on solving this binary classification problem using shallow neural networks. The recent successful results of Deep Learning (DL) methods, especially Natural Language Processing (NLP-based) pretrained models in various applications, including protein structure determination, motivated us to explore their use in this problem. Specifically, we consider the application of deep learning models pretrained on large datasets of protein sequences to predict MHC Class I-peptide binding. Using the standard performance metrics in this area, and the same training and test sets, we show that our models outperform NetMHCpan4.1, currently considered as the-state-of-the-art. Frontiers Media S.A. 2023-08-17 /pmc/articles/PMC10469926/ /pubmed/37663788 http://dx.doi.org/10.3389/fbinf.2023.1207380 Text en Copyright © 2023 Hashemi, Hao, Ignatov, Paschalidis, Vakili, Vajda and Kozakov. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioinformatics
Hashemi, Nasser
Hao, Boran
Ignatov, Mikhail
Paschalidis, Ioannis Ch.
Vakili, Pirooz
Vajda, Sandor
Kozakov, Dima
Improved prediction of MHC-peptide binding using protein language models
title Improved prediction of MHC-peptide binding using protein language models
title_full Improved prediction of MHC-peptide binding using protein language models
title_fullStr Improved prediction of MHC-peptide binding using protein language models
title_full_unstemmed Improved prediction of MHC-peptide binding using protein language models
title_short Improved prediction of MHC-peptide binding using protein language models
title_sort improved prediction of mhc-peptide binding using protein language models
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10469926/
https://www.ncbi.nlm.nih.gov/pubmed/37663788
http://dx.doi.org/10.3389/fbinf.2023.1207380
work_keys_str_mv AT hasheminasser improvedpredictionofmhcpeptidebindingusingproteinlanguagemodels
AT haoboran improvedpredictionofmhcpeptidebindingusingproteinlanguagemodels
AT ignatovmikhail improvedpredictionofmhcpeptidebindingusingproteinlanguagemodels
AT paschalidisioannisch improvedpredictionofmhcpeptidebindingusingproteinlanguagemodels
AT vakilipirooz improvedpredictionofmhcpeptidebindingusingproteinlanguagemodels
AT vajdasandor improvedpredictionofmhcpeptidebindingusingproteinlanguagemodels
AT kozakovdima improvedpredictionofmhcpeptidebindingusingproteinlanguagemodels