Cargando…

RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles

The recent increase of immunopeptidomics data, obtained by mass spectrometry or binding assays, opens up possibilities for investigating endogenous antigen presentation by the highly polymorphic human leukocyte antigen class I (HLA-I) protein. State-of-the-art methods predict with high accuracy pres...

Descripción completa

Detalles Bibliográficos
Autores principales: Bravi, Barbara, Tubiana, Jérôme, Cocco, Simona, Monasson, Rémi, Mora, Thierry, Walczak, Aleksandra M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cell Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7895905/
https://www.ncbi.nlm.nih.gov/pubmed/33338400
http://dx.doi.org/10.1016/j.cels.2020.11.005
_version_ 1783653450151624704
author Bravi, Barbara
Tubiana, Jérôme
Cocco, Simona
Monasson, Rémi
Mora, Thierry
Walczak, Aleksandra M.
author_facet Bravi, Barbara
Tubiana, Jérôme
Cocco, Simona
Monasson, Rémi
Mora, Thierry
Walczak, Aleksandra M.
author_sort Bravi, Barbara
collection PubMed
description The recent increase of immunopeptidomics data, obtained by mass spectrometry or binding assays, opens up possibilities for investigating endogenous antigen presentation by the highly polymorphic human leukocyte antigen class I (HLA-I) protein. State-of-the-art methods predict with high accuracy presentation by HLA alleles that are well represented in databases at the time of release but have a poorer performance for rarer and less characterized alleles. Here, we introduce a method based on Restricted Boltzmann Machines (RBMs) for prediction of antigens presented on the Major Histocompatibility Complex (MHC) encoded by HLA genes—RBM-MHC. RBM-MHC can be trained on custom and newly available samples with no or a small amount of HLA annotations. RBM-MHC ensures improved predictions for rare alleles and matches state-of-the-art performance for well-characterized alleles while being less data demanding. RBM-MHC is shown to be a flexible and easily interpretable method that can be used as a predictor of cancer neoantigens and viral epitopes, as a tool for feature discovery, and to reconstruct peptide motifs presented on specific HLA molecules.
format Online
Article
Text
id pubmed-7895905
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cell Press
record_format MEDLINE/PubMed
spelling pubmed-78959052021-03-02 RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles Bravi, Barbara Tubiana, Jérôme Cocco, Simona Monasson, Rémi Mora, Thierry Walczak, Aleksandra M. Cell Syst Brief Report The recent increase of immunopeptidomics data, obtained by mass spectrometry or binding assays, opens up possibilities for investigating endogenous antigen presentation by the highly polymorphic human leukocyte antigen class I (HLA-I) protein. State-of-the-art methods predict with high accuracy presentation by HLA alleles that are well represented in databases at the time of release but have a poorer performance for rarer and less characterized alleles. Here, we introduce a method based on Restricted Boltzmann Machines (RBMs) for prediction of antigens presented on the Major Histocompatibility Complex (MHC) encoded by HLA genes—RBM-MHC. RBM-MHC can be trained on custom and newly available samples with no or a small amount of HLA annotations. RBM-MHC ensures improved predictions for rare alleles and matches state-of-the-art performance for well-characterized alleles while being less data demanding. RBM-MHC is shown to be a flexible and easily interpretable method that can be used as a predictor of cancer neoantigens and viral epitopes, as a tool for feature discovery, and to reconstruct peptide motifs presented on specific HLA molecules. Cell Press 2021-02-17 /pmc/articles/PMC7895905/ /pubmed/33338400 http://dx.doi.org/10.1016/j.cels.2020.11.005 Text en © 2020 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Brief Report
Bravi, Barbara
Tubiana, Jérôme
Cocco, Simona
Monasson, Rémi
Mora, Thierry
Walczak, Aleksandra M.
RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles
title RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles
title_full RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles
title_fullStr RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles
title_full_unstemmed RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles
title_short RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles
title_sort rbm-mhc: a semi-supervised machine-learning method for sample-specific prediction of antigen presentation by hla-i alleles
topic Brief Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7895905/
https://www.ncbi.nlm.nih.gov/pubmed/33338400
http://dx.doi.org/10.1016/j.cels.2020.11.005
work_keys_str_mv AT bravibarbara rbmmhcasemisupervisedmachinelearningmethodforsamplespecificpredictionofantigenpresentationbyhlaialleles
AT tubianajerome rbmmhcasemisupervisedmachinelearningmethodforsamplespecificpredictionofantigenpresentationbyhlaialleles
AT coccosimona rbmmhcasemisupervisedmachinelearningmethodforsamplespecificpredictionofantigenpresentationbyhlaialleles
AT monassonremi rbmmhcasemisupervisedmachinelearningmethodforsamplespecificpredictionofantigenpresentationbyhlaialleles
AT morathierry rbmmhcasemisupervisedmachinelearningmethodforsamplespecificpredictionofantigenpresentationbyhlaialleles
AT walczakaleksandram rbmmhcasemisupervisedmachinelearningmethodforsamplespecificpredictionofantigenpresentationbyhlaialleles