Cargando…
RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles
The recent increase of immunopeptidomics data, obtained by mass spectrometry or binding assays, opens up possibilities for investigating endogenous antigen presentation by the highly polymorphic human leukocyte antigen class I (HLA-I) protein. State-of-the-art methods predict with high accuracy pres...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cell Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7895905/ https://www.ncbi.nlm.nih.gov/pubmed/33338400 http://dx.doi.org/10.1016/j.cels.2020.11.005 |
_version_ | 1783653450151624704 |
---|---|
author | Bravi, Barbara Tubiana, Jérôme Cocco, Simona Monasson, Rémi Mora, Thierry Walczak, Aleksandra M. |
author_facet | Bravi, Barbara Tubiana, Jérôme Cocco, Simona Monasson, Rémi Mora, Thierry Walczak, Aleksandra M. |
author_sort | Bravi, Barbara |
collection | PubMed |
description | The recent increase of immunopeptidomics data, obtained by mass spectrometry or binding assays, opens up possibilities for investigating endogenous antigen presentation by the highly polymorphic human leukocyte antigen class I (HLA-I) protein. State-of-the-art methods predict with high accuracy presentation by HLA alleles that are well represented in databases at the time of release but have a poorer performance for rarer and less characterized alleles. Here, we introduce a method based on Restricted Boltzmann Machines (RBMs) for prediction of antigens presented on the Major Histocompatibility Complex (MHC) encoded by HLA genes—RBM-MHC. RBM-MHC can be trained on custom and newly available samples with no or a small amount of HLA annotations. RBM-MHC ensures improved predictions for rare alleles and matches state-of-the-art performance for well-characterized alleles while being less data demanding. RBM-MHC is shown to be a flexible and easily interpretable method that can be used as a predictor of cancer neoantigens and viral epitopes, as a tool for feature discovery, and to reconstruct peptide motifs presented on specific HLA molecules. |
format | Online Article Text |
id | pubmed-7895905 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Cell Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-78959052021-03-02 RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles Bravi, Barbara Tubiana, Jérôme Cocco, Simona Monasson, Rémi Mora, Thierry Walczak, Aleksandra M. Cell Syst Brief Report The recent increase of immunopeptidomics data, obtained by mass spectrometry or binding assays, opens up possibilities for investigating endogenous antigen presentation by the highly polymorphic human leukocyte antigen class I (HLA-I) protein. State-of-the-art methods predict with high accuracy presentation by HLA alleles that are well represented in databases at the time of release but have a poorer performance for rarer and less characterized alleles. Here, we introduce a method based on Restricted Boltzmann Machines (RBMs) for prediction of antigens presented on the Major Histocompatibility Complex (MHC) encoded by HLA genes—RBM-MHC. RBM-MHC can be trained on custom and newly available samples with no or a small amount of HLA annotations. RBM-MHC ensures improved predictions for rare alleles and matches state-of-the-art performance for well-characterized alleles while being less data demanding. RBM-MHC is shown to be a flexible and easily interpretable method that can be used as a predictor of cancer neoantigens and viral epitopes, as a tool for feature discovery, and to reconstruct peptide motifs presented on specific HLA molecules. Cell Press 2021-02-17 /pmc/articles/PMC7895905/ /pubmed/33338400 http://dx.doi.org/10.1016/j.cels.2020.11.005 Text en © 2020 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Brief Report Bravi, Barbara Tubiana, Jérôme Cocco, Simona Monasson, Rémi Mora, Thierry Walczak, Aleksandra M. RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles |
title | RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles |
title_full | RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles |
title_fullStr | RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles |
title_full_unstemmed | RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles |
title_short | RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles |
title_sort | rbm-mhc: a semi-supervised machine-learning method for sample-specific prediction of antigen presentation by hla-i alleles |
topic | Brief Report |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7895905/ https://www.ncbi.nlm.nih.gov/pubmed/33338400 http://dx.doi.org/10.1016/j.cels.2020.11.005 |
work_keys_str_mv | AT bravibarbara rbmmhcasemisupervisedmachinelearningmethodforsamplespecificpredictionofantigenpresentationbyhlaialleles AT tubianajerome rbmmhcasemisupervisedmachinelearningmethodforsamplespecificpredictionofantigenpresentationbyhlaialleles AT coccosimona rbmmhcasemisupervisedmachinelearningmethodforsamplespecificpredictionofantigenpresentationbyhlaialleles AT monassonremi rbmmhcasemisupervisedmachinelearningmethodforsamplespecificpredictionofantigenpresentationbyhlaialleles AT morathierry rbmmhcasemisupervisedmachinelearningmethodforsamplespecificpredictionofantigenpresentationbyhlaialleles AT walczakaleksandram rbmmhcasemisupervisedmachinelearningmethodforsamplespecificpredictionofantigenpresentationbyhlaialleles |