BERTMHC: improved MHC–peptide class II interaction prediction with transformer and multiple instance learning
MOTIVATION: Increasingly comprehensive characterization of cancer-associated genetic alterations has paved the way for the development of highly specific therapeutic vaccines. Predicting precisely the binding and presentation of peptides to major histocompatibility complex (MHC) alleles is an import...
Main Authors: | Cheng, Jun; Bendjama, Kaïdre; Rittner, Karola; Malone, Brandon
---|---|
Format: | Online Article Text
Language: | English
Published: | Oxford University Press, 2021
Subjects: |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9502151/ https://www.ncbi.nlm.nih.gov/pubmed/34096999 http://dx.doi.org/10.1093/bioinformatics/btab422
_version_ | 1784795635919093760 |
---|---|
author | Cheng, Jun; Bendjama, Kaïdre; Rittner, Karola; Malone, Brandon
author_facet | Cheng, Jun; Bendjama, Kaïdre; Rittner, Karola; Malone, Brandon
author_sort | Cheng, Jun |
collection | PubMed |
description | MOTIVATION: Increasingly comprehensive characterization of cancer-associated genetic alterations has paved the way for the development of highly specific therapeutic vaccines. Predicting precisely the binding and presentation of peptides to major histocompatibility complex (MHC) alleles is an important step toward such therapies. Recent data suggest that presentation of both class I and II epitopes are critical for the induction of a sustained effective immune response. However, the prediction performance for MHC class II has been limited compared to class I. RESULTS: We present a transformer neural network model which leverages self-supervised pretraining from a large corpus of protein sequences. We also propose a multiple instance learning (MIL) framework to deconvolve mass spectrometry data where multiple potential MHC alleles may have presented each peptide. We show that pretraining boosted the performance for these tasks. Combining pretraining and the novel MIL approach, our model outperforms state-of-the-art models based on peptide and MHC sequence only for both binding and cell surface presentation predictions. AVAILABILITY AND IMPLEMENTATION: Our source code is available at https://github.com/s6juncheng/BERTMHC under a noncommercial license. A webserver is available at https://bertmhc.privacy.nlehd.de/ SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-9502151 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-9502151 2022-09-26 BERTMHC: improved MHC–peptide class II interaction prediction with transformer and multiple instance learning. Cheng, Jun; Bendjama, Kaïdre; Rittner, Karola; Malone, Brandon. Bioinformatics, Original Papers. Oxford University Press 2021-06-07 /pmc/articles/PMC9502151/ /pubmed/34096999 http://dx.doi.org/10.1093/bioinformatics/btab422 Text en © The Author(s) 2021. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle | Original Papers Cheng, Jun Bendjama, Kaïdre Rittner, Karola Malone, Brandon BERTMHC: improved MHC–peptide class II interaction prediction with transformer and multiple instance learning |
title | BERTMHC: improved MHC–peptide class II interaction prediction with transformer and multiple instance learning |
title_full | BERTMHC: improved MHC–peptide class II interaction prediction with transformer and multiple instance learning |
title_fullStr | BERTMHC: improved MHC–peptide class II interaction prediction with transformer and multiple instance learning |
title_full_unstemmed | BERTMHC: improved MHC–peptide class II interaction prediction with transformer and multiple instance learning |
title_short | BERTMHC: improved MHC–peptide class II interaction prediction with transformer and multiple instance learning |
title_sort | bertmhc: improved mhc–peptide class ii interaction prediction with transformer and multiple instance learning |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9502151/ https://www.ncbi.nlm.nih.gov/pubmed/34096999 http://dx.doi.org/10.1093/bioinformatics/btab422 |
work_keys_str_mv | AT chengjun bertmhcimprovedmhcpeptideclassiiinteractionpredictionwithtransformerandmultipleinstancelearning AT bendjamakaidre bertmhcimprovedmhcpeptideclassiiinteractionpredictionwithtransformerandmultipleinstancelearning AT rittnerkarola bertmhcimprovedmhcpeptideclassiiinteractionpredictionwithtransformerandmultipleinstancelearning AT malonebrandon bertmhcimprovedmhcpeptideclassiiinteractionpredictionwithtransformerandmultipleinstancelearning |
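The description field above summarizes the approach only at a high level. To make the multiple instance learning (MIL) idea it mentions concrete (each mass-spectrometry peptide is paired with every candidate MHC allele of the sample, and the bag-level label records whether the peptide was presented), here is a minimal, illustrative PyTorch sketch. It is not the authors' BERTMHC implementation (that code is at https://github.com/s6juncheng/BERTMHC); the class name, the max-pooling choice and the stub feed-forward scorer are assumptions made for illustration only.

```python
import torch
import torch.nn as nn


class MaxPoolMIL(nn.Module):
    """Illustrative MIL head (hypothetical, not the BERTMHC code).

    A "bag" holds the pair features of one eluted peptide against every
    candidate MHC class II allele of the sample. Max pooling over the
    per-allele scores encodes the standard MIL assumption that a presented
    (positive) peptide was presented by at least one of the alleles.
    """

    def __init__(self, pair_scorer: nn.Module):
        super().__init__()
        # pair_scorer maps one (peptide, allele) feature vector to a logit.
        self.pair_scorer = pair_scorer

    def forward(self, pair_features: torch.Tensor) -> torch.Tensor:
        # pair_features: (batch, n_alleles, feature_dim)
        batch, n_alleles, dim = pair_features.shape
        logits = self.pair_scorer(pair_features.reshape(-1, dim))  # (batch * n_alleles, 1)
        logits = logits.reshape(batch, n_alleles)                  # per-allele scores
        bag_logits, _ = logits.max(dim=1)                          # bag score = best allele
        return bag_logits


if __name__ == "__main__":
    # Toy usage: 4 peptides, 6 candidate alleles each, 32-dim pair features
    # standing in for a transformer encoding of the peptide-allele pair.
    scorer = nn.Sequential(nn.Linear(32, 16), nn.ReLU(), nn.Linear(16, 1))
    model = MaxPoolMIL(scorer)
    bags = torch.randn(4, 6, 32)
    labels = torch.tensor([1.0, 0.0, 1.0, 0.0])  # peptide observed by MS or not
    loss = nn.BCEWithLogitsLoss()(model(bags), labels)
    loss.backward()
    print(f"toy loss: {loss.item():.4f}")
```

BERTMHC itself couples this kind of bag-level training with a transformer encoder pretrained on a large protein-sequence corpus; the pooling operator, scorer and loss above are generic stand-ins used only to show the MIL mechanics.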