Cargando…

MHCAttnNet: predicting MHC-peptide bindings for MHC alleles classes I and II using an attention-based deep neural model

MOTIVATION: Accurate prediction of binding between a major histocompatibility complex (MHC) allele and a peptide plays a major role in the synthesis of personalized cancer vaccines. The immune system struggles to distinguish between a cancerous and a healthy cell. In a patient suffering from cancer...

Descripción completa

Detalles Bibliográficos
Autores principales: Venkatesh, Gopalakrishnan, Grover, Aayush, Srinivasaraghavan, G, Rao, Shrisha
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355292/
https://www.ncbi.nlm.nih.gov/pubmed/32657386
http://dx.doi.org/10.1093/bioinformatics/btaa479
_version_ 1783558246252937216
author Venkatesh, Gopalakrishnan
Grover, Aayush
Srinivasaraghavan, G
Rao, Shrisha
author_facet Venkatesh, Gopalakrishnan
Grover, Aayush
Srinivasaraghavan, G
Rao, Shrisha
author_sort Venkatesh, Gopalakrishnan
collection PubMed
description MOTIVATION: Accurate prediction of binding between a major histocompatibility complex (MHC) allele and a peptide plays a major role in the synthesis of personalized cancer vaccines. The immune system struggles to distinguish between a cancerous and a healthy cell. In a patient suffering from cancer who has a particular MHC allele, only those peptides that bind with the MHC allele with high affinity, help the immune system recognize the cancerous cells. RESULTS: MHCAttnNet is a deep neural model that uses an attention mechanism to capture the relevant subsequences of the amino acid sequences of peptides and MHC alleles. It then uses this to accurately predict the MHC-peptide binding. MHCAttnNet achieves an AUC-PRC score of 94.18% with 161 class I MHC alleles, which outperforms the state-of-the-art models for this task. MHCAttnNet also achieves a better F1-score in comparison to the state-of-the-art models while covering a larger number of class II MHC alleles. The attention mechanism used by MHCAttnNet provides a heatmap over the amino acids thus indicating the important subsequences present in the amino acid sequence. This approach also allows us to focus on a much smaller number of relevant trigrams corresponding to the amino acid sequence of an MHC allele, from 9251 possible trigrams to about 258. This significantly reduces the number of amino acid subsequences that need to be clinically tested. AVAILABILITY AND IMPLEMENTATION: The data and source code are available at https://github.com/gopuvenkat/MHCAttnNet.
format Online
Article
Text
id pubmed-7355292
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-73552922020-07-16 MHCAttnNet: predicting MHC-peptide bindings for MHC alleles classes I and II using an attention-based deep neural model Venkatesh, Gopalakrishnan Grover, Aayush Srinivasaraghavan, G Rao, Shrisha Bioinformatics Studies of Phenotypes and Clinical Applications MOTIVATION: Accurate prediction of binding between a major histocompatibility complex (MHC) allele and a peptide plays a major role in the synthesis of personalized cancer vaccines. The immune system struggles to distinguish between a cancerous and a healthy cell. In a patient suffering from cancer who has a particular MHC allele, only those peptides that bind with the MHC allele with high affinity, help the immune system recognize the cancerous cells. RESULTS: MHCAttnNet is a deep neural model that uses an attention mechanism to capture the relevant subsequences of the amino acid sequences of peptides and MHC alleles. It then uses this to accurately predict the MHC-peptide binding. MHCAttnNet achieves an AUC-PRC score of 94.18% with 161 class I MHC alleles, which outperforms the state-of-the-art models for this task. MHCAttnNet also achieves a better F1-score in comparison to the state-of-the-art models while covering a larger number of class II MHC alleles. The attention mechanism used by MHCAttnNet provides a heatmap over the amino acids thus indicating the important subsequences present in the amino acid sequence. This approach also allows us to focus on a much smaller number of relevant trigrams corresponding to the amino acid sequence of an MHC allele, from 9251 possible trigrams to about 258. This significantly reduces the number of amino acid subsequences that need to be clinically tested. AVAILABILITY AND IMPLEMENTATION: The data and source code are available at https://github.com/gopuvenkat/MHCAttnNet. Oxford University Press 2020-07 2020-07-13 /pmc/articles/PMC7355292/ /pubmed/32657386 http://dx.doi.org/10.1093/bioinformatics/btaa479 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Studies of Phenotypes and Clinical Applications
Venkatesh, Gopalakrishnan
Grover, Aayush
Srinivasaraghavan, G
Rao, Shrisha
MHCAttnNet: predicting MHC-peptide bindings for MHC alleles classes I and II using an attention-based deep neural model
title MHCAttnNet: predicting MHC-peptide bindings for MHC alleles classes I and II using an attention-based deep neural model
title_full MHCAttnNet: predicting MHC-peptide bindings for MHC alleles classes I and II using an attention-based deep neural model
title_fullStr MHCAttnNet: predicting MHC-peptide bindings for MHC alleles classes I and II using an attention-based deep neural model
title_full_unstemmed MHCAttnNet: predicting MHC-peptide bindings for MHC alleles classes I and II using an attention-based deep neural model
title_short MHCAttnNet: predicting MHC-peptide bindings for MHC alleles classes I and II using an attention-based deep neural model
title_sort mhcattnnet: predicting mhc-peptide bindings for mhc alleles classes i and ii using an attention-based deep neural model
topic Studies of Phenotypes and Clinical Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355292/
https://www.ncbi.nlm.nih.gov/pubmed/32657386
http://dx.doi.org/10.1093/bioinformatics/btaa479
work_keys_str_mv AT venkateshgopalakrishnan mhcattnnetpredictingmhcpeptidebindingsformhcallelesclassesiandiiusinganattentionbaseddeepneuralmodel
AT groveraayush mhcattnnetpredictingmhcpeptidebindingsformhcallelesclassesiandiiusinganattentionbaseddeepneuralmodel
AT srinivasaraghavang mhcattnnetpredictingmhcpeptidebindingsformhcallelesclassesiandiiusinganattentionbaseddeepneuralmodel
AT raoshrisha mhcattnnetpredictingmhcpeptidebindingsformhcallelesclassesiandiiusinganattentionbaseddeepneuralmodel