Cargando…

DeepSeqPan, a novel deep convolutional neural network model for pan-specific class I HLA-peptide binding affinity prediction

Interactions between human leukocyte antigens (HLAs) and peptides play a critical role in the human immune system. Accurate computational prediction of HLA-binding peptides can be used for peptide drug discovery. Currently, the best prediction algorithms are neural network-based pan-specific models,...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Zhonghao, Cui, Yuxin, Xiong, Zheng, Nasiri, Alierza, Zhang, Ansi, Hu, Jianjun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6349913/
https://www.ncbi.nlm.nih.gov/pubmed/30692623
http://dx.doi.org/10.1038/s41598-018-37214-1
_version_ 1783390347755257856
author Liu, Zhonghao
Cui, Yuxin
Xiong, Zheng
Nasiri, Alierza
Zhang, Ansi
Hu, Jianjun
author_facet Liu, Zhonghao
Cui, Yuxin
Xiong, Zheng
Nasiri, Alierza
Zhang, Ansi
Hu, Jianjun
author_sort Liu, Zhonghao
collection PubMed
description Interactions between human leukocyte antigens (HLAs) and peptides play a critical role in the human immune system. Accurate computational prediction of HLA-binding peptides can be used for peptide drug discovery. Currently, the best prediction algorithms are neural network-based pan-specific models, which take advantage of the large amount of data across HLA alleles. However, current pan-specific models are all based on the pseudo sequence encoding for modeling the binding context, which is based on 34 positions identified from the HLA protein-peptide bound structures in early works. In this work, we proposed a novel deep convolutional neural network model (DCNN) for HLA-peptide binding prediction, in which the encoding of the HLA sequence and the binding context are both learned by the network itself without requiring the HLA-peptide bound structure information. Our DCNN model is also characterized by its binding context extraction layer and dual outputs with both binding affinity output and binding probability outputs. Evaluation on public benchmark datasets shows that our DeepSeqPan model without HLA structural information in training achieves state-of-the-art performance on a large number of HLA alleles with good generalization capability. Since our model only needs raw sequences from the HLA-peptide binding pairs, it can be applied to binding predictions of HLAs without structure information and can also be applied to other protein binding problems such as protein-DNA and protein-RNA bindings. The implementation code and trained models are freely available at https://github.com/pcpLiu/DeepSeqPan.
format Online
Article
Text
id pubmed-6349913
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-63499132019-01-30 DeepSeqPan, a novel deep convolutional neural network model for pan-specific class I HLA-peptide binding affinity prediction Liu, Zhonghao Cui, Yuxin Xiong, Zheng Nasiri, Alierza Zhang, Ansi Hu, Jianjun Sci Rep Article Interactions between human leukocyte antigens (HLAs) and peptides play a critical role in the human immune system. Accurate computational prediction of HLA-binding peptides can be used for peptide drug discovery. Currently, the best prediction algorithms are neural network-based pan-specific models, which take advantage of the large amount of data across HLA alleles. However, current pan-specific models are all based on the pseudo sequence encoding for modeling the binding context, which is based on 34 positions identified from the HLA protein-peptide bound structures in early works. In this work, we proposed a novel deep convolutional neural network model (DCNN) for HLA-peptide binding prediction, in which the encoding of the HLA sequence and the binding context are both learned by the network itself without requiring the HLA-peptide bound structure information. Our DCNN model is also characterized by its binding context extraction layer and dual outputs with both binding affinity output and binding probability outputs. Evaluation on public benchmark datasets shows that our DeepSeqPan model without HLA structural information in training achieves state-of-the-art performance on a large number of HLA alleles with good generalization capability. Since our model only needs raw sequences from the HLA-peptide binding pairs, it can be applied to binding predictions of HLAs without structure information and can also be applied to other protein binding problems such as protein-DNA and protein-RNA bindings. The implementation code and trained models are freely available at https://github.com/pcpLiu/DeepSeqPan. Nature Publishing Group UK 2019-01-28 /pmc/articles/PMC6349913/ /pubmed/30692623 http://dx.doi.org/10.1038/s41598-018-37214-1 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Liu, Zhonghao
Cui, Yuxin
Xiong, Zheng
Nasiri, Alierza
Zhang, Ansi
Hu, Jianjun
DeepSeqPan, a novel deep convolutional neural network model for pan-specific class I HLA-peptide binding affinity prediction
title DeepSeqPan, a novel deep convolutional neural network model for pan-specific class I HLA-peptide binding affinity prediction
title_full DeepSeqPan, a novel deep convolutional neural network model for pan-specific class I HLA-peptide binding affinity prediction
title_fullStr DeepSeqPan, a novel deep convolutional neural network model for pan-specific class I HLA-peptide binding affinity prediction
title_full_unstemmed DeepSeqPan, a novel deep convolutional neural network model for pan-specific class I HLA-peptide binding affinity prediction
title_short DeepSeqPan, a novel deep convolutional neural network model for pan-specific class I HLA-peptide binding affinity prediction
title_sort deepseqpan, a novel deep convolutional neural network model for pan-specific class i hla-peptide binding affinity prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6349913/
https://www.ncbi.nlm.nih.gov/pubmed/30692623
http://dx.doi.org/10.1038/s41598-018-37214-1
work_keys_str_mv AT liuzhonghao deepseqpananoveldeepconvolutionalneuralnetworkmodelforpanspecificclassihlapeptidebindingaffinityprediction
AT cuiyuxin deepseqpananoveldeepconvolutionalneuralnetworkmodelforpanspecificclassihlapeptidebindingaffinityprediction
AT xiongzheng deepseqpananoveldeepconvolutionalneuralnetworkmodelforpanspecificclassihlapeptidebindingaffinityprediction
AT nasirialierza deepseqpananoveldeepconvolutionalneuralnetworkmodelforpanspecificclassihlapeptidebindingaffinityprediction
AT zhangansi deepseqpananoveldeepconvolutionalneuralnetworkmodelforpanspecificclassihlapeptidebindingaffinityprediction
AT hujianjun deepseqpananoveldeepconvolutionalneuralnetworkmodelforpanspecificclassihlapeptidebindingaffinityprediction