Cargando…

Integrating peptides' sequence and energy of contact residues information improves prediction of peptide and HLA-I binding with unknown alleles

BACKGROUND: The HLA (human leukocyte antigen) class I is a kind of molecule encoded by a large family of genes and is characteristic of high polymorphism. Now the number of the registered HLA-I molecules has exceeded 3000. Slight differences in the amino acid sequences of HLAs would make them bind t...

Descripción completa

Detalles Bibliográficos
Autores principales: Luo, Fei, Gao, Yangyang, Zhu, Yongqiong, Liu, Juan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3654895/
https://www.ncbi.nlm.nih.gov/pubmed/23815611
http://dx.doi.org/10.1186/1471-2105-14-S8-S1
_version_ 1782269787730083840
author Luo, Fei
Gao, Yangyang
Zhu, Yongqiong
Liu, Juan
author_facet Luo, Fei
Gao, Yangyang
Zhu, Yongqiong
Liu, Juan
author_sort Luo, Fei
collection PubMed
description BACKGROUND: The HLA (human leukocyte antigen) class I is a kind of molecule encoded by a large family of genes and is characteristic of high polymorphism. Now the number of the registered HLA-I molecules has exceeded 3000. Slight differences in the amino acid sequences of HLAs would make them bind to different sets of peptides. In the past decades, although many methods have been proposed to predict the binding between peptides and HLA-I molecules and achieved good performance, most experimental data used by them is limited to the HLAs with a small number of alleles. Thus they are inclined to obtain high prediction accuracy only for data with similar alleles. Because the peptides and HLAs together determine the binding, it's necessary to consider their contribution meanwhile. RESULTS: By taking into account the features of the peptides sequence and the energy of contact residues, in this paper a method based on the artificial neural network is proposed to predict the binding of peptides and HLA-I even when the HLAs' potential alleles are unknown. Two experiments in the allele-specific and super-type cases are performed respectively to validate our method. In the first case, we collect 14 HLA-A and 14 HLA-B molecules on Bjoern Peters dataset, and compare our method with the ARB, SMM, NetMHC and other 16 online methods. Our method gets the best average AUC (Area under the ROC) value as 0.909. In the second one, we use leave one out cross validation on MHC-peptide binding data that has different alleles but shares the common super-type. Compared to gold standard methods like NetMHC and NetMHCpan, our method again achieves the best average AUC value as 0.847. CONCLUSIONS: Our method achieves satisfactory results. Whenever it's tested on the HLA-I with single definite gene or with super-type gene locus, it gets better classification accuracy. Especially, when the training set is small, our method still works better than the other methods in the comparison. Therefore, we could make a conclusion that by combining the peptides' information, HLAs amino acid residues' interaction information and contact energy, our method really could improve prediction of the peptide HLA-I binding even when there aren't the prior experimental dataset for HLAs with various alleles.
format Online
Article
Text
id pubmed-3654895
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36548952013-05-20 Integrating peptides' sequence and energy of contact residues information improves prediction of peptide and HLA-I binding with unknown alleles Luo, Fei Gao, Yangyang Zhu, Yongqiong Liu, Juan BMC Bioinformatics Proceedings BACKGROUND: The HLA (human leukocyte antigen) class I is a kind of molecule encoded by a large family of genes and is characteristic of high polymorphism. Now the number of the registered HLA-I molecules has exceeded 3000. Slight differences in the amino acid sequences of HLAs would make them bind to different sets of peptides. In the past decades, although many methods have been proposed to predict the binding between peptides and HLA-I molecules and achieved good performance, most experimental data used by them is limited to the HLAs with a small number of alleles. Thus they are inclined to obtain high prediction accuracy only for data with similar alleles. Because the peptides and HLAs together determine the binding, it's necessary to consider their contribution meanwhile. RESULTS: By taking into account the features of the peptides sequence and the energy of contact residues, in this paper a method based on the artificial neural network is proposed to predict the binding of peptides and HLA-I even when the HLAs' potential alleles are unknown. Two experiments in the allele-specific and super-type cases are performed respectively to validate our method. In the first case, we collect 14 HLA-A and 14 HLA-B molecules on Bjoern Peters dataset, and compare our method with the ARB, SMM, NetMHC and other 16 online methods. Our method gets the best average AUC (Area under the ROC) value as 0.909. In the second one, we use leave one out cross validation on MHC-peptide binding data that has different alleles but shares the common super-type. Compared to gold standard methods like NetMHC and NetMHCpan, our method again achieves the best average AUC value as 0.847. CONCLUSIONS: Our method achieves satisfactory results. Whenever it's tested on the HLA-I with single definite gene or with super-type gene locus, it gets better classification accuracy. Especially, when the training set is small, our method still works better than the other methods in the comparison. Therefore, we could make a conclusion that by combining the peptides' information, HLAs amino acid residues' interaction information and contact energy, our method really could improve prediction of the peptide HLA-I binding even when there aren't the prior experimental dataset for HLAs with various alleles. BioMed Central 2013-05-09 /pmc/articles/PMC3654895/ /pubmed/23815611 http://dx.doi.org/10.1186/1471-2105-14-S8-S1 Text en Copyright © 2013 Luo et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Luo, Fei
Gao, Yangyang
Zhu, Yongqiong
Liu, Juan
Integrating peptides' sequence and energy of contact residues information improves prediction of peptide and HLA-I binding with unknown alleles
title Integrating peptides' sequence and energy of contact residues information improves prediction of peptide and HLA-I binding with unknown alleles
title_full Integrating peptides' sequence and energy of contact residues information improves prediction of peptide and HLA-I binding with unknown alleles
title_fullStr Integrating peptides' sequence and energy of contact residues information improves prediction of peptide and HLA-I binding with unknown alleles
title_full_unstemmed Integrating peptides' sequence and energy of contact residues information improves prediction of peptide and HLA-I binding with unknown alleles
title_short Integrating peptides' sequence and energy of contact residues information improves prediction of peptide and HLA-I binding with unknown alleles
title_sort integrating peptides' sequence and energy of contact residues information improves prediction of peptide and hla-i binding with unknown alleles
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3654895/
https://www.ncbi.nlm.nih.gov/pubmed/23815611
http://dx.doi.org/10.1186/1471-2105-14-S8-S1
work_keys_str_mv AT luofei integratingpeptidessequenceandenergyofcontactresiduesinformationimprovespredictionofpeptideandhlaibindingwithunknownalleles
AT gaoyangyang integratingpeptidessequenceandenergyofcontactresiduesinformationimprovespredictionofpeptideandhlaibindingwithunknownalleles
AT zhuyongqiong integratingpeptidessequenceandenergyofcontactresiduesinformationimprovespredictionofpeptideandhlaibindingwithunknownalleles
AT liujuan integratingpeptidessequenceandenergyofcontactresiduesinformationimprovespredictionofpeptideandhlaibindingwithunknownalleles