Cargando…
Integrating peptides' sequence and energy of contact residues information improves prediction of peptide and HLA-I binding with unknown alleles
BACKGROUND: The HLA (human leukocyte antigen) class I is a kind of molecule encoded by a large family of genes and is characteristic of high polymorphism. Now the number of the registered HLA-I molecules has exceeded 3000. Slight differences in the amino acid sequences of HLAs would make them bind t...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3654895/ https://www.ncbi.nlm.nih.gov/pubmed/23815611 http://dx.doi.org/10.1186/1471-2105-14-S8-S1 |
_version_ | 1782269787730083840 |
---|---|
author | Luo, Fei Gao, Yangyang Zhu, Yongqiong Liu, Juan |
author_facet | Luo, Fei Gao, Yangyang Zhu, Yongqiong Liu, Juan |
author_sort | Luo, Fei |
collection | PubMed |
description | BACKGROUND: The HLA (human leukocyte antigen) class I is a kind of molecule encoded by a large family of genes and is characteristic of high polymorphism. Now the number of the registered HLA-I molecules has exceeded 3000. Slight differences in the amino acid sequences of HLAs would make them bind to different sets of peptides. In the past decades, although many methods have been proposed to predict the binding between peptides and HLA-I molecules and achieved good performance, most experimental data used by them is limited to the HLAs with a small number of alleles. Thus they are inclined to obtain high prediction accuracy only for data with similar alleles. Because the peptides and HLAs together determine the binding, it's necessary to consider their contribution meanwhile. RESULTS: By taking into account the features of the peptides sequence and the energy of contact residues, in this paper a method based on the artificial neural network is proposed to predict the binding of peptides and HLA-I even when the HLAs' potential alleles are unknown. Two experiments in the allele-specific and super-type cases are performed respectively to validate our method. In the first case, we collect 14 HLA-A and 14 HLA-B molecules on Bjoern Peters dataset, and compare our method with the ARB, SMM, NetMHC and other 16 online methods. Our method gets the best average AUC (Area under the ROC) value as 0.909. In the second one, we use leave one out cross validation on MHC-peptide binding data that has different alleles but shares the common super-type. Compared to gold standard methods like NetMHC and NetMHCpan, our method again achieves the best average AUC value as 0.847. CONCLUSIONS: Our method achieves satisfactory results. Whenever it's tested on the HLA-I with single definite gene or with super-type gene locus, it gets better classification accuracy. Especially, when the training set is small, our method still works better than the other methods in the comparison. Therefore, we could make a conclusion that by combining the peptides' information, HLAs amino acid residues' interaction information and contact energy, our method really could improve prediction of the peptide HLA-I binding even when there aren't the prior experimental dataset for HLAs with various alleles. |
format | Online Article Text |
id | pubmed-3654895 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-36548952013-05-20 Integrating peptides' sequence and energy of contact residues information improves prediction of peptide and HLA-I binding with unknown alleles Luo, Fei Gao, Yangyang Zhu, Yongqiong Liu, Juan BMC Bioinformatics Proceedings BACKGROUND: The HLA (human leukocyte antigen) class I is a kind of molecule encoded by a large family of genes and is characteristic of high polymorphism. Now the number of the registered HLA-I molecules has exceeded 3000. Slight differences in the amino acid sequences of HLAs would make them bind to different sets of peptides. In the past decades, although many methods have been proposed to predict the binding between peptides and HLA-I molecules and achieved good performance, most experimental data used by them is limited to the HLAs with a small number of alleles. Thus they are inclined to obtain high prediction accuracy only for data with similar alleles. Because the peptides and HLAs together determine the binding, it's necessary to consider their contribution meanwhile. RESULTS: By taking into account the features of the peptides sequence and the energy of contact residues, in this paper a method based on the artificial neural network is proposed to predict the binding of peptides and HLA-I even when the HLAs' potential alleles are unknown. Two experiments in the allele-specific and super-type cases are performed respectively to validate our method. In the first case, we collect 14 HLA-A and 14 HLA-B molecules on Bjoern Peters dataset, and compare our method with the ARB, SMM, NetMHC and other 16 online methods. Our method gets the best average AUC (Area under the ROC) value as 0.909. In the second one, we use leave one out cross validation on MHC-peptide binding data that has different alleles but shares the common super-type. Compared to gold standard methods like NetMHC and NetMHCpan, our method again achieves the best average AUC value as 0.847. CONCLUSIONS: Our method achieves satisfactory results. Whenever it's tested on the HLA-I with single definite gene or with super-type gene locus, it gets better classification accuracy. Especially, when the training set is small, our method still works better than the other methods in the comparison. Therefore, we could make a conclusion that by combining the peptides' information, HLAs amino acid residues' interaction information and contact energy, our method really could improve prediction of the peptide HLA-I binding even when there aren't the prior experimental dataset for HLAs with various alleles. BioMed Central 2013-05-09 /pmc/articles/PMC3654895/ /pubmed/23815611 http://dx.doi.org/10.1186/1471-2105-14-S8-S1 Text en Copyright © 2013 Luo et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Luo, Fei Gao, Yangyang Zhu, Yongqiong Liu, Juan Integrating peptides' sequence and energy of contact residues information improves prediction of peptide and HLA-I binding with unknown alleles |
title | Integrating peptides' sequence and energy of contact residues information improves prediction of peptide and HLA-I binding with unknown alleles |
title_full | Integrating peptides' sequence and energy of contact residues information improves prediction of peptide and HLA-I binding with unknown alleles |
title_fullStr | Integrating peptides' sequence and energy of contact residues information improves prediction of peptide and HLA-I binding with unknown alleles |
title_full_unstemmed | Integrating peptides' sequence and energy of contact residues information improves prediction of peptide and HLA-I binding with unknown alleles |
title_short | Integrating peptides' sequence and energy of contact residues information improves prediction of peptide and HLA-I binding with unknown alleles |
title_sort | integrating peptides' sequence and energy of contact residues information improves prediction of peptide and hla-i binding with unknown alleles |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3654895/ https://www.ncbi.nlm.nih.gov/pubmed/23815611 http://dx.doi.org/10.1186/1471-2105-14-S8-S1 |
work_keys_str_mv | AT luofei integratingpeptidessequenceandenergyofcontactresiduesinformationimprovespredictionofpeptideandhlaibindingwithunknownalleles AT gaoyangyang integratingpeptidessequenceandenergyofcontactresiduesinformationimprovespredictionofpeptideandhlaibindingwithunknownalleles AT zhuyongqiong integratingpeptidessequenceandenergyofcontactresiduesinformationimprovespredictionofpeptideandhlaibindingwithunknownalleles AT liujuan integratingpeptidessequenceandenergyofcontactresiduesinformationimprovespredictionofpeptideandhlaibindingwithunknownalleles |