Cargando…
Prediction of B-cell epitopes using evolutionary information and propensity scales
BACKGROUND: Development of computational tools that can accurately predict presence and location of B-cell epitopes on pathogenic proteins has a valuable application to the field of vaccinology. Because of the highly variable yet enigmatic nature of B-cell epitopes, their prediction presents a great...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3549808/ https://www.ncbi.nlm.nih.gov/pubmed/23484214 http://dx.doi.org/10.1186/1471-2105-14-S2-S10 |
_version_ | 1782256474759626752 |
---|---|
author | Lin, Scott Yi-Heng Cheng, Cheng-Wei Su, Emily Chia-Yu |
author_facet | Lin, Scott Yi-Heng Cheng, Cheng-Wei Su, Emily Chia-Yu |
author_sort | Lin, Scott Yi-Heng |
collection | PubMed |
description | BACKGROUND: Development of computational tools that can accurately predict presence and location of B-cell epitopes on pathogenic proteins has a valuable application to the field of vaccinology. Because of the highly variable yet enigmatic nature of B-cell epitopes, their prediction presents a great challenge to computational immunologists. METHODS: We propose a method, BEEPro (B-cell epitope prediction by evolutionary information and propensity scales), which adapts a linear averaging scheme on 16 properties using a support vector machine model to predict both linear and conformational B-cell epitopes. These 16 properties include position specific scoring matrix (PSSM), an amino acid ratio scale, and a set of 14 physicochemical scales obtained via a feature selection process. Finally, a three-way data split procedure is used during the validation process to prevent over-estimation of prediction performance and avoid bias in our experiment results. RESULTS: In our experiment, first we use a non-redundant linear B-cell epitope dataset curated by Sollner et al. for feature selection and parameter optimization. Evaluated by a three-way data split procedure, BEEPro achieves significant improvement with the area under the receiver operating curve (AUC) = 0.9987, accuracy = 99.29%, mathew's correlation coefficient (MCC) = 0.9281, sensitivity = 0.9604, specificity = 0.9946, positive predictive value (PPV) = 0.9042 for the Sollner dataset. In addition, the same parameters are used to evaluate performance on other independent linear B-cell epitope test datasets, BEEPro attains an AUC which ranges from 0.9874 to 0.9950 and an accuracy which ranges from 93.73% to 97.31%. Moreover, five-fold cross-validation on one benchmark conformational B-cell epitope dataset yields an accuracy of 92.14% and AUC of 0.9066. CONCLUSIONS: Compared with other current models, our method achieves a significant improvement with respect to AUC, accuracy, MCC, sensitivity, specificity, and PPV. Thus, we have shown that an appropriate combination of evolutionary information and propensity scales with a support vector machine model can significantly enhance the prediction performance of both linear and conformational B-cell epitopes. |
format | Online Article Text |
id | pubmed-3549808 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35498082013-01-23 Prediction of B-cell epitopes using evolutionary information and propensity scales Lin, Scott Yi-Heng Cheng, Cheng-Wei Su, Emily Chia-Yu BMC Bioinformatics Proceedings BACKGROUND: Development of computational tools that can accurately predict presence and location of B-cell epitopes on pathogenic proteins has a valuable application to the field of vaccinology. Because of the highly variable yet enigmatic nature of B-cell epitopes, their prediction presents a great challenge to computational immunologists. METHODS: We propose a method, BEEPro (B-cell epitope prediction by evolutionary information and propensity scales), which adapts a linear averaging scheme on 16 properties using a support vector machine model to predict both linear and conformational B-cell epitopes. These 16 properties include position specific scoring matrix (PSSM), an amino acid ratio scale, and a set of 14 physicochemical scales obtained via a feature selection process. Finally, a three-way data split procedure is used during the validation process to prevent over-estimation of prediction performance and avoid bias in our experiment results. RESULTS: In our experiment, first we use a non-redundant linear B-cell epitope dataset curated by Sollner et al. for feature selection and parameter optimization. Evaluated by a three-way data split procedure, BEEPro achieves significant improvement with the area under the receiver operating curve (AUC) = 0.9987, accuracy = 99.29%, mathew's correlation coefficient (MCC) = 0.9281, sensitivity = 0.9604, specificity = 0.9946, positive predictive value (PPV) = 0.9042 for the Sollner dataset. In addition, the same parameters are used to evaluate performance on other independent linear B-cell epitope test datasets, BEEPro attains an AUC which ranges from 0.9874 to 0.9950 and an accuracy which ranges from 93.73% to 97.31%. Moreover, five-fold cross-validation on one benchmark conformational B-cell epitope dataset yields an accuracy of 92.14% and AUC of 0.9066. CONCLUSIONS: Compared with other current models, our method achieves a significant improvement with respect to AUC, accuracy, MCC, sensitivity, specificity, and PPV. Thus, we have shown that an appropriate combination of evolutionary information and propensity scales with a support vector machine model can significantly enhance the prediction performance of both linear and conformational B-cell epitopes. BioMed Central 2013-01-21 /pmc/articles/PMC3549808/ /pubmed/23484214 http://dx.doi.org/10.1186/1471-2105-14-S2-S10 Text en Copyright ©2013 Lin et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Lin, Scott Yi-Heng Cheng, Cheng-Wei Su, Emily Chia-Yu Prediction of B-cell epitopes using evolutionary information and propensity scales |
title | Prediction of B-cell epitopes using evolutionary information and propensity scales |
title_full | Prediction of B-cell epitopes using evolutionary information and propensity scales |
title_fullStr | Prediction of B-cell epitopes using evolutionary information and propensity scales |
title_full_unstemmed | Prediction of B-cell epitopes using evolutionary information and propensity scales |
title_short | Prediction of B-cell epitopes using evolutionary information and propensity scales |
title_sort | prediction of b-cell epitopes using evolutionary information and propensity scales |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3549808/ https://www.ncbi.nlm.nih.gov/pubmed/23484214 http://dx.doi.org/10.1186/1471-2105-14-S2-S10 |
work_keys_str_mv | AT linscottyiheng predictionofbcellepitopesusingevolutionaryinformationandpropensityscales AT chengchengwei predictionofbcellepitopesusingevolutionaryinformationandpropensityscales AT suemilychiayu predictionofbcellepitopesusingevolutionaryinformationandpropensityscales |