Cargando…

Identification of Amino Acid Propensities That Are Strong Determinants of Linear B-cell Epitope Using Neural Networks

BACKGROUND: Identification of amino acid propensities that are strong determinants of linear B-cell epitope is very important to enrich our knowledge about epitopes. This can also help to obtain better epitope prediction. Typical linear B-cell epitope prediction methods combine various propensities...

Descripción completa

Detalles Bibliográficos
Autores principales: Su, Chun-Hung, Pal, Nikhil R., Lin, Ken-Li, Chung, I-Fang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3275595/
https://www.ncbi.nlm.nih.gov/pubmed/22347389
http://dx.doi.org/10.1371/journal.pone.0030617
_version_ 1782223245413449728
author Su, Chun-Hung
Pal, Nikhil R.
Lin, Ken-Li
Chung, I-Fang
author_facet Su, Chun-Hung
Pal, Nikhil R.
Lin, Ken-Li
Chung, I-Fang
author_sort Su, Chun-Hung
collection PubMed
description BACKGROUND: Identification of amino acid propensities that are strong determinants of linear B-cell epitope is very important to enrich our knowledge about epitopes. This can also help to obtain better epitope prediction. Typical linear B-cell epitope prediction methods combine various propensities in different ways to improve prediction accuracies. However, fewer but better features may yield better prediction. Moreover, for a propensity, when the sequence length is k, there will be k values, which should be treated as a single unit for feature selection and hence usual feature selection method will not work. Here we use a novel Group Feature Selecting Multilayered Perceptron, GFSMLP, which treats a group of related information as a single entity and selects useful propensities related to linear B-cell epitopes, and uses them to predict epitopes. METHODOLOGY/ PRINCIPAL FINDINGS: We use eight widely known propensities and four data sets. We use GFSMLP to rank propensities by the frequency with which they are selected. We find that Chou's beta-turn and Ponnuswamy's polarity are better features for prediction of linear B-cell epitope. We examine the individual and combined discriminating power of the selected propensities and analyze the correlation between paired propensities. Our results show that the selected propensities are indeed good features, which also cooperate with other propensities to enhance the discriminating power for predicting epitopes. We find that individually polarity is not the best predictor, but it collaborates with others to yield good prediction. Usual feature selection methods cannot provide such information. CONCLUSIONS/ SIGNIFICANCE: Our results confirm the effectiveness of active (group) feature selection by GFSMLP over the traditional passive approaches of evaluating various combinations of propensities. The GFSMLP-based feature selection can be extended to more than 500 remaining propensities to enhance our biological knowledge about epitopes and to obtain better prediction. A graphical-user-interface version of GFSMLP is available at: http://bio.classcloud.org/GFSMLP/.
format Online
Article
Text
id pubmed-3275595
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32755952012-02-15 Identification of Amino Acid Propensities That Are Strong Determinants of Linear B-cell Epitope Using Neural Networks Su, Chun-Hung Pal, Nikhil R. Lin, Ken-Li Chung, I-Fang PLoS One Research Article BACKGROUND: Identification of amino acid propensities that are strong determinants of linear B-cell epitope is very important to enrich our knowledge about epitopes. This can also help to obtain better epitope prediction. Typical linear B-cell epitope prediction methods combine various propensities in different ways to improve prediction accuracies. However, fewer but better features may yield better prediction. Moreover, for a propensity, when the sequence length is k, there will be k values, which should be treated as a single unit for feature selection and hence usual feature selection method will not work. Here we use a novel Group Feature Selecting Multilayered Perceptron, GFSMLP, which treats a group of related information as a single entity and selects useful propensities related to linear B-cell epitopes, and uses them to predict epitopes. METHODOLOGY/ PRINCIPAL FINDINGS: We use eight widely known propensities and four data sets. We use GFSMLP to rank propensities by the frequency with which they are selected. We find that Chou's beta-turn and Ponnuswamy's polarity are better features for prediction of linear B-cell epitope. We examine the individual and combined discriminating power of the selected propensities and analyze the correlation between paired propensities. Our results show that the selected propensities are indeed good features, which also cooperate with other propensities to enhance the discriminating power for predicting epitopes. We find that individually polarity is not the best predictor, but it collaborates with others to yield good prediction. Usual feature selection methods cannot provide such information. CONCLUSIONS/ SIGNIFICANCE: Our results confirm the effectiveness of active (group) feature selection by GFSMLP over the traditional passive approaches of evaluating various combinations of propensities. The GFSMLP-based feature selection can be extended to more than 500 remaining propensities to enhance our biological knowledge about epitopes and to obtain better prediction. A graphical-user-interface version of GFSMLP is available at: http://bio.classcloud.org/GFSMLP/. Public Library of Science 2012-02-08 /pmc/articles/PMC3275595/ /pubmed/22347389 http://dx.doi.org/10.1371/journal.pone.0030617 Text en Su et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Su, Chun-Hung
Pal, Nikhil R.
Lin, Ken-Li
Chung, I-Fang
Identification of Amino Acid Propensities That Are Strong Determinants of Linear B-cell Epitope Using Neural Networks
title Identification of Amino Acid Propensities That Are Strong Determinants of Linear B-cell Epitope Using Neural Networks
title_full Identification of Amino Acid Propensities That Are Strong Determinants of Linear B-cell Epitope Using Neural Networks
title_fullStr Identification of Amino Acid Propensities That Are Strong Determinants of Linear B-cell Epitope Using Neural Networks
title_full_unstemmed Identification of Amino Acid Propensities That Are Strong Determinants of Linear B-cell Epitope Using Neural Networks
title_short Identification of Amino Acid Propensities That Are Strong Determinants of Linear B-cell Epitope Using Neural Networks
title_sort identification of amino acid propensities that are strong determinants of linear b-cell epitope using neural networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3275595/
https://www.ncbi.nlm.nih.gov/pubmed/22347389
http://dx.doi.org/10.1371/journal.pone.0030617
work_keys_str_mv AT suchunhung identificationofaminoacidpropensitiesthatarestrongdeterminantsoflinearbcellepitopeusingneuralnetworks
AT palnikhilr identificationofaminoacidpropensitiesthatarestrongdeterminantsoflinearbcellepitopeusingneuralnetworks
AT linkenli identificationofaminoacidpropensitiesthatarestrongdeterminantsoflinearbcellepitopeusingneuralnetworks
AT chungifang identificationofaminoacidpropensitiesthatarestrongdeterminantsoflinearbcellepitopeusingneuralnetworks