Cargando…

Exploiting heterogeneous features to improve in silico prediction of peptide status – amyloidogenic or non-amyloidogenic

BACKGROUND: Prediction of short stretches in protein sequences capable of forming amyloid-like fibrils is important in understanding the underlying cause of amyloid illnesses thereby aiding in the discovery of sequence-targeted anti-aggregation pharmaceuticals. Due to the constraints of experimental...

Descripción completa

Detalles Bibliográficos
Autores principales: Nair, Smitha Sunil Kumaran, Subba Reddy, NV, Hareesha, KS
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3278838/
https://www.ncbi.nlm.nih.gov/pubmed/22373069
http://dx.doi.org/10.1186/1471-2105-12-S13-S21
_version_ 1782223613345136640
author Nair, Smitha Sunil Kumaran
Subba Reddy, NV
Hareesha, KS
author_facet Nair, Smitha Sunil Kumaran
Subba Reddy, NV
Hareesha, KS
author_sort Nair, Smitha Sunil Kumaran
collection PubMed
description BACKGROUND: Prediction of short stretches in protein sequences capable of forming amyloid-like fibrils is important in understanding the underlying cause of amyloid illnesses thereby aiding in the discovery of sequence-targeted anti-aggregation pharmaceuticals. Due to the constraints of experimental molecular techniques in identifying such motif segments, it is highly desirable to develop computational methods to provide better and affordable in silico predictions. RESULTS: Accurate in silico prediction techniques of amyloidogenic peptide regions rely on the cooperation between informative features and classifier design. In this research article, we propose one such efficient fibril prediction implementation exploiting heterogeneous features based on bio-physio-chemical (BPC) properties, auto-correlation function of carefully selected amino acid indices and atomic composition within a protein fragment of amino acids in a window. In an attempt to get an optimal number of BPC features, an evolutionary Support Vector Machine (SVM) integrating a novel implementation of hybrid Genetic Algorithm termed Memetic Algorithm and SVM is utilized. Five prediction modules designed using Artificial Neural Network (ANN) models are trained with independent and integrated features in order to validate the fibril forming motifs. The results provide evidence that incorporating new feature namely auto-correlation function besides BPC, attempt to strengthen the sequence interaction effect in forming the feature vector thereby obtaining better prediction quality in terms of sensitivity, specificity, Mathews Correlation Coefficient and Area under the Receiver Operating Characteristics curve. CONCLUSION: A significant improvement in performance is observed by introducing features like auto-correlation function that maintains sequence order effect, in addition to the conventional BPC properties selected through a novel optimization strategy to predict the peptide status – amyloidogenic or non-amyloidogenic. The proposed approach achieves acceptable results, comparable to most online predictors. Besides, it compensates the lacuna in existing amyloid fibril prediction tools by maintaining equilibrium between sensitivity and specificity.
format Online
Article
Text
id pubmed-3278838
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32788382012-02-14 Exploiting heterogeneous features to improve in silico prediction of peptide status – amyloidogenic or non-amyloidogenic Nair, Smitha Sunil Kumaran Subba Reddy, NV Hareesha, KS BMC Bioinformatics Proceedings BACKGROUND: Prediction of short stretches in protein sequences capable of forming amyloid-like fibrils is important in understanding the underlying cause of amyloid illnesses thereby aiding in the discovery of sequence-targeted anti-aggregation pharmaceuticals. Due to the constraints of experimental molecular techniques in identifying such motif segments, it is highly desirable to develop computational methods to provide better and affordable in silico predictions. RESULTS: Accurate in silico prediction techniques of amyloidogenic peptide regions rely on the cooperation between informative features and classifier design. In this research article, we propose one such efficient fibril prediction implementation exploiting heterogeneous features based on bio-physio-chemical (BPC) properties, auto-correlation function of carefully selected amino acid indices and atomic composition within a protein fragment of amino acids in a window. In an attempt to get an optimal number of BPC features, an evolutionary Support Vector Machine (SVM) integrating a novel implementation of hybrid Genetic Algorithm termed Memetic Algorithm and SVM is utilized. Five prediction modules designed using Artificial Neural Network (ANN) models are trained with independent and integrated features in order to validate the fibril forming motifs. The results provide evidence that incorporating new feature namely auto-correlation function besides BPC, attempt to strengthen the sequence interaction effect in forming the feature vector thereby obtaining better prediction quality in terms of sensitivity, specificity, Mathews Correlation Coefficient and Area under the Receiver Operating Characteristics curve. CONCLUSION: A significant improvement in performance is observed by introducing features like auto-correlation function that maintains sequence order effect, in addition to the conventional BPC properties selected through a novel optimization strategy to predict the peptide status – amyloidogenic or non-amyloidogenic. The proposed approach achieves acceptable results, comparable to most online predictors. Besides, it compensates the lacuna in existing amyloid fibril prediction tools by maintaining equilibrium between sensitivity and specificity. BioMed Central 2011-11-30 /pmc/articles/PMC3278838/ /pubmed/22373069 http://dx.doi.org/10.1186/1471-2105-12-S13-S21 Text en Copyright ©2011 Nair et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Nair, Smitha Sunil Kumaran
Subba Reddy, NV
Hareesha, KS
Exploiting heterogeneous features to improve in silico prediction of peptide status – amyloidogenic or non-amyloidogenic
title Exploiting heterogeneous features to improve in silico prediction of peptide status – amyloidogenic or non-amyloidogenic
title_full Exploiting heterogeneous features to improve in silico prediction of peptide status – amyloidogenic or non-amyloidogenic
title_fullStr Exploiting heterogeneous features to improve in silico prediction of peptide status – amyloidogenic or non-amyloidogenic
title_full_unstemmed Exploiting heterogeneous features to improve in silico prediction of peptide status – amyloidogenic or non-amyloidogenic
title_short Exploiting heterogeneous features to improve in silico prediction of peptide status – amyloidogenic or non-amyloidogenic
title_sort exploiting heterogeneous features to improve in silico prediction of peptide status – amyloidogenic or non-amyloidogenic
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3278838/
https://www.ncbi.nlm.nih.gov/pubmed/22373069
http://dx.doi.org/10.1186/1471-2105-12-S13-S21
work_keys_str_mv AT nairsmithasunilkumaran exploitingheterogeneousfeaturestoimproveinsilicopredictionofpeptidestatusamyloidogenicornonamyloidogenic
AT subbareddynv exploitingheterogeneousfeaturestoimproveinsilicopredictionofpeptidestatusamyloidogenicornonamyloidogenic
AT hareeshaks exploitingheterogeneousfeaturestoimproveinsilicopredictionofpeptidestatusamyloidogenicornonamyloidogenic