Cargando…

LipocalinPred: a SVM-based method for prediction of lipocalins

BACKGROUND: Functional annotation of rapidly amassing nucleotide and protein sequences presents a challenging task for modern bioinformatics. This is particularly true for protein families sharing extremely low sequence identity, as for lipocalins, a family of proteins with varied functions and grea...

Descripción completa

Detalles Bibliográficos
Autores principales: Ramana, Jayashree, Gupta, Dinesh
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2813246/
https://www.ncbi.nlm.nih.gov/pubmed/20030857
http://dx.doi.org/10.1186/1471-2105-10-445
_version_ 1782176900349689856
author Ramana, Jayashree
Gupta, Dinesh
author_facet Ramana, Jayashree
Gupta, Dinesh
author_sort Ramana, Jayashree
collection PubMed
description BACKGROUND: Functional annotation of rapidly amassing nucleotide and protein sequences presents a challenging task for modern bioinformatics. This is particularly true for protein families sharing extremely low sequence identity, as for lipocalins, a family of proteins with varied functions and great diversity at the sequence level, yet conserved structures. RESULTS: In the present study we propose a SVM based method for identification of lipocalin protein sequences. The SVM models were trained with the input features generated using amino acid, dipeptide and secondary structure compositions as well as PSSM profiles. The model derived using both PSSM and secondary structure emerged as the best model in the study. Apart from achieving a high prediction accuracy (>90% in leave-one-out), lipocalinpred correctly differentiates closely related fatty acid-binding proteins and triabins as non-lipocalins. CONCLUSION: The method offers a promising approach as a lipocalin prediction tool, complementing PROSITE, Pfam and homology modelling methods.
format Text
id pubmed-2813246
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28132462010-01-29 LipocalinPred: a SVM-based method for prediction of lipocalins Ramana, Jayashree Gupta, Dinesh BMC Bioinformatics Methodology article BACKGROUND: Functional annotation of rapidly amassing nucleotide and protein sequences presents a challenging task for modern bioinformatics. This is particularly true for protein families sharing extremely low sequence identity, as for lipocalins, a family of proteins with varied functions and great diversity at the sequence level, yet conserved structures. RESULTS: In the present study we propose a SVM based method for identification of lipocalin protein sequences. The SVM models were trained with the input features generated using amino acid, dipeptide and secondary structure compositions as well as PSSM profiles. The model derived using both PSSM and secondary structure emerged as the best model in the study. Apart from achieving a high prediction accuracy (>90% in leave-one-out), lipocalinpred correctly differentiates closely related fatty acid-binding proteins and triabins as non-lipocalins. CONCLUSION: The method offers a promising approach as a lipocalin prediction tool, complementing PROSITE, Pfam and homology modelling methods. BioMed Central 2009-12-24 /pmc/articles/PMC2813246/ /pubmed/20030857 http://dx.doi.org/10.1186/1471-2105-10-445 Text en Copyright ©2009 Ramana and Gupta; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology article
Ramana, Jayashree
Gupta, Dinesh
LipocalinPred: a SVM-based method for prediction of lipocalins
title LipocalinPred: a SVM-based method for prediction of lipocalins
title_full LipocalinPred: a SVM-based method for prediction of lipocalins
title_fullStr LipocalinPred: a SVM-based method for prediction of lipocalins
title_full_unstemmed LipocalinPred: a SVM-based method for prediction of lipocalins
title_short LipocalinPred: a SVM-based method for prediction of lipocalins
title_sort lipocalinpred: a svm-based method for prediction of lipocalins
topic Methodology article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2813246/
https://www.ncbi.nlm.nih.gov/pubmed/20030857
http://dx.doi.org/10.1186/1471-2105-10-445
work_keys_str_mv AT ramanajayashree lipocalinpredasvmbasedmethodforpredictionoflipocalins
AT guptadinesh lipocalinpredasvmbasedmethodforpredictionoflipocalins