Cargando…

A comprehensive assessment of N-terminal signal peptides prediction methods

BACKGROUND: Amino-terminal signal peptides (SPs) are short regions that guide the targeting of secretory proteins to the correct subcellular compartments in the cell. They are cleaved off upon the passenger protein reaching its destination. The explosive growth in sequencing technologies has led to...

Descripción completa

Detalles Bibliográficos
Autores principales: Choo, Khar Heng, Tan, Tin Wee, Ranganathan, Shoba
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2788353/
https://www.ncbi.nlm.nih.gov/pubmed/19958512
http://dx.doi.org/10.1186/1471-2105-10-S15-S2
_version_ 1782174961969922048
author Choo, Khar Heng
Tan, Tin Wee
Ranganathan, Shoba
author_facet Choo, Khar Heng
Tan, Tin Wee
Ranganathan, Shoba
author_sort Choo, Khar Heng
collection PubMed
description BACKGROUND: Amino-terminal signal peptides (SPs) are short regions that guide the targeting of secretory proteins to the correct subcellular compartments in the cell. They are cleaved off upon the passenger protein reaching its destination. The explosive growth in sequencing technologies has led to the deposition of vast numbers of protein sequences necessitating rapid functional annotation techniques, with subcellular localization being a key feature. Of the myriad software prediction tools developed to automate the task of assigning the SP cleavage site of these new sequences, we review here, the performance and reliability of commonly used SP prediction tools. RESULTS: The available signal peptide data has been manually curated and organized into three datasets representing eukaryotes, Gram-positive and Gram-negative bacteria. These datasets are used to evaluate thirteen prediction tools that are publicly available. SignalP (both the HMM and ANN versions) maintains consistency and achieves the best overall accuracy in all three benchmarking experiments, ranging from 0.872 to 0.914 although other prediction tools are narrowing the performance gap. CONCLUSION: The majority of the tools evaluated in this study encounter no difficulty in discriminating between secretory and non-secretory proteins. The challenge clearly remains with pinpointing the correct SP cleavage site. The composite scoring schemes employed by SignalP may help to explain its accuracy. Prediction task is divided into a number of separate steps, thus allowing each score to tackle a particular aspect of the prediction.
format Text
id pubmed-2788353
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27883532009-12-04 A comprehensive assessment of N-terminal signal peptides prediction methods Choo, Khar Heng Tan, Tin Wee Ranganathan, Shoba BMC Bioinformatics Proceedings BACKGROUND: Amino-terminal signal peptides (SPs) are short regions that guide the targeting of secretory proteins to the correct subcellular compartments in the cell. They are cleaved off upon the passenger protein reaching its destination. The explosive growth in sequencing technologies has led to the deposition of vast numbers of protein sequences necessitating rapid functional annotation techniques, with subcellular localization being a key feature. Of the myriad software prediction tools developed to automate the task of assigning the SP cleavage site of these new sequences, we review here, the performance and reliability of commonly used SP prediction tools. RESULTS: The available signal peptide data has been manually curated and organized into three datasets representing eukaryotes, Gram-positive and Gram-negative bacteria. These datasets are used to evaluate thirteen prediction tools that are publicly available. SignalP (both the HMM and ANN versions) maintains consistency and achieves the best overall accuracy in all three benchmarking experiments, ranging from 0.872 to 0.914 although other prediction tools are narrowing the performance gap. CONCLUSION: The majority of the tools evaluated in this study encounter no difficulty in discriminating between secretory and non-secretory proteins. The challenge clearly remains with pinpointing the correct SP cleavage site. The composite scoring schemes employed by SignalP may help to explain its accuracy. Prediction task is divided into a number of separate steps, thus allowing each score to tackle a particular aspect of the prediction. BioMed Central 2009-12-03 /pmc/articles/PMC2788353/ /pubmed/19958512 http://dx.doi.org/10.1186/1471-2105-10-S15-S2 Text en Copyright ©2009 Choo et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Choo, Khar Heng
Tan, Tin Wee
Ranganathan, Shoba
A comprehensive assessment of N-terminal signal peptides prediction methods
title A comprehensive assessment of N-terminal signal peptides prediction methods
title_full A comprehensive assessment of N-terminal signal peptides prediction methods
title_fullStr A comprehensive assessment of N-terminal signal peptides prediction methods
title_full_unstemmed A comprehensive assessment of N-terminal signal peptides prediction methods
title_short A comprehensive assessment of N-terminal signal peptides prediction methods
title_sort comprehensive assessment of n-terminal signal peptides prediction methods
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2788353/
https://www.ncbi.nlm.nih.gov/pubmed/19958512
http://dx.doi.org/10.1186/1471-2105-10-S15-S2
work_keys_str_mv AT chookharheng acomprehensiveassessmentofnterminalsignalpeptidespredictionmethods
AT tantinwee acomprehensiveassessmentofnterminalsignalpeptidespredictionmethods
AT ranganathanshoba acomprehensiveassessmentofnterminalsignalpeptidespredictionmethods
AT chookharheng comprehensiveassessmentofnterminalsignalpeptidespredictionmethods
AT tantinwee comprehensiveassessmentofnterminalsignalpeptidespredictionmethods
AT ranganathanshoba comprehensiveassessmentofnterminalsignalpeptidespredictionmethods