Cargando…

Characterization and prediction of protein nucleolar localization sequences

Although the nucleolar localization of proteins is often believed to be mediated primarily by non-specific retention to core nucleolar components, many examples of short nucleolar targeting sequences have been reported in recent years. In this article, 46 human nucleolar localization sequences (NoLS...

Descripción completa

Detalles Bibliográficos
Autores principales: Scott, Michelle S., Boisvert, François-Michel, McDowall, Mark D., Lamond, Angus I., Barton, Geoffrey J.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2995072/
https://www.ncbi.nlm.nih.gov/pubmed/20663773
http://dx.doi.org/10.1093/nar/gkq653
_version_ 1782193044033896448
author Scott, Michelle S.
Boisvert, François-Michel
McDowall, Mark D.
Lamond, Angus I.
Barton, Geoffrey J.
author_facet Scott, Michelle S.
Boisvert, François-Michel
McDowall, Mark D.
Lamond, Angus I.
Barton, Geoffrey J.
author_sort Scott, Michelle S.
collection PubMed
description Although the nucleolar localization of proteins is often believed to be mediated primarily by non-specific retention to core nucleolar components, many examples of short nucleolar targeting sequences have been reported in recent years. In this article, 46 human nucleolar localization sequences (NoLSs) were collated from the literature and subjected to statistical analysis. Of the residues in these NoLSs 48% are basic, whereas 99% of the residues are predicted to be solvent-accessible with 42% in α-helix and 57% in coil. The sequence and predicted protein secondary structure of the 46 NoLSs were used to train an artificial neural network to identify NoLSs. At a true positive rate of 54%, the predictor’s overall false positive rate (FPR) is estimated to be 1.52%, which can be broken down to FPRs of 0.26% for randomly chosen cytoplasmic sequences, 0.80% for randomly chosen nucleoplasmic sequences and 12% for nuclear localization signals. The predictor was used to predict NoLSs in the complete human proteome and 10 of the highest scoring previously unknown NoLSs were experimentally confirmed. NoLSs are a prevalent type of targeting motif that is distinct from nuclear localization signals and that can be computationally predicted.
format Text
id pubmed-2995072
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-29950722010-12-01 Characterization and prediction of protein nucleolar localization sequences Scott, Michelle S. Boisvert, François-Michel McDowall, Mark D. Lamond, Angus I. Barton, Geoffrey J. Nucleic Acids Res Computational Biology Although the nucleolar localization of proteins is often believed to be mediated primarily by non-specific retention to core nucleolar components, many examples of short nucleolar targeting sequences have been reported in recent years. In this article, 46 human nucleolar localization sequences (NoLSs) were collated from the literature and subjected to statistical analysis. Of the residues in these NoLSs 48% are basic, whereas 99% of the residues are predicted to be solvent-accessible with 42% in α-helix and 57% in coil. The sequence and predicted protein secondary structure of the 46 NoLSs were used to train an artificial neural network to identify NoLSs. At a true positive rate of 54%, the predictor’s overall false positive rate (FPR) is estimated to be 1.52%, which can be broken down to FPRs of 0.26% for randomly chosen cytoplasmic sequences, 0.80% for randomly chosen nucleoplasmic sequences and 12% for nuclear localization signals. The predictor was used to predict NoLSs in the complete human proteome and 10 of the highest scoring previously unknown NoLSs were experimentally confirmed. NoLSs are a prevalent type of targeting motif that is distinct from nuclear localization signals and that can be computationally predicted. Oxford University Press 2010-11 2010-07-26 /pmc/articles/PMC2995072/ /pubmed/20663773 http://dx.doi.org/10.1093/nar/gkq653 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Scott, Michelle S.
Boisvert, François-Michel
McDowall, Mark D.
Lamond, Angus I.
Barton, Geoffrey J.
Characterization and prediction of protein nucleolar localization sequences
title Characterization and prediction of protein nucleolar localization sequences
title_full Characterization and prediction of protein nucleolar localization sequences
title_fullStr Characterization and prediction of protein nucleolar localization sequences
title_full_unstemmed Characterization and prediction of protein nucleolar localization sequences
title_short Characterization and prediction of protein nucleolar localization sequences
title_sort characterization and prediction of protein nucleolar localization sequences
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2995072/
https://www.ncbi.nlm.nih.gov/pubmed/20663773
http://dx.doi.org/10.1093/nar/gkq653
work_keys_str_mv AT scottmichelles characterizationandpredictionofproteinnucleolarlocalizationsequences
AT boisvertfrancoismichel characterizationandpredictionofproteinnucleolarlocalizationsequences
AT mcdowallmarkd characterizationandpredictionofproteinnucleolarlocalizationsequences
AT lamondangusi characterizationandpredictionofproteinnucleolarlocalizationsequences
AT bartongeoffreyj characterizationandpredictionofproteinnucleolarlocalizationsequences