Cargando…

UniProt: a hub for protein information

UniProt is an important collection of protein sequences and their annotations, which has doubled in size to 80 million sequences during the past year. This growth in sequences has prompted an extension of UniProt accession number space from 6 to 10 characters. An increasing fraction of new sequences...

Descripción completa

Detalles Bibliográficos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4384041/
https://www.ncbi.nlm.nih.gov/pubmed/25348405
http://dx.doi.org/10.1093/nar/gku989
_version_ 1782364839293747200
collection PubMed
description UniProt is an important collection of protein sequences and their annotations, which has doubled in size to 80 million sequences during the past year. This growth in sequences has prompted an extension of UniProt accession number space from 6 to 10 characters. An increasing fraction of new sequences are identical to a sequence that already exists in the database with the majority of sequences coming from genome sequencing projects. We have created a new proteome identifier that uniquely identifies a particular assembly of a species and strain or subspecies to help users track the provenance of sequences. We present a new website that has been designed using a user-experience design process. We have introduced an annotation score for all entries in UniProt to represent the relative amount of knowledge known about each protein. These scores will be helpful in identifying which proteins are the best characterized and most informative for comparative analysis. All UniProt data is provided freely and is available on the web at http://www.uniprot.org/.
format Online
Article
Text
id pubmed-4384041
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-43840412015-04-08 UniProt: a hub for protein information Nucleic Acids Res Database Issue UniProt is an important collection of protein sequences and their annotations, which has doubled in size to 80 million sequences during the past year. This growth in sequences has prompted an extension of UniProt accession number space from 6 to 10 characters. An increasing fraction of new sequences are identical to a sequence that already exists in the database with the majority of sequences coming from genome sequencing projects. We have created a new proteome identifier that uniquely identifies a particular assembly of a species and strain or subspecies to help users track the provenance of sequences. We present a new website that has been designed using a user-experience design process. We have introduced an annotation score for all entries in UniProt to represent the relative amount of knowledge known about each protein. These scores will be helpful in identifying which proteins are the best characterized and most informative for comparative analysis. All UniProt data is provided freely and is available on the web at http://www.uniprot.org/. Oxford University Press 2014-10-27 2015-01-28 /pmc/articles/PMC4384041/ /pubmed/25348405 http://dx.doi.org/10.1093/nar/gku989 Text en © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Issue
UniProt: a hub for protein information
title UniProt: a hub for protein information
title_full UniProt: a hub for protein information
title_fullStr UniProt: a hub for protein information
title_full_unstemmed UniProt: a hub for protein information
title_short UniProt: a hub for protein information
title_sort uniprot: a hub for protein information
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4384041/
https://www.ncbi.nlm.nih.gov/pubmed/25348405
http://dx.doi.org/10.1093/nar/gku989
work_keys_str_mv AT uniprotahubforproteininformation