Cargando…

UniProtKB amid the turmoil of plant proteomics research

The UniProt KnowledgeBase (UniProtKB) provides a single, centralized, authoritative resource for protein sequences and functional information. The majority of its records is based on automatic translation of coding sequences (CDS) provided by submitters at the time of initial deposition to the nucle...

Descripción completa

Detalles Bibliográficos
Autores principales:	Schneider, Michel, Consortium, the UniProt, Poux, Sylvain
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2012
Materias:	Plant Science
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3515866/ https://www.ncbi.nlm.nih.gov/pubmed/23230445 http://dx.doi.org/10.3389/fpls.2012.00270

_version_	1782252235575525376
author	Schneider, Michel Consortium, the UniProt Poux, Sylvain
author_facet	Schneider, Michel Consortium, the UniProt Poux, Sylvain
author_sort	Schneider, Michel
collection	PubMed
description	The UniProt KnowledgeBase (UniProtKB) provides a single, centralized, authoritative resource for protein sequences and functional information. The majority of its records is based on automatic translation of coding sequences (CDS) provided by submitters at the time of initial deposition to the nucleotide sequence databases (INSDC). This article will give a general overview of the current situation, with some specific illustrations extracted from our annotation of Arabidopsis and rice proteomes. More and more frequently, only the raw sequence of a complete genome is deposited to the nucleotide sequence databases and the gene model predictions and annotations are kept in separate, specialized model organism databases (MODs). In order to be able to provide the complete proteome of model organisms, UniProtKB had to implement pipelines for import of protein sequences from Ensembl and EnsemblGenomes. A single genome can be the target of several unrelated sequencing projects and the final assembly and gene model predictions may diverge quite significantly. In addition, several cultivars of the same species are often sequenced – 1001 Arabidopsis cultivars are currently under way – and the resulting proteomes are far from being identical. Therefore, one challenge for UniProtKB is to store and organize these data in a convenient way and to clearly defined reference proteomes that should be made available to users. Manual annotation is one of the landmarks of the Swiss-Prot section of UniProtKB. Besides adding functional annotation, curators are checking, and often correcting, gene model predictions. For plants, this task is limited to Arabidopsis thaliana and Oryza sativa subsp. japonica. Proteomics data providing experimental evidences confirming the existence of proteins or identifying sequence features such as post-translational modifications are also imported into UniProtKB records and the knowledgebase is cross-referenced to numerous proteomics resource.
format	Online Article Text
id	pubmed-3515866
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-35158662012-12-10 UniProtKB amid the turmoil of plant proteomics research Schneider, Michel Consortium, the UniProt Poux, Sylvain Front Plant Sci Plant Science The UniProt KnowledgeBase (UniProtKB) provides a single, centralized, authoritative resource for protein sequences and functional information. The majority of its records is based on automatic translation of coding sequences (CDS) provided by submitters at the time of initial deposition to the nucleotide sequence databases (INSDC). This article will give a general overview of the current situation, with some specific illustrations extracted from our annotation of Arabidopsis and rice proteomes. More and more frequently, only the raw sequence of a complete genome is deposited to the nucleotide sequence databases and the gene model predictions and annotations are kept in separate, specialized model organism databases (MODs). In order to be able to provide the complete proteome of model organisms, UniProtKB had to implement pipelines for import of protein sequences from Ensembl and EnsemblGenomes. A single genome can be the target of several unrelated sequencing projects and the final assembly and gene model predictions may diverge quite significantly. In addition, several cultivars of the same species are often sequenced – 1001 Arabidopsis cultivars are currently under way – and the resulting proteomes are far from being identical. Therefore, one challenge for UniProtKB is to store and organize these data in a convenient way and to clearly defined reference proteomes that should be made available to users. Manual annotation is one of the landmarks of the Swiss-Prot section of UniProtKB. Besides adding functional annotation, curators are checking, and often correcting, gene model predictions. For plants, this task is limited to Arabidopsis thaliana and Oryza sativa subsp. japonica. Proteomics data providing experimental evidences confirming the existence of proteins or identifying sequence features such as post-translational modifications are also imported into UniProtKB records and the knowledgebase is cross-referenced to numerous proteomics resource. Frontiers Media S.A. 2012-12-06 /pmc/articles/PMC3515866/ /pubmed/23230445 http://dx.doi.org/10.3389/fpls.2012.00270 Text en Copyright © Schneider, the UniProt Consortium and Poux. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.
spellingShingle	Plant Science Schneider, Michel Consortium, the UniProt Poux, Sylvain UniProtKB amid the turmoil of plant proteomics research
title	UniProtKB amid the turmoil of plant proteomics research
title_full	UniProtKB amid the turmoil of plant proteomics research
title_fullStr	UniProtKB amid the turmoil of plant proteomics research
title_full_unstemmed	UniProtKB amid the turmoil of plant proteomics research
title_short	UniProtKB amid the turmoil of plant proteomics research
title_sort	uniprotkb amid the turmoil of plant proteomics research
topic	Plant Science
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3515866/ https://www.ncbi.nlm.nih.gov/pubmed/23230445 http://dx.doi.org/10.3389/fpls.2012.00270
work_keys_str_mv	AT schneidermichel uniprotkbamidtheturmoilofplantproteomicsresearch AT consortiumtheuniprot uniprotkbamidtheturmoilofplantproteomicsresearch AT pouxsylvain uniprotkbamidtheturmoilofplantproteomicsresearch

UniProtKB amid the turmoil of plant proteomics research

Ejemplares similares