Cargando…

Making proteomics data accessible and reusable: Current state of proteomics databases and repositories

Compared to other data-intensive disciplines such as genomics, public deposition and storage of MS-based proteomics, data are still less developed due to, among other reasons, the inherent complexity of the data and the variety of data types and experimental workflows. In order to address this need,...

Descripción completa

Detalles Bibliográficos
Autores principales: Perez-Riverol, Yasset, Alpi, Emanuele, Wang, Rui, Hermjakob, Henning, Vizcaíno, Juan Antonio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BlackWell Publishing Ltd 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4409848/
https://www.ncbi.nlm.nih.gov/pubmed/25158685
http://dx.doi.org/10.1002/pmic.201400302
_version_ 1782368248371609600
author Perez-Riverol, Yasset
Alpi, Emanuele
Wang, Rui
Hermjakob, Henning
Vizcaíno, Juan Antonio
author_facet Perez-Riverol, Yasset
Alpi, Emanuele
Wang, Rui
Hermjakob, Henning
Vizcaíno, Juan Antonio
author_sort Perez-Riverol, Yasset
collection PubMed
description Compared to other data-intensive disciplines such as genomics, public deposition and storage of MS-based proteomics, data are still less developed due to, among other reasons, the inherent complexity of the data and the variety of data types and experimental workflows. In order to address this need, several public repositories for MS proteomics experiments have been developed, each with different purposes in mind. The most established resources are the Global Proteome Machine Database (GPMDB), PeptideAtlas, and the PRIDE database. Additionally, there are other useful (in many cases recently developed) resources such as ProteomicsDB, Mass Spectrometry Interactive Virtual Environment (MassIVE), Chorus, MaxQB, PeptideAtlas SRM Experiment Library (PASSEL), Model Organism Protein Expression Database (MOPED), and the Human Proteinpedia. In addition, the ProteomeXchange consortium has been recently developed to enable better integration of public repositories and the coordinated sharing of proteomics information, maximizing its benefit to the scientific community. Here, we will review each of the major proteomics resources independently and some tools that enable the integration, mining and reuse of the data. We will also discuss some of the major challenges and current pitfalls in the integration and sharing of the data.
format Online
Article
Text
id pubmed-4409848
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BlackWell Publishing Ltd
record_format MEDLINE/PubMed
spelling pubmed-44098482015-04-29 Making proteomics data accessible and reusable: Current state of proteomics databases and repositories Perez-Riverol, Yasset Alpi, Emanuele Wang, Rui Hermjakob, Henning Vizcaíno, Juan Antonio Proteomics Bioinformatics Compared to other data-intensive disciplines such as genomics, public deposition and storage of MS-based proteomics, data are still less developed due to, among other reasons, the inherent complexity of the data and the variety of data types and experimental workflows. In order to address this need, several public repositories for MS proteomics experiments have been developed, each with different purposes in mind. The most established resources are the Global Proteome Machine Database (GPMDB), PeptideAtlas, and the PRIDE database. Additionally, there are other useful (in many cases recently developed) resources such as ProteomicsDB, Mass Spectrometry Interactive Virtual Environment (MassIVE), Chorus, MaxQB, PeptideAtlas SRM Experiment Library (PASSEL), Model Organism Protein Expression Database (MOPED), and the Human Proteinpedia. In addition, the ProteomeXchange consortium has been recently developed to enable better integration of public repositories and the coordinated sharing of proteomics information, maximizing its benefit to the scientific community. Here, we will review each of the major proteomics resources independently and some tools that enable the integration, mining and reuse of the data. We will also discuss some of the major challenges and current pitfalls in the integration and sharing of the data. BlackWell Publishing Ltd 2015-03 2015-03-12 /pmc/articles/PMC4409848/ /pubmed/25158685 http://dx.doi.org/10.1002/pmic.201400302 Text en © 2014 The Authors. PROTEOMICS published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim. http://creativecommons.org/licenses/by/4.0/ This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Bioinformatics
Perez-Riverol, Yasset
Alpi, Emanuele
Wang, Rui
Hermjakob, Henning
Vizcaíno, Juan Antonio
Making proteomics data accessible and reusable: Current state of proteomics databases and repositories
title Making proteomics data accessible and reusable: Current state of proteomics databases and repositories
title_full Making proteomics data accessible and reusable: Current state of proteomics databases and repositories
title_fullStr Making proteomics data accessible and reusable: Current state of proteomics databases and repositories
title_full_unstemmed Making proteomics data accessible and reusable: Current state of proteomics databases and repositories
title_short Making proteomics data accessible and reusable: Current state of proteomics databases and repositories
title_sort making proteomics data accessible and reusable: current state of proteomics databases and repositories
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4409848/
https://www.ncbi.nlm.nih.gov/pubmed/25158685
http://dx.doi.org/10.1002/pmic.201400302
work_keys_str_mv AT perezriverolyasset makingproteomicsdataaccessibleandreusablecurrentstateofproteomicsdatabasesandrepositories
AT alpiemanuele makingproteomicsdataaccessibleandreusablecurrentstateofproteomicsdatabasesandrepositories
AT wangrui makingproteomicsdataaccessibleandreusablecurrentstateofproteomicsdatabasesandrepositories
AT hermjakobhenning makingproteomicsdataaccessibleandreusablecurrentstateofproteomicsdatabasesandrepositories
AT vizcainojuanantonio makingproteomicsdataaccessibleandreusablecurrentstateofproteomicsdatabasesandrepositories