Cargando…

A high confidence, manually validated human blood plasma protein reference set

BACKGROUND: The immense diagnostic potential of human plasma has prompted great interest and effort in cataloging its contents, exemplified by the Human Proteome Organization (HUPO) Plasma Proteome Project (PPP) pilot project. Due to challenges in obtaining a reliable blood plasma protein list, HUPO...

Descripción completa

Detalles Bibliográficos
Autores principales: Schenk, Susann, Schoenhals, Gary J, de Souza, Gustavo, Mann, Matthias
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2563020/
https://www.ncbi.nlm.nih.gov/pubmed/18793429
http://dx.doi.org/10.1186/1755-8794-1-41
_version_ 1782159786410770432
author Schenk, Susann
Schoenhals, Gary J
de Souza, Gustavo
Mann, Matthias
author_facet Schenk, Susann
Schoenhals, Gary J
de Souza, Gustavo
Mann, Matthias
author_sort Schenk, Susann
collection PubMed
description BACKGROUND: The immense diagnostic potential of human plasma has prompted great interest and effort in cataloging its contents, exemplified by the Human Proteome Organization (HUPO) Plasma Proteome Project (PPP) pilot project. Due to challenges in obtaining a reliable blood plasma protein list, HUPO later re-analysed their own original dataset with a more stringent statistical treatment that resulted in a much reduced list of high confidence (at least 95%) proteins compared with their original findings. In order to facilitate the discovery of novel biomarkers in the future and to realize the full diagnostic potential of blood plasma, we feel that there is still a need for an ultra-high confidence reference list (at least 99% confidence) of blood plasma proteins. METHODS: To address the complexity and dynamic protein concentration range of the plasma proteome, we employed a linear ion-trap-Fourier transform (LTQ-FT) and a linear ion trap-Orbitrap (LTQ-Orbitrap) for mass spectrometry (MS) analysis. Both instruments allow the measurement of peptide masses in the low ppm range. Furthermore, we employed a statistical score that allows database peptide identification searching using the products of two consecutive stages of tandem mass spectrometry (MS3). The combination of MS3 with very high mass accuracy in the parent peptide allows peptide identification with orders of magnitude more confidence than that typically achieved. RESULTS: Herein we established a high confidence set of 697 blood plasma proteins and achieved a high 'average sequence coverage' of more than 14 peptides per protein and a median of 6 peptides per protein. All proteins annotated as belonging to the immunoglobulin family as well as all hypothetical proteins whose peptides completely matched immunoglobulin sequences were excluded from this protein list. We also compared the results of using two high-end MS instruments as well as the use of various peptide and protein separation approaches. Furthermore, we characterized the plasma proteins using cellular localization information, as well as comparing our list of proteins to data from other sources, including the HUPO PPP dataset. CONCLUSION: Superior instrumentation combined with rigorous validation criteria gave rise to a set of 697 plasma proteins in which we have very high confidence, demonstrated by an exceptionally low false peptide identification rate of 0.29%.
format Text
id pubmed-2563020
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25630202008-10-08 A high confidence, manually validated human blood plasma protein reference set Schenk, Susann Schoenhals, Gary J de Souza, Gustavo Mann, Matthias BMC Med Genomics Research Article BACKGROUND: The immense diagnostic potential of human plasma has prompted great interest and effort in cataloging its contents, exemplified by the Human Proteome Organization (HUPO) Plasma Proteome Project (PPP) pilot project. Due to challenges in obtaining a reliable blood plasma protein list, HUPO later re-analysed their own original dataset with a more stringent statistical treatment that resulted in a much reduced list of high confidence (at least 95%) proteins compared with their original findings. In order to facilitate the discovery of novel biomarkers in the future and to realize the full diagnostic potential of blood plasma, we feel that there is still a need for an ultra-high confidence reference list (at least 99% confidence) of blood plasma proteins. METHODS: To address the complexity and dynamic protein concentration range of the plasma proteome, we employed a linear ion-trap-Fourier transform (LTQ-FT) and a linear ion trap-Orbitrap (LTQ-Orbitrap) for mass spectrometry (MS) analysis. Both instruments allow the measurement of peptide masses in the low ppm range. Furthermore, we employed a statistical score that allows database peptide identification searching using the products of two consecutive stages of tandem mass spectrometry (MS3). The combination of MS3 with very high mass accuracy in the parent peptide allows peptide identification with orders of magnitude more confidence than that typically achieved. RESULTS: Herein we established a high confidence set of 697 blood plasma proteins and achieved a high 'average sequence coverage' of more than 14 peptides per protein and a median of 6 peptides per protein. All proteins annotated as belonging to the immunoglobulin family as well as all hypothetical proteins whose peptides completely matched immunoglobulin sequences were excluded from this protein list. We also compared the results of using two high-end MS instruments as well as the use of various peptide and protein separation approaches. Furthermore, we characterized the plasma proteins using cellular localization information, as well as comparing our list of proteins to data from other sources, including the HUPO PPP dataset. CONCLUSION: Superior instrumentation combined with rigorous validation criteria gave rise to a set of 697 plasma proteins in which we have very high confidence, demonstrated by an exceptionally low false peptide identification rate of 0.29%. BioMed Central 2008-09-15 /pmc/articles/PMC2563020/ /pubmed/18793429 http://dx.doi.org/10.1186/1755-8794-1-41 Text en Copyright © 2008 Schenk et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Schenk, Susann
Schoenhals, Gary J
de Souza, Gustavo
Mann, Matthias
A high confidence, manually validated human blood plasma protein reference set
title A high confidence, manually validated human blood plasma protein reference set
title_full A high confidence, manually validated human blood plasma protein reference set
title_fullStr A high confidence, manually validated human blood plasma protein reference set
title_full_unstemmed A high confidence, manually validated human blood plasma protein reference set
title_short A high confidence, manually validated human blood plasma protein reference set
title_sort high confidence, manually validated human blood plasma protein reference set
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2563020/
https://www.ncbi.nlm.nih.gov/pubmed/18793429
http://dx.doi.org/10.1186/1755-8794-1-41
work_keys_str_mv AT schenksusann ahighconfidencemanuallyvalidatedhumanbloodplasmaproteinreferenceset
AT schoenhalsgaryj ahighconfidencemanuallyvalidatedhumanbloodplasmaproteinreferenceset
AT desouzagustavo ahighconfidencemanuallyvalidatedhumanbloodplasmaproteinreferenceset
AT mannmatthias ahighconfidencemanuallyvalidatedhumanbloodplasmaproteinreferenceset
AT schenksusann highconfidencemanuallyvalidatedhumanbloodplasmaproteinreferenceset
AT schoenhalsgaryj highconfidencemanuallyvalidatedhumanbloodplasmaproteinreferenceset
AT desouzagustavo highconfidencemanuallyvalidatedhumanbloodplasmaproteinreferenceset
AT mannmatthias highconfidencemanuallyvalidatedhumanbloodplasmaproteinreferenceset