Cargando…

Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0

Percolator is a widely used software tool that increases yield in shotgun proteomics experiments and assigns reliable statistical confidence measures, such as q values and posterior error probabilities, to peptides and peptide-spectrum matches (PSMs) from such experiments. Percolator’s processing sp...

Descripción completa

Detalles Bibliográficos
Autores principales: The, Matthew, MacCoss, Michael J., Noble, William S., Käll, Lukas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5059416/
https://www.ncbi.nlm.nih.gov/pubmed/27572102
http://dx.doi.org/10.1007/s13361-016-1460-7
_version_ 1782459401253158912
author The, Matthew
MacCoss, Michael J.
Noble, William S.
Käll, Lukas
author_facet The, Matthew
MacCoss, Michael J.
Noble, William S.
Käll, Lukas
author_sort The, Matthew
collection PubMed
description Percolator is a widely used software tool that increases yield in shotgun proteomics experiments and assigns reliable statistical confidence measures, such as q values and posterior error probabilities, to peptides and peptide-spectrum matches (PSMs) from such experiments. Percolator’s processing speed has been sufficient for typical data sets consisting of hundreds of thousands of PSMs. With our new scalable approach, we can now also analyze millions of PSMs in a matter of minutes on a commodity computer. Furthermore, with the increasing awareness for the need for reliable statistics on the protein level, we compared several easy-to-understand protein inference methods and implemented the best-performing method—grouping proteins by their corresponding sets of theoretical peptides and then considering only the best-scoring peptide for each protein—in the Percolator package. We used Percolator 3.0 to analyze the data from a recent study of the draft human proteome containing 25 million spectra (PM:24870542). The source code and Ubuntu, Windows, MacOS, and Fedora binary packages are available from http://percolator.ms/ under an Apache 2.0 license. [Figure: see text]
format Online
Article
Text
id pubmed-5059416
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-50594162016-10-26 Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0 The, Matthew MacCoss, Michael J. Noble, William S. Käll, Lukas J Am Soc Mass Spectrom Focus: Bioinformatics, Software, and MS-Based "Omics": Research Article Percolator is a widely used software tool that increases yield in shotgun proteomics experiments and assigns reliable statistical confidence measures, such as q values and posterior error probabilities, to peptides and peptide-spectrum matches (PSMs) from such experiments. Percolator’s processing speed has been sufficient for typical data sets consisting of hundreds of thousands of PSMs. With our new scalable approach, we can now also analyze millions of PSMs in a matter of minutes on a commodity computer. Furthermore, with the increasing awareness for the need for reliable statistics on the protein level, we compared several easy-to-understand protein inference methods and implemented the best-performing method—grouping proteins by their corresponding sets of theoretical peptides and then considering only the best-scoring peptide for each protein—in the Percolator package. We used Percolator 3.0 to analyze the data from a recent study of the draft human proteome containing 25 million spectra (PM:24870542). The source code and Ubuntu, Windows, MacOS, and Fedora binary packages are available from http://percolator.ms/ under an Apache 2.0 license. [Figure: see text] Springer US 2016-08-29 2016 /pmc/articles/PMC5059416/ /pubmed/27572102 http://dx.doi.org/10.1007/s13361-016-1460-7 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Focus: Bioinformatics, Software, and MS-Based "Omics": Research Article
The, Matthew
MacCoss, Michael J.
Noble, William S.
Käll, Lukas
Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0
title Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0
title_full Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0
title_fullStr Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0
title_full_unstemmed Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0
title_short Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0
title_sort fast and accurate protein false discovery rates on large-scale proteomics data sets with percolator 3.0
topic Focus: Bioinformatics, Software, and MS-Based "Omics": Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5059416/
https://www.ncbi.nlm.nih.gov/pubmed/27572102
http://dx.doi.org/10.1007/s13361-016-1460-7
work_keys_str_mv AT thematthew fastandaccurateproteinfalsediscoveryratesonlargescaleproteomicsdatasetswithpercolator30
AT maccossmichaelj fastandaccurateproteinfalsediscoveryratesonlargescaleproteomicsdatasetswithpercolator30
AT noblewilliams fastandaccurateproteinfalsediscoveryratesonlargescaleproteomicsdatasetswithpercolator30
AT kalllukas fastandaccurateproteinfalsediscoveryratesonlargescaleproteomicsdatasetswithpercolator30