Cargando…

Covariation of Peptide Abundances Accurately Reflects Protein Concentration Differences

Most implementations of mass spectrometry-based proteomics involve enzymatic digestion of proteins, expanding the analysis to multiple proteolytic peptides for each protein. Currently, there is no consensus of how to summarize peptides' abundances to protein concentrations, and such efforts are...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Bo, Pirmoradian, Mohammad, Zubarev, Roman, Käll, Lukas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The American Society for Biochemistry and Molecular Biology 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5417831/
https://www.ncbi.nlm.nih.gov/pubmed/28302922
http://dx.doi.org/10.1074/mcp.O117.067728
_version_ 1783233962348380160
author Zhang, Bo
Pirmoradian, Mohammad
Zubarev, Roman
Käll, Lukas
author_facet Zhang, Bo
Pirmoradian, Mohammad
Zubarev, Roman
Käll, Lukas
author_sort Zhang, Bo
collection PubMed
description Most implementations of mass spectrometry-based proteomics involve enzymatic digestion of proteins, expanding the analysis to multiple proteolytic peptides for each protein. Currently, there is no consensus of how to summarize peptides' abundances to protein concentrations, and such efforts are complicated by the fact that error control normally is applied to the identification process, and do not directly control errors linking peptide abundance measures to protein concentration. Peptides resulting from suboptimal digestion or being partially modified are not representative of the protein concentration. Without a mechanism to remove such unrepresentative peptides, their abundance adversely impacts the estimation of their protein's concentration. Here, we present a relative quantification approach, Diffacto, that applies factor analysis to extract the covariation of peptides' abundances. The method enables a weighted geometrical average summarization and automatic elimination of incoherent peptides. We demonstrate, based on a set of controlled label-free experiments using standard mixtures of proteins, that the covariation structure extracted by the factor analysis accurately reflects protein concentrations. In the 1% peptide-spectrum match-level FDR data set, as many as 11% of the peptides have abundance differences incoherent with the other peptides attributed to the same protein. If not controlled, such contradicting peptide abundance have a severe impact on protein quantifications. When adding the quantities of each protein's three most abundant peptides, we note as many as 14% of the proteins being estimated as having a negative correlation with their actual concentration differences between samples. Diffacto reduced the amount of such obviously incorrectly quantified proteins to 1.6%. Furthermore, by analyzing clinical data sets from two breast cancer studies, our method revealed the persistent proteomic signatures linked to three subtypes of breast cancer. We conclude that Diffacto can facilitate the interpretation and enhance the utility of most types of proteomics data.
format Online
Article
Text
id pubmed-5417831
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher The American Society for Biochemistry and Molecular Biology
record_format MEDLINE/PubMed
spelling pubmed-54178312017-05-08 Covariation of Peptide Abundances Accurately Reflects Protein Concentration Differences Zhang, Bo Pirmoradian, Mohammad Zubarev, Roman Käll, Lukas Mol Cell Proteomics Technological Innovation and Resources Most implementations of mass spectrometry-based proteomics involve enzymatic digestion of proteins, expanding the analysis to multiple proteolytic peptides for each protein. Currently, there is no consensus of how to summarize peptides' abundances to protein concentrations, and such efforts are complicated by the fact that error control normally is applied to the identification process, and do not directly control errors linking peptide abundance measures to protein concentration. Peptides resulting from suboptimal digestion or being partially modified are not representative of the protein concentration. Without a mechanism to remove such unrepresentative peptides, their abundance adversely impacts the estimation of their protein's concentration. Here, we present a relative quantification approach, Diffacto, that applies factor analysis to extract the covariation of peptides' abundances. The method enables a weighted geometrical average summarization and automatic elimination of incoherent peptides. We demonstrate, based on a set of controlled label-free experiments using standard mixtures of proteins, that the covariation structure extracted by the factor analysis accurately reflects protein concentrations. In the 1% peptide-spectrum match-level FDR data set, as many as 11% of the peptides have abundance differences incoherent with the other peptides attributed to the same protein. If not controlled, such contradicting peptide abundance have a severe impact on protein quantifications. When adding the quantities of each protein's three most abundant peptides, we note as many as 14% of the proteins being estimated as having a negative correlation with their actual concentration differences between samples. Diffacto reduced the amount of such obviously incorrectly quantified proteins to 1.6%. Furthermore, by analyzing clinical data sets from two breast cancer studies, our method revealed the persistent proteomic signatures linked to three subtypes of breast cancer. We conclude that Diffacto can facilitate the interpretation and enhance the utility of most types of proteomics data. The American Society for Biochemistry and Molecular Biology 2017-05 2017-03-16 /pmc/articles/PMC5417831/ /pubmed/28302922 http://dx.doi.org/10.1074/mcp.O117.067728 Text en © 2017 by The American Society for Biochemistry and Molecular Biology, Inc. Author's Choice—Final version free via Creative Commons CC-BY license (http://creativecommons.org/licenses/by/4.0) .
spellingShingle Technological Innovation and Resources
Zhang, Bo
Pirmoradian, Mohammad
Zubarev, Roman
Käll, Lukas
Covariation of Peptide Abundances Accurately Reflects Protein Concentration Differences
title Covariation of Peptide Abundances Accurately Reflects Protein Concentration Differences
title_full Covariation of Peptide Abundances Accurately Reflects Protein Concentration Differences
title_fullStr Covariation of Peptide Abundances Accurately Reflects Protein Concentration Differences
title_full_unstemmed Covariation of Peptide Abundances Accurately Reflects Protein Concentration Differences
title_short Covariation of Peptide Abundances Accurately Reflects Protein Concentration Differences
title_sort covariation of peptide abundances accurately reflects protein concentration differences
topic Technological Innovation and Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5417831/
https://www.ncbi.nlm.nih.gov/pubmed/28302922
http://dx.doi.org/10.1074/mcp.O117.067728
work_keys_str_mv AT zhangbo covariationofpeptideabundancesaccuratelyreflectsproteinconcentrationdifferences
AT pirmoradianmohammad covariationofpeptideabundancesaccuratelyreflectsproteinconcentrationdifferences
AT zubarevroman covariationofpeptideabundancesaccuratelyreflectsproteinconcentrationdifferences
AT kalllukas covariationofpeptideabundancesaccuratelyreflectsproteinconcentrationdifferences