Cargando…

DEqMS: A Method for Accurate Variance Estimation in Differential Protein Expression Analysis

Quantitative proteomics by mass spectrometry is widely used in biomarker research and basic biology research for investigation of phenotype level cellular events. Despite the wide application, the methodology for statistical analysis of differentially expressed proteins has not been unified. Various...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Yafeng, Orre, Lukas M., Zhou Tran, Yan, Mermelekas, Georgios, Johansson, Henrik J., Malyutina, Alina, Anders, Simon, Lehtiö, Janne
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The American Society for Biochemistry and Molecular Biology 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7261819/
https://www.ncbi.nlm.nih.gov/pubmed/32205417
http://dx.doi.org/10.1074/mcp.TIR119.001646
_version_ 1783540564273135616
author Zhu, Yafeng
Orre, Lukas M.
Zhou Tran, Yan
Mermelekas, Georgios
Johansson, Henrik J.
Malyutina, Alina
Anders, Simon
Lehtiö, Janne
author_facet Zhu, Yafeng
Orre, Lukas M.
Zhou Tran, Yan
Mermelekas, Georgios
Johansson, Henrik J.
Malyutina, Alina
Anders, Simon
Lehtiö, Janne
author_sort Zhu, Yafeng
collection PubMed
description Quantitative proteomics by mass spectrometry is widely used in biomarker research and basic biology research for investigation of phenotype level cellular events. Despite the wide application, the methodology for statistical analysis of differentially expressed proteins has not been unified. Various methods such as t test, linear model and mixed effect models are used to define changes in proteomics experiments. However, none of these methods consider the specific structure of MS-data. Choices between methods, often originally developed for other types of data, are based on compromises between features such as statistical power, general applicability and user friendliness. Furthermore, whether to include proteins identified with one peptide in statistical analysis of differential protein expression varies between studies. Here we present DEqMS, a robust statistical method developed specifically for differential protein expression analysis in mass spectrometry data. In all data sets investigated there is a clear dependence of variance on the number of PSMs or peptides used for protein quantification. DEqMS takes this feature into account when assessing differential protein expression. This allows for a more accurate data-dependent estimation of protein variance and inclusion of single peptide identifications without increasing false discoveries. The method was tested in several data sets including E. coli proteome spike-in data, using both label-free and TMT-labeled quantification. Compared with previous statistical methods used in quantitative proteomics, DEqMS showed consistently better accuracy in detecting altered protein levels compared with other statistical methods in both label-free and labeled quantitative proteomics data. DEqMS is available as an R package in Bioconductor.
format Online
Article
Text
id pubmed-7261819
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher The American Society for Biochemistry and Molecular Biology
record_format MEDLINE/PubMed
spelling pubmed-72618192020-06-09 DEqMS: A Method for Accurate Variance Estimation in Differential Protein Expression Analysis Zhu, Yafeng Orre, Lukas M. Zhou Tran, Yan Mermelekas, Georgios Johansson, Henrik J. Malyutina, Alina Anders, Simon Lehtiö, Janne Mol Cell Proteomics Technological Innovation and Resources Quantitative proteomics by mass spectrometry is widely used in biomarker research and basic biology research for investigation of phenotype level cellular events. Despite the wide application, the methodology for statistical analysis of differentially expressed proteins has not been unified. Various methods such as t test, linear model and mixed effect models are used to define changes in proteomics experiments. However, none of these methods consider the specific structure of MS-data. Choices between methods, often originally developed for other types of data, are based on compromises between features such as statistical power, general applicability and user friendliness. Furthermore, whether to include proteins identified with one peptide in statistical analysis of differential protein expression varies between studies. Here we present DEqMS, a robust statistical method developed specifically for differential protein expression analysis in mass spectrometry data. In all data sets investigated there is a clear dependence of variance on the number of PSMs or peptides used for protein quantification. DEqMS takes this feature into account when assessing differential protein expression. This allows for a more accurate data-dependent estimation of protein variance and inclusion of single peptide identifications without increasing false discoveries. The method was tested in several data sets including E. coli proteome spike-in data, using both label-free and TMT-labeled quantification. Compared with previous statistical methods used in quantitative proteomics, DEqMS showed consistently better accuracy in detecting altered protein levels compared with other statistical methods in both label-free and labeled quantitative proteomics data. DEqMS is available as an R package in Bioconductor. The American Society for Biochemistry and Molecular Biology 2020-06 2020-03-23 /pmc/articles/PMC7261819/ /pubmed/32205417 http://dx.doi.org/10.1074/mcp.TIR119.001646 Text en © 2020 Zhu et al. Author's Choice—Final version open access under the terms of the Creative Commons CC-BY license (http://creativecommons.org/licenses/by/4.0) .
spellingShingle Technological Innovation and Resources
Zhu, Yafeng
Orre, Lukas M.
Zhou Tran, Yan
Mermelekas, Georgios
Johansson, Henrik J.
Malyutina, Alina
Anders, Simon
Lehtiö, Janne
DEqMS: A Method for Accurate Variance Estimation in Differential Protein Expression Analysis
title DEqMS: A Method for Accurate Variance Estimation in Differential Protein Expression Analysis
title_full DEqMS: A Method for Accurate Variance Estimation in Differential Protein Expression Analysis
title_fullStr DEqMS: A Method for Accurate Variance Estimation in Differential Protein Expression Analysis
title_full_unstemmed DEqMS: A Method for Accurate Variance Estimation in Differential Protein Expression Analysis
title_short DEqMS: A Method for Accurate Variance Estimation in Differential Protein Expression Analysis
title_sort deqms: a method for accurate variance estimation in differential protein expression analysis
topic Technological Innovation and Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7261819/
https://www.ncbi.nlm.nih.gov/pubmed/32205417
http://dx.doi.org/10.1074/mcp.TIR119.001646
work_keys_str_mv AT zhuyafeng deqmsamethodforaccuratevarianceestimationindifferentialproteinexpressionanalysis
AT orrelukasm deqmsamethodforaccuratevarianceestimationindifferentialproteinexpressionanalysis
AT zhoutranyan deqmsamethodforaccuratevarianceestimationindifferentialproteinexpressionanalysis
AT mermelekasgeorgios deqmsamethodforaccuratevarianceestimationindifferentialproteinexpressionanalysis
AT johanssonhenrikj deqmsamethodforaccuratevarianceestimationindifferentialproteinexpressionanalysis
AT malyutinaalina deqmsamethodforaccuratevarianceestimationindifferentialproteinexpressionanalysis
AT anderssimon deqmsamethodforaccuratevarianceestimationindifferentialproteinexpressionanalysis
AT lehtiojanne deqmsamethodforaccuratevarianceestimationindifferentialproteinexpressionanalysis