Cargando…
PolySTest: Robust Statistical Testing of Proteomics Data with Missing Values Improves Detection of Biologically Relevant Features
Statistical testing remains one of the main challenges for high-confidence detection of differentially regulated proteins or peptides in large-scale quantitative proteomics experiments by mass spectrometry. Statistical tests need to be sufficiently robust to deal with experiment intrinsic data struc...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Society for Biochemistry and Molecular Biology
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8015005/ https://www.ncbi.nlm.nih.gov/pubmed/32424025 http://dx.doi.org/10.1074/mcp.RA119.001777 |
_version_ | 1783673597548560384 |
---|---|
author | Schwämmle, Veit Hagensen, Christina E. Rogowska-Wrzesinska, Adelina Jensen, Ole N. |
author_facet | Schwämmle, Veit Hagensen, Christina E. Rogowska-Wrzesinska, Adelina Jensen, Ole N. |
author_sort | Schwämmle, Veit |
collection | PubMed |
description | Statistical testing remains one of the main challenges for high-confidence detection of differentially regulated proteins or peptides in large-scale quantitative proteomics experiments by mass spectrometry. Statistical tests need to be sufficiently robust to deal with experiment intrinsic data structures and variations and often also reduced feature coverage across different biological samples due to ubiquitous missing values. A robust statistical test provides accurate confidence scores of large-scale proteomics results, regardless of instrument platform, experimental protocol and software tools. However, the multitude of different combinations of experimental strategies, mass spectrometry techniques and informatics methods complicate the decision of choosing appropriate statistical approaches. We address this challenge by introducing PolySTest, a user-friendly web service for statistical testing, data browsing and data visualization. We introduce a new method, Miss test, that simultaneously tests for missingness and feature abundance, thereby complementing common statistical tests by rescuing otherwise discarded data features. We demonstrate that PolySTest with integrated Miss test achieves higher confidence and higher sensitivity for artificial and experimental proteomics data sets with known ground truth. Application of PolySTest to mass spectrometry based large-scale proteomics data obtained from differentiating muscle cells resulted in the rescue of 10–20% additional proteins in the identified molecular networks relevant to muscle differentiation. We conclude that PolySTest is a valuable addition to existing tools and instrument enhancements that improve coverage and depth of large-scale proteomics experiments. A fully functional demo version of PolySTest and Miss test is available via http://computproteomics.bmb.sdu.dk/Apps/PolySTest. |
format | Online Article Text |
id | pubmed-8015005 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | American Society for Biochemistry and Molecular Biology |
record_format | MEDLINE/PubMed |
spelling | pubmed-80150052021-04-12 PolySTest: Robust Statistical Testing of Proteomics Data with Missing Values Improves Detection of Biologically Relevant Features Schwämmle, Veit Hagensen, Christina E. Rogowska-Wrzesinska, Adelina Jensen, Ole N. Mol Cell Proteomics Technological Resources and Innovation Statistical testing remains one of the main challenges for high-confidence detection of differentially regulated proteins or peptides in large-scale quantitative proteomics experiments by mass spectrometry. Statistical tests need to be sufficiently robust to deal with experiment intrinsic data structures and variations and often also reduced feature coverage across different biological samples due to ubiquitous missing values. A robust statistical test provides accurate confidence scores of large-scale proteomics results, regardless of instrument platform, experimental protocol and software tools. However, the multitude of different combinations of experimental strategies, mass spectrometry techniques and informatics methods complicate the decision of choosing appropriate statistical approaches. We address this challenge by introducing PolySTest, a user-friendly web service for statistical testing, data browsing and data visualization. We introduce a new method, Miss test, that simultaneously tests for missingness and feature abundance, thereby complementing common statistical tests by rescuing otherwise discarded data features. We demonstrate that PolySTest with integrated Miss test achieves higher confidence and higher sensitivity for artificial and experimental proteomics data sets with known ground truth. Application of PolySTest to mass spectrometry based large-scale proteomics data obtained from differentiating muscle cells resulted in the rescue of 10–20% additional proteins in the identified molecular networks relevant to muscle differentiation. We conclude that PolySTest is a valuable addition to existing tools and instrument enhancements that improve coverage and depth of large-scale proteomics experiments. A fully functional demo version of PolySTest and Miss test is available via http://computproteomics.bmb.sdu.dk/Apps/PolySTest. American Society for Biochemistry and Molecular Biology 2020-11-23 /pmc/articles/PMC8015005/ /pubmed/32424025 http://dx.doi.org/10.1074/mcp.RA119.001777 Text en © 2020 © 2020 Schwämmle et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Technological Resources and Innovation Schwämmle, Veit Hagensen, Christina E. Rogowska-Wrzesinska, Adelina Jensen, Ole N. PolySTest: Robust Statistical Testing of Proteomics Data with Missing Values Improves Detection of Biologically Relevant Features |
title | PolySTest: Robust Statistical Testing of Proteomics Data with Missing Values Improves Detection of Biologically Relevant Features |
title_full | PolySTest: Robust Statistical Testing of Proteomics Data with Missing Values Improves Detection of Biologically Relevant Features |
title_fullStr | PolySTest: Robust Statistical Testing of Proteomics Data with Missing Values Improves Detection of Biologically Relevant Features |
title_full_unstemmed | PolySTest: Robust Statistical Testing of Proteomics Data with Missing Values Improves Detection of Biologically Relevant Features |
title_short | PolySTest: Robust Statistical Testing of Proteomics Data with Missing Values Improves Detection of Biologically Relevant Features |
title_sort | polystest: robust statistical testing of proteomics data with missing values improves detection of biologically relevant features |
topic | Technological Resources and Innovation |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8015005/ https://www.ncbi.nlm.nih.gov/pubmed/32424025 http://dx.doi.org/10.1074/mcp.RA119.001777 |
work_keys_str_mv | AT schwammleveit polystestrobuststatisticaltestingofproteomicsdatawithmissingvaluesimprovesdetectionofbiologicallyrelevantfeatures AT hagensenchristinae polystestrobuststatisticaltestingofproteomicsdatawithmissingvaluesimprovesdetectionofbiologicallyrelevantfeatures AT rogowskawrzesinskaadelina polystestrobuststatisticaltestingofproteomicsdatawithmissingvaluesimprovesdetectionofbiologicallyrelevantfeatures AT jensenolen polystestrobuststatisticaltestingofproteomicsdatawithmissingvaluesimprovesdetectionofbiologicallyrelevantfeatures |