Cargando…

QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted through Multivariate Statistics

[Image: see text] Shotgun proteomics experiments integrate a complex sequence of processes, any of which can introduce variability. Quality metrics computed from LC-MS/MS data have relied upon identifying MS/MS scans, but a new mode for the QuaMeter software produces metrics that are independent of...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xia, Chambers, Matthew C., Vega-Montoto, Lorenzo J., Bunk, David M., Stein, Stephen E., Tabb, David L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2014
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3982976/
https://www.ncbi.nlm.nih.gov/pubmed/24494671
http://dx.doi.org/10.1021/ac4034455
_version_ 1782311234005106688
author Wang, Xia
Chambers, Matthew C.
Vega-Montoto, Lorenzo J.
Bunk, David M.
Stein, Stephen E.
Tabb, David L.
author_facet Wang, Xia
Chambers, Matthew C.
Vega-Montoto, Lorenzo J.
Bunk, David M.
Stein, Stephen E.
Tabb, David L.
author_sort Wang, Xia
collection PubMed
description [Image: see text] Shotgun proteomics experiments integrate a complex sequence of processes, any of which can introduce variability. Quality metrics computed from LC-MS/MS data have relied upon identifying MS/MS scans, but a new mode for the QuaMeter software produces metrics that are independent of identifications. Rather than evaluating each metric independently, we have created a robust multivariate statistical toolkit that accommodates the correlation structure of these metrics and allows for hierarchical relationships among data sets. The framework enables visualization and structural assessment of variability. Study 1 for the Clinical Proteomics Technology Assessment for Cancer (CPTAC), which analyzed three replicates of two common samples at each of two time points among 23 mass spectrometers in nine laboratories, provided the data to demonstrate this framework, and CPTAC Study 5 provided data from complex lysates under Standard Operating Procedures (SOPs) to complement these findings. Identification-independent quality metrics enabled the differentiation of sites and run-times through robust principal components analysis and subsequent factor analysis. Dissimilarity metrics revealed outliers in performance, and a nested ANOVA model revealed the extent to which all metrics or individual metrics were impacted by mass spectrometer and run time. Study 5 data revealed that even when SOPs have been applied, instrument-dependent variability remains prominent, although it may be reduced, while within-site variability is reduced significantly. Finally, identification-independent quality metrics were shown to be predictive of identification sensitivity in these data sets. QuaMeter and the associated multivariate framework are available from http://fenchurch.mc.vanderbilt.edu and http://homepages.uc.edu/~wang2x7/, respectively.
format Online
Article
Text
id pubmed-3982976
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-39829762015-02-04 QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted through Multivariate Statistics Wang, Xia Chambers, Matthew C. Vega-Montoto, Lorenzo J. Bunk, David M. Stein, Stephen E. Tabb, David L. Anal Chem [Image: see text] Shotgun proteomics experiments integrate a complex sequence of processes, any of which can introduce variability. Quality metrics computed from LC-MS/MS data have relied upon identifying MS/MS scans, but a new mode for the QuaMeter software produces metrics that are independent of identifications. Rather than evaluating each metric independently, we have created a robust multivariate statistical toolkit that accommodates the correlation structure of these metrics and allows for hierarchical relationships among data sets. The framework enables visualization and structural assessment of variability. Study 1 for the Clinical Proteomics Technology Assessment for Cancer (CPTAC), which analyzed three replicates of two common samples at each of two time points among 23 mass spectrometers in nine laboratories, provided the data to demonstrate this framework, and CPTAC Study 5 provided data from complex lysates under Standard Operating Procedures (SOPs) to complement these findings. Identification-independent quality metrics enabled the differentiation of sites and run-times through robust principal components analysis and subsequent factor analysis. Dissimilarity metrics revealed outliers in performance, and a nested ANOVA model revealed the extent to which all metrics or individual metrics were impacted by mass spectrometer and run time. Study 5 data revealed that even when SOPs have been applied, instrument-dependent variability remains prominent, although it may be reduced, while within-site variability is reduced significantly. Finally, identification-independent quality metrics were shown to be predictive of identification sensitivity in these data sets. QuaMeter and the associated multivariate framework are available from http://fenchurch.mc.vanderbilt.edu and http://homepages.uc.edu/~wang2x7/, respectively. American Chemical Society 2014-02-04 2014-03-04 /pmc/articles/PMC3982976/ /pubmed/24494671 http://dx.doi.org/10.1021/ac4034455 Text en Copyright © 2014 American Chemical Society
spellingShingle Wang, Xia
Chambers, Matthew C.
Vega-Montoto, Lorenzo J.
Bunk, David M.
Stein, Stephen E.
Tabb, David L.
QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted through Multivariate Statistics
title QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted through Multivariate Statistics
title_full QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted through Multivariate Statistics
title_fullStr QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted through Multivariate Statistics
title_full_unstemmed QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted through Multivariate Statistics
title_short QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted through Multivariate Statistics
title_sort qc metrics from cptac raw lc-ms/ms data interpreted through multivariate statistics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3982976/
https://www.ncbi.nlm.nih.gov/pubmed/24494671
http://dx.doi.org/10.1021/ac4034455
work_keys_str_mv AT wangxia qcmetricsfromcptacrawlcmsmsdatainterpretedthroughmultivariatestatistics
AT chambersmatthewc qcmetricsfromcptacrawlcmsmsdatainterpretedthroughmultivariatestatistics
AT vegamontotolorenzoj qcmetricsfromcptacrawlcmsmsdatainterpretedthroughmultivariatestatistics
AT bunkdavidm qcmetricsfromcptacrawlcmsmsdatainterpretedthroughmultivariatestatistics
AT steinstephene qcmetricsfromcptacrawlcmsmsdatainterpretedthroughmultivariatestatistics
AT tabbdavidl qcmetricsfromcptacrawlcmsmsdatainterpretedthroughmultivariatestatistics