Cargando…
QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted through Multivariate Statistics
[Image: see text] Shotgun proteomics experiments integrate a complex sequence of processes, any of which can introduce variability. Quality metrics computed from LC-MS/MS data have relied upon identifying MS/MS scans, but a new mode for the QuaMeter software produces metrics that are independent of...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical
Society
2014
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3982976/ https://www.ncbi.nlm.nih.gov/pubmed/24494671 http://dx.doi.org/10.1021/ac4034455 |
_version_ | 1782311234005106688 |
---|---|
author | Wang, Xia Chambers, Matthew C. Vega-Montoto, Lorenzo J. Bunk, David M. Stein, Stephen E. Tabb, David L. |
author_facet | Wang, Xia Chambers, Matthew C. Vega-Montoto, Lorenzo J. Bunk, David M. Stein, Stephen E. Tabb, David L. |
author_sort | Wang, Xia |
collection | PubMed |
description | [Image: see text] Shotgun proteomics experiments integrate a complex sequence of processes, any of which can introduce variability. Quality metrics computed from LC-MS/MS data have relied upon identifying MS/MS scans, but a new mode for the QuaMeter software produces metrics that are independent of identifications. Rather than evaluating each metric independently, we have created a robust multivariate statistical toolkit that accommodates the correlation structure of these metrics and allows for hierarchical relationships among data sets. The framework enables visualization and structural assessment of variability. Study 1 for the Clinical Proteomics Technology Assessment for Cancer (CPTAC), which analyzed three replicates of two common samples at each of two time points among 23 mass spectrometers in nine laboratories, provided the data to demonstrate this framework, and CPTAC Study 5 provided data from complex lysates under Standard Operating Procedures (SOPs) to complement these findings. Identification-independent quality metrics enabled the differentiation of sites and run-times through robust principal components analysis and subsequent factor analysis. Dissimilarity metrics revealed outliers in performance, and a nested ANOVA model revealed the extent to which all metrics or individual metrics were impacted by mass spectrometer and run time. Study 5 data revealed that even when SOPs have been applied, instrument-dependent variability remains prominent, although it may be reduced, while within-site variability is reduced significantly. Finally, identification-independent quality metrics were shown to be predictive of identification sensitivity in these data sets. QuaMeter and the associated multivariate framework are available from http://fenchurch.mc.vanderbilt.edu and http://homepages.uc.edu/~wang2x7/, respectively. |
format | Online Article Text |
id | pubmed-3982976 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | American Chemical
Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-39829762015-02-04 QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted through Multivariate Statistics Wang, Xia Chambers, Matthew C. Vega-Montoto, Lorenzo J. Bunk, David M. Stein, Stephen E. Tabb, David L. Anal Chem [Image: see text] Shotgun proteomics experiments integrate a complex sequence of processes, any of which can introduce variability. Quality metrics computed from LC-MS/MS data have relied upon identifying MS/MS scans, but a new mode for the QuaMeter software produces metrics that are independent of identifications. Rather than evaluating each metric independently, we have created a robust multivariate statistical toolkit that accommodates the correlation structure of these metrics and allows for hierarchical relationships among data sets. The framework enables visualization and structural assessment of variability. Study 1 for the Clinical Proteomics Technology Assessment for Cancer (CPTAC), which analyzed three replicates of two common samples at each of two time points among 23 mass spectrometers in nine laboratories, provided the data to demonstrate this framework, and CPTAC Study 5 provided data from complex lysates under Standard Operating Procedures (SOPs) to complement these findings. Identification-independent quality metrics enabled the differentiation of sites and run-times through robust principal components analysis and subsequent factor analysis. Dissimilarity metrics revealed outliers in performance, and a nested ANOVA model revealed the extent to which all metrics or individual metrics were impacted by mass spectrometer and run time. Study 5 data revealed that even when SOPs have been applied, instrument-dependent variability remains prominent, although it may be reduced, while within-site variability is reduced significantly. Finally, identification-independent quality metrics were shown to be predictive of identification sensitivity in these data sets. QuaMeter and the associated multivariate framework are available from http://fenchurch.mc.vanderbilt.edu and http://homepages.uc.edu/~wang2x7/, respectively. American Chemical Society 2014-02-04 2014-03-04 /pmc/articles/PMC3982976/ /pubmed/24494671 http://dx.doi.org/10.1021/ac4034455 Text en Copyright © 2014 American Chemical Society |
spellingShingle | Wang, Xia Chambers, Matthew C. Vega-Montoto, Lorenzo J. Bunk, David M. Stein, Stephen E. Tabb, David L. QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted through Multivariate Statistics |
title | QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted
through Multivariate Statistics |
title_full | QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted
through Multivariate Statistics |
title_fullStr | QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted
through Multivariate Statistics |
title_full_unstemmed | QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted
through Multivariate Statistics |
title_short | QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted
through Multivariate Statistics |
title_sort | qc metrics from cptac raw lc-ms/ms data interpreted
through multivariate statistics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3982976/ https://www.ncbi.nlm.nih.gov/pubmed/24494671 http://dx.doi.org/10.1021/ac4034455 |
work_keys_str_mv | AT wangxia qcmetricsfromcptacrawlcmsmsdatainterpretedthroughmultivariatestatistics AT chambersmatthewc qcmetricsfromcptacrawlcmsmsdatainterpretedthroughmultivariatestatistics AT vegamontotolorenzoj qcmetricsfromcptacrawlcmsmsdatainterpretedthroughmultivariatestatistics AT bunkdavidm qcmetricsfromcptacrawlcmsmsdatainterpretedthroughmultivariatestatistics AT steinstephene qcmetricsfromcptacrawlcmsmsdatainterpretedthroughmultivariatestatistics AT tabbdavidl qcmetricsfromcptacrawlcmsmsdatainterpretedthroughmultivariatestatistics |