Cargando…

Improved quality control processing of peptide-centric LC-MS proteomics data

Motivation: In the analysis of differential peptide peak intensities (i.e. abundance measures), LC-MS analyses with poor quality peptide abundance data can bias downstream statistical analyses and hence the biological interpretation for an otherwise high-quality dataset. Although considerable effort...

Descripción completa

Detalles Bibliográficos
Autores principales: Matzke, Melissa M., Waters, Katrina M., Metz, Thomas O., Jacobs, Jon M., Sims, Amy C., Baric, Ralph S., Pounds, Joel G., Webb-Robertson, Bobbie-Jo M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3187650/
https://www.ncbi.nlm.nih.gov/pubmed/21852304
http://dx.doi.org/10.1093/bioinformatics/btr479
_version_ 1782213325027803136
author Matzke, Melissa M.
Waters, Katrina M.
Metz, Thomas O.
Jacobs, Jon M.
Sims, Amy C.
Baric, Ralph S.
Pounds, Joel G.
Webb-Robertson, Bobbie-Jo M.
author_facet Matzke, Melissa M.
Waters, Katrina M.
Metz, Thomas O.
Jacobs, Jon M.
Sims, Amy C.
Baric, Ralph S.
Pounds, Joel G.
Webb-Robertson, Bobbie-Jo M.
author_sort Matzke, Melissa M.
collection PubMed
description Motivation: In the analysis of differential peptide peak intensities (i.e. abundance measures), LC-MS analyses with poor quality peptide abundance data can bias downstream statistical analyses and hence the biological interpretation for an otherwise high-quality dataset. Although considerable effort has been placed on assuring the quality of the peptide identification with respect to spectral processing, to date quality assessment of the subsequent peptide abundance data matrix has been limited to a subjective visual inspection of run-by-run correlation or individual peptide components. Identifying statistical outliers is a critical step in the processing of proteomics data as many of the downstream statistical analyses [e.g. analysis of variance (ANOVA)] rely upon accurate estimates of sample variance, and their results are influenced by extreme values. Results: We describe a novel multivariate statistical strategy for the identification of LC-MS runs with extreme peptide abundance distributions. Comparison with current method (run-by-run correlation) demonstrates a significantly better rate of identification of outlier runs by the multivariate strategy. Simulation studies also suggest that this strategy significantly outperforms correlation alone in the identification of statistically extreme liquid chromatography-mass spectrometry (LC-MS) runs. Availability: https://www.biopilot.org/docs/Software/RMD.php Contact: bj@pnl.gov Supplementary information: Supplementary material is available at Bioinformatics online.
format Online
Article
Text
id pubmed-3187650
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-31876502011-10-05 Improved quality control processing of peptide-centric LC-MS proteomics data Matzke, Melissa M. Waters, Katrina M. Metz, Thomas O. Jacobs, Jon M. Sims, Amy C. Baric, Ralph S. Pounds, Joel G. Webb-Robertson, Bobbie-Jo M. Bioinformatics Original Papers Motivation: In the analysis of differential peptide peak intensities (i.e. abundance measures), LC-MS analyses with poor quality peptide abundance data can bias downstream statistical analyses and hence the biological interpretation for an otherwise high-quality dataset. Although considerable effort has been placed on assuring the quality of the peptide identification with respect to spectral processing, to date quality assessment of the subsequent peptide abundance data matrix has been limited to a subjective visual inspection of run-by-run correlation or individual peptide components. Identifying statistical outliers is a critical step in the processing of proteomics data as many of the downstream statistical analyses [e.g. analysis of variance (ANOVA)] rely upon accurate estimates of sample variance, and their results are influenced by extreme values. Results: We describe a novel multivariate statistical strategy for the identification of LC-MS runs with extreme peptide abundance distributions. Comparison with current method (run-by-run correlation) demonstrates a significantly better rate of identification of outlier runs by the multivariate strategy. Simulation studies also suggest that this strategy significantly outperforms correlation alone in the identification of statistically extreme liquid chromatography-mass spectrometry (LC-MS) runs. Availability: https://www.biopilot.org/docs/Software/RMD.php Contact: bj@pnl.gov Supplementary information: Supplementary material is available at Bioinformatics online. Oxford University Press 2011-10-15 2011-08-18 /pmc/articles/PMC3187650/ /pubmed/21852304 http://dx.doi.org/10.1093/bioinformatics/btr479 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Matzke, Melissa M.
Waters, Katrina M.
Metz, Thomas O.
Jacobs, Jon M.
Sims, Amy C.
Baric, Ralph S.
Pounds, Joel G.
Webb-Robertson, Bobbie-Jo M.
Improved quality control processing of peptide-centric LC-MS proteomics data
title Improved quality control processing of peptide-centric LC-MS proteomics data
title_full Improved quality control processing of peptide-centric LC-MS proteomics data
title_fullStr Improved quality control processing of peptide-centric LC-MS proteomics data
title_full_unstemmed Improved quality control processing of peptide-centric LC-MS proteomics data
title_short Improved quality control processing of peptide-centric LC-MS proteomics data
title_sort improved quality control processing of peptide-centric lc-ms proteomics data
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3187650/
https://www.ncbi.nlm.nih.gov/pubmed/21852304
http://dx.doi.org/10.1093/bioinformatics/btr479
work_keys_str_mv AT matzkemelissam improvedqualitycontrolprocessingofpeptidecentriclcmsproteomicsdata
AT waterskatrinam improvedqualitycontrolprocessingofpeptidecentriclcmsproteomicsdata
AT metzthomaso improvedqualitycontrolprocessingofpeptidecentriclcmsproteomicsdata
AT jacobsjonm improvedqualitycontrolprocessingofpeptidecentriclcmsproteomicsdata
AT simsamyc improvedqualitycontrolprocessingofpeptidecentriclcmsproteomicsdata
AT baricralphs improvedqualitycontrolprocessingofpeptidecentriclcmsproteomicsdata
AT poundsjoelg improvedqualitycontrolprocessingofpeptidecentriclcmsproteomicsdata
AT webbrobertsonbobbiejom improvedqualitycontrolprocessingofpeptidecentriclcmsproteomicsdata