Cargando…
MsImpute: Estimation of Missing Peptide Intensity Data in Label-Free Quantitative Mass Spectrometry
Mass spectrometry (MS) enables high-throughput identification and quantification of proteins in complex biological samples and can provide insights into the global function of biological systems. Label-free quantification is cost-effective and suitable for the analysis of human samples. Despite rapi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Society for Biochemistry and Molecular Biology
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368900/ https://www.ncbi.nlm.nih.gov/pubmed/37105364 http://dx.doi.org/10.1016/j.mcpro.2023.100558 |
_version_ | 1785077605052973056 |
---|---|
author | Hediyeh-Zadeh, Soroor Webb, Andrew I. Davis, Melissa J. |
author_facet | Hediyeh-Zadeh, Soroor Webb, Andrew I. Davis, Melissa J. |
author_sort | Hediyeh-Zadeh, Soroor |
collection | PubMed |
description | Mass spectrometry (MS) enables high-throughput identification and quantification of proteins in complex biological samples and can provide insights into the global function of biological systems. Label-free quantification is cost-effective and suitable for the analysis of human samples. Despite rapid developments in label-free data acquisition workflows, the number of proteins quantified across samples can be limited by technical and biological variability. This variation can result in missing values which can in turn challenge downstream data analysis tasks. General purpose or gene expression-specific imputation algorithms are widely used to improve data completeness. Here, we propose an imputation algorithm designated for label-free MS data that is aware of the type of missingness affecting data. On published datasets acquired by data-dependent and data-independent acquisition workflows with variable degrees of biological complexity, we demonstrate that the proposed missing value estimation procedure by barycenter computation competes closely with the state-of-the-art imputation algorithms in differential abundance tasks while outperforming them in the accuracy of variance estimates of the peptide abundance measurements, and better controls the false discovery rate in label-free MS experiments. The barycenter estimation procedure is implemented in the msImpute software package and is available from the Bioconductor repository. |
format | Online Article Text |
id | pubmed-10368900 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | American Society for Biochemistry and Molecular Biology |
record_format | MEDLINE/PubMed |
spelling | pubmed-103689002023-07-27 MsImpute: Estimation of Missing Peptide Intensity Data in Label-Free Quantitative Mass Spectrometry Hediyeh-Zadeh, Soroor Webb, Andrew I. Davis, Melissa J. Mol Cell Proteomics Technological Innovation and Resources Mass spectrometry (MS) enables high-throughput identification and quantification of proteins in complex biological samples and can provide insights into the global function of biological systems. Label-free quantification is cost-effective and suitable for the analysis of human samples. Despite rapid developments in label-free data acquisition workflows, the number of proteins quantified across samples can be limited by technical and biological variability. This variation can result in missing values which can in turn challenge downstream data analysis tasks. General purpose or gene expression-specific imputation algorithms are widely used to improve data completeness. Here, we propose an imputation algorithm designated for label-free MS data that is aware of the type of missingness affecting data. On published datasets acquired by data-dependent and data-independent acquisition workflows with variable degrees of biological complexity, we demonstrate that the proposed missing value estimation procedure by barycenter computation competes closely with the state-of-the-art imputation algorithms in differential abundance tasks while outperforming them in the accuracy of variance estimates of the peptide abundance measurements, and better controls the false discovery rate in label-free MS experiments. The barycenter estimation procedure is implemented in the msImpute software package and is available from the Bioconductor repository. American Society for Biochemistry and Molecular Biology 2023-04-25 /pmc/articles/PMC10368900/ /pubmed/37105364 http://dx.doi.org/10.1016/j.mcpro.2023.100558 Text en © 2023 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Technological Innovation and Resources Hediyeh-Zadeh, Soroor Webb, Andrew I. Davis, Melissa J. MsImpute: Estimation of Missing Peptide Intensity Data in Label-Free Quantitative Mass Spectrometry |
title | MsImpute: Estimation of Missing Peptide Intensity Data in Label-Free Quantitative Mass Spectrometry |
title_full | MsImpute: Estimation of Missing Peptide Intensity Data in Label-Free Quantitative Mass Spectrometry |
title_fullStr | MsImpute: Estimation of Missing Peptide Intensity Data in Label-Free Quantitative Mass Spectrometry |
title_full_unstemmed | MsImpute: Estimation of Missing Peptide Intensity Data in Label-Free Quantitative Mass Spectrometry |
title_short | MsImpute: Estimation of Missing Peptide Intensity Data in Label-Free Quantitative Mass Spectrometry |
title_sort | msimpute: estimation of missing peptide intensity data in label-free quantitative mass spectrometry |
topic | Technological Innovation and Resources |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368900/ https://www.ncbi.nlm.nih.gov/pubmed/37105364 http://dx.doi.org/10.1016/j.mcpro.2023.100558 |
work_keys_str_mv | AT hediyehzadehsoroor msimputeestimationofmissingpeptideintensitydatainlabelfreequantitativemassspectrometry AT webbandrewi msimputeestimationofmissingpeptideintensitydatainlabelfreequantitativemassspectrometry AT davismelissaj msimputeestimationofmissingpeptideintensitydatainlabelfreequantitativemassspectrometry |