Cargando…

Accounting for multiple imputation-induced variability for differential analysis in mass spectrometry-based label-free quantitative proteomics

Imputing missing values is common practice in label-free quantitative proteomics. Imputation aims at replacing a missing value with a user-defined one. However, the imputation itself may not be optimally considered downstream of the imputation process, as imputed datasets are often considered as if...

Descripción completa

Detalles Bibliográficos
Autores principales: Chion, Marie, Carapito, Christine, Bertrand, Frédéric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9462777/
https://www.ncbi.nlm.nih.gov/pubmed/36037245
http://dx.doi.org/10.1371/journal.pcbi.1010420
_version_ 1784787265035173888
author Chion, Marie
Carapito, Christine
Bertrand, Frédéric
author_facet Chion, Marie
Carapito, Christine
Bertrand, Frédéric
author_sort Chion, Marie
collection PubMed
description Imputing missing values is common practice in label-free quantitative proteomics. Imputation aims at replacing a missing value with a user-defined one. However, the imputation itself may not be optimally considered downstream of the imputation process, as imputed datasets are often considered as if they had always been complete. Hence, the uncertainty due to the imputation is not adequately taken into account. We provide a rigorous multiple imputation strategy, leading to a less biased estimation of the parameters’ variability thanks to Rubin’s rules. The imputation-based peptide’s intensities’ variance estimator is then moderated using Bayesian hierarchical models. This estimator is finally included in moderated t-test statistics to provide differential analyses results. This workflow can be used both at peptide and protein-level in quantification datasets. Indeed, an aggregation step is included for protein-level results based on peptide-level quantification data. Our methodology, named mi4p, was compared to the state-of-the-art limma workflow implemented in the DAPAR R package, both on simulated and real datasets. We observed a trade-off between sensitivity and specificity, while the overall performance of mi4p outperforms DAPAR in terms of F-Score.
format Online
Article
Text
id pubmed-9462777
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-94627772022-09-10 Accounting for multiple imputation-induced variability for differential analysis in mass spectrometry-based label-free quantitative proteomics Chion, Marie Carapito, Christine Bertrand, Frédéric PLoS Comput Biol Research Article Imputing missing values is common practice in label-free quantitative proteomics. Imputation aims at replacing a missing value with a user-defined one. However, the imputation itself may not be optimally considered downstream of the imputation process, as imputed datasets are often considered as if they had always been complete. Hence, the uncertainty due to the imputation is not adequately taken into account. We provide a rigorous multiple imputation strategy, leading to a less biased estimation of the parameters’ variability thanks to Rubin’s rules. The imputation-based peptide’s intensities’ variance estimator is then moderated using Bayesian hierarchical models. This estimator is finally included in moderated t-test statistics to provide differential analyses results. This workflow can be used both at peptide and protein-level in quantification datasets. Indeed, an aggregation step is included for protein-level results based on peptide-level quantification data. Our methodology, named mi4p, was compared to the state-of-the-art limma workflow implemented in the DAPAR R package, both on simulated and real datasets. We observed a trade-off between sensitivity and specificity, while the overall performance of mi4p outperforms DAPAR in terms of F-Score. Public Library of Science 2022-08-29 /pmc/articles/PMC9462777/ /pubmed/36037245 http://dx.doi.org/10.1371/journal.pcbi.1010420 Text en © 2022 Chion et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Chion, Marie
Carapito, Christine
Bertrand, Frédéric
Accounting for multiple imputation-induced variability for differential analysis in mass spectrometry-based label-free quantitative proteomics
title Accounting for multiple imputation-induced variability for differential analysis in mass spectrometry-based label-free quantitative proteomics
title_full Accounting for multiple imputation-induced variability for differential analysis in mass spectrometry-based label-free quantitative proteomics
title_fullStr Accounting for multiple imputation-induced variability for differential analysis in mass spectrometry-based label-free quantitative proteomics
title_full_unstemmed Accounting for multiple imputation-induced variability for differential analysis in mass spectrometry-based label-free quantitative proteomics
title_short Accounting for multiple imputation-induced variability for differential analysis in mass spectrometry-based label-free quantitative proteomics
title_sort accounting for multiple imputation-induced variability for differential analysis in mass spectrometry-based label-free quantitative proteomics
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9462777/
https://www.ncbi.nlm.nih.gov/pubmed/36037245
http://dx.doi.org/10.1371/journal.pcbi.1010420
work_keys_str_mv AT chionmarie accountingformultipleimputationinducedvariabilityfordifferentialanalysisinmassspectrometrybasedlabelfreequantitativeproteomics
AT carapitochristine accountingformultipleimputationinducedvariabilityfordifferentialanalysisinmassspectrometrybasedlabelfreequantitativeproteomics
AT bertrandfrederic accountingformultipleimputationinducedvariabilityfordifferentialanalysisinmassspectrometrybasedlabelfreequantitativeproteomics