Cargando…

Multiple Imputation Approaches Applied to the Missing Value Problem in Bottom-Up Proteomics

Analysis of differential abundance in proteomics data sets requires careful application of missing value imputation. Missing abundance values widely vary when performing comparisons across different sample treatments. For example, one would expect a consistent rate of “missing at random” (MAR) acros...

Descripción completa

Detalles Bibliográficos
Autores principales: Gardner, Miranda L., Freitas, Michael A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8431783/
https://www.ncbi.nlm.nih.gov/pubmed/34502557
http://dx.doi.org/10.3390/ijms22179650
_version_ 1783751017417932800
author Gardner, Miranda L.
Freitas, Michael A.
author_facet Gardner, Miranda L.
Freitas, Michael A.
author_sort Gardner, Miranda L.
collection PubMed
description Analysis of differential abundance in proteomics data sets requires careful application of missing value imputation. Missing abundance values widely vary when performing comparisons across different sample treatments. For example, one would expect a consistent rate of “missing at random” (MAR) across batches of samples and varying rates of “missing not at random” (MNAR) depending on the inherent difference in sample treatments within the study. The missing value imputation strategy must thus be selected that best accounts for both MAR and MNAR simultaneously. Several important issues must be considered when deciding the appropriate missing value imputation strategy: (1) when it is appropriate to impute data; (2) how to choose a method that reflects the combinatorial manner of MAR and MNAR that occurs in an experiment. This paper provides an evaluation of missing value imputation strategies used in proteomics and presents a case for the use of hybrid left-censored missing value imputation approaches that can handle the MNAR problem common to proteomics data.
format Online
Article
Text
id pubmed-8431783
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-84317832021-09-11 Multiple Imputation Approaches Applied to the Missing Value Problem in Bottom-Up Proteomics Gardner, Miranda L. Freitas, Michael A. Int J Mol Sci Article Analysis of differential abundance in proteomics data sets requires careful application of missing value imputation. Missing abundance values widely vary when performing comparisons across different sample treatments. For example, one would expect a consistent rate of “missing at random” (MAR) across batches of samples and varying rates of “missing not at random” (MNAR) depending on the inherent difference in sample treatments within the study. The missing value imputation strategy must thus be selected that best accounts for both MAR and MNAR simultaneously. Several important issues must be considered when deciding the appropriate missing value imputation strategy: (1) when it is appropriate to impute data; (2) how to choose a method that reflects the combinatorial manner of MAR and MNAR that occurs in an experiment. This paper provides an evaluation of missing value imputation strategies used in proteomics and presents a case for the use of hybrid left-censored missing value imputation approaches that can handle the MNAR problem common to proteomics data. MDPI 2021-09-06 /pmc/articles/PMC8431783/ /pubmed/34502557 http://dx.doi.org/10.3390/ijms22179650 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Gardner, Miranda L.
Freitas, Michael A.
Multiple Imputation Approaches Applied to the Missing Value Problem in Bottom-Up Proteomics
title Multiple Imputation Approaches Applied to the Missing Value Problem in Bottom-Up Proteomics
title_full Multiple Imputation Approaches Applied to the Missing Value Problem in Bottom-Up Proteomics
title_fullStr Multiple Imputation Approaches Applied to the Missing Value Problem in Bottom-Up Proteomics
title_full_unstemmed Multiple Imputation Approaches Applied to the Missing Value Problem in Bottom-Up Proteomics
title_short Multiple Imputation Approaches Applied to the Missing Value Problem in Bottom-Up Proteomics
title_sort multiple imputation approaches applied to the missing value problem in bottom-up proteomics
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8431783/
https://www.ncbi.nlm.nih.gov/pubmed/34502557
http://dx.doi.org/10.3390/ijms22179650
work_keys_str_mv AT gardnermirandal multipleimputationapproachesappliedtothemissingvalueprobleminbottomupproteomics
AT freitasmichaela multipleimputationapproachesappliedtothemissingvalueprobleminbottomupproteomics