Cargando…

NAguideR: performing and prioritizing missing value imputations for consistent bottom-up proteomic analyses

Mass spectrometry (MS)-based quantitative proteomics experiments frequently generate data with missing values, which may profoundly affect downstream analyses. A wide variety of imputation methods have been established to deal with the missing-value issue. To date, however, there is a scarcity of ef...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Shisheng, Li, Wenxue, Hu, Liqiang, Cheng, Jingqiu, Yang, Hao, Liu, Yansheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7641313/
https://www.ncbi.nlm.nih.gov/pubmed/32526036
http://dx.doi.org/10.1093/nar/gkaa498
_version_ 1783605890672230400
author Wang, Shisheng
Li, Wenxue
Hu, Liqiang
Cheng, Jingqiu
Yang, Hao
Liu, Yansheng
author_facet Wang, Shisheng
Li, Wenxue
Hu, Liqiang
Cheng, Jingqiu
Yang, Hao
Liu, Yansheng
author_sort Wang, Shisheng
collection PubMed
description Mass spectrometry (MS)-based quantitative proteomics experiments frequently generate data with missing values, which may profoundly affect downstream analyses. A wide variety of imputation methods have been established to deal with the missing-value issue. To date, however, there is a scarcity of efficient, systematic, and easy-to-handle tools that are tailored for proteomics community. Herein, we developed a user-friendly and powerful stand-alone software, NAguideR, to enable implementation and evaluation of different missing value methods offered by 23 widely used missing-value imputation algorithms. NAguideR further evaluates data imputation results through classic computational criteria and, unprecedentedly, proteomic empirical criteria, such as quantitative consistency between different charge-states of the same peptide, different peptides belonging to the same proteins, and individual proteins participating protein complexes and functional interactions. We applied NAguideR into three label-free proteomic datasets featuring peptide-level, protein-level, and phosphoproteomic variables respectively, all generated by data independent acquisition mass spectrometry (DIA-MS) with substantial biological replicates. The results indicate that NAguideR is able to discriminate the optimal imputation methods that are facilitating DIA-MS experiments over those sub-optimal and low-performance algorithms. NAguideR further provides downloadable tables and figures supporting flexible data analysis and interpretation. NAguideR is freely available at http://www.omicsolution.org/wukong/NAguideR/ and the source code: https://github.com/wangshisheng/NAguideR/.
format Online
Article
Text
id pubmed-7641313
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-76413132020-11-10 NAguideR: performing and prioritizing missing value imputations for consistent bottom-up proteomic analyses Wang, Shisheng Li, Wenxue Hu, Liqiang Cheng, Jingqiu Yang, Hao Liu, Yansheng Nucleic Acids Res Methods Online Mass spectrometry (MS)-based quantitative proteomics experiments frequently generate data with missing values, which may profoundly affect downstream analyses. A wide variety of imputation methods have been established to deal with the missing-value issue. To date, however, there is a scarcity of efficient, systematic, and easy-to-handle tools that are tailored for proteomics community. Herein, we developed a user-friendly and powerful stand-alone software, NAguideR, to enable implementation and evaluation of different missing value methods offered by 23 widely used missing-value imputation algorithms. NAguideR further evaluates data imputation results through classic computational criteria and, unprecedentedly, proteomic empirical criteria, such as quantitative consistency between different charge-states of the same peptide, different peptides belonging to the same proteins, and individual proteins participating protein complexes and functional interactions. We applied NAguideR into three label-free proteomic datasets featuring peptide-level, protein-level, and phosphoproteomic variables respectively, all generated by data independent acquisition mass spectrometry (DIA-MS) with substantial biological replicates. The results indicate that NAguideR is able to discriminate the optimal imputation methods that are facilitating DIA-MS experiments over those sub-optimal and low-performance algorithms. NAguideR further provides downloadable tables and figures supporting flexible data analysis and interpretation. NAguideR is freely available at http://www.omicsolution.org/wukong/NAguideR/ and the source code: https://github.com/wangshisheng/NAguideR/. Oxford University Press 2020-06-11 /pmc/articles/PMC7641313/ /pubmed/32526036 http://dx.doi.org/10.1093/nar/gkaa498 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Wang, Shisheng
Li, Wenxue
Hu, Liqiang
Cheng, Jingqiu
Yang, Hao
Liu, Yansheng
NAguideR: performing and prioritizing missing value imputations for consistent bottom-up proteomic analyses
title NAguideR: performing and prioritizing missing value imputations for consistent bottom-up proteomic analyses
title_full NAguideR: performing and prioritizing missing value imputations for consistent bottom-up proteomic analyses
title_fullStr NAguideR: performing and prioritizing missing value imputations for consistent bottom-up proteomic analyses
title_full_unstemmed NAguideR: performing and prioritizing missing value imputations for consistent bottom-up proteomic analyses
title_short NAguideR: performing and prioritizing missing value imputations for consistent bottom-up proteomic analyses
title_sort naguider: performing and prioritizing missing value imputations for consistent bottom-up proteomic analyses
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7641313/
https://www.ncbi.nlm.nih.gov/pubmed/32526036
http://dx.doi.org/10.1093/nar/gkaa498
work_keys_str_mv AT wangshisheng naguiderperformingandprioritizingmissingvalueimputationsforconsistentbottomupproteomicanalyses
AT liwenxue naguiderperformingandprioritizingmissingvalueimputationsforconsistentbottomupproteomicanalyses
AT huliqiang naguiderperformingandprioritizingmissingvalueimputationsforconsistentbottomupproteomicanalyses
AT chengjingqiu naguiderperformingandprioritizingmissingvalueimputationsforconsistentbottomupproteomicanalyses
AT yanghao naguiderperformingandprioritizingmissingvalueimputationsforconsistentbottomupproteomicanalyses
AT liuyansheng naguiderperformingandprioritizingmissingvalueimputationsforconsistentbottomupproteomicanalyses