Cargando…

Accurate Label-Free Quantification by directLFQ to Compare Unlimited Numbers of Proteomes

Recent advances in mass spectrometry–based proteomics enable the acquisition of increasingly large datasets within relatively short times, which exposes bottlenecks in the bioinformatics pipeline. Although peptide identification is already scalable, most label-free quantification (LFQ) algorithms sc...

Descripción completa

Detalles Bibliográficos
Autores principales: Ammar, Constantin, Schessner, Julia Patricia, Willems, Sander, Michaelis, André C., Mann, Matthias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Biochemistry and Molecular Biology 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10315922/
https://www.ncbi.nlm.nih.gov/pubmed/37225017
http://dx.doi.org/10.1016/j.mcpro.2023.100581
_version_ 1785067605372960768
author Ammar, Constantin
Schessner, Julia Patricia
Willems, Sander
Michaelis, André C.
Mann, Matthias
author_facet Ammar, Constantin
Schessner, Julia Patricia
Willems, Sander
Michaelis, André C.
Mann, Matthias
author_sort Ammar, Constantin
collection PubMed
description Recent advances in mass spectrometry–based proteomics enable the acquisition of increasingly large datasets within relatively short times, which exposes bottlenecks in the bioinformatics pipeline. Although peptide identification is already scalable, most label-free quantification (LFQ) algorithms scale quadratic or cubic with the sample numbers, which may even preclude the analysis of large-scale data. Here we introduce directLFQ, a ratio-based approach for sample normalization and the calculation of protein intensities. It estimates quantities via aligning samples and ion traces by shifting them on top of each other in logarithmic space. Importantly, directLFQ scales linearly with the number of samples, allowing analyses of large studies to finish in minutes instead of days or months. We quantify 10,000 proteomes in 10 min and 100,000 proteomes in less than 2 h, a 1000-fold faster than some implementations of the popular LFQ algorithm MaxLFQ. In-depth characterization of directLFQ reveals excellent normalization properties and benchmark results, comparing favorably to MaxLFQ for both data-dependent acquisition and data-independent acquisition. In addition, directLFQ provides normalized peptide intensity estimates for peptide-level comparisons. It is an important part of an overall quantitative proteomic pipeline that also needs to include high sensitive statistical analysis leading to proteoform resolution. Available as an open-source Python package and a graphical user interface with a one-click installer, it can be used in the AlphaPept ecosystem as well as downstream of most common computational proteomics pipelines.
format Online
Article
Text
id pubmed-10315922
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Society for Biochemistry and Molecular Biology
record_format MEDLINE/PubMed
spelling pubmed-103159222023-07-04 Accurate Label-Free Quantification by directLFQ to Compare Unlimited Numbers of Proteomes Ammar, Constantin Schessner, Julia Patricia Willems, Sander Michaelis, André C. Mann, Matthias Mol Cell Proteomics Technological Innovation and Resources Recent advances in mass spectrometry–based proteomics enable the acquisition of increasingly large datasets within relatively short times, which exposes bottlenecks in the bioinformatics pipeline. Although peptide identification is already scalable, most label-free quantification (LFQ) algorithms scale quadratic or cubic with the sample numbers, which may even preclude the analysis of large-scale data. Here we introduce directLFQ, a ratio-based approach for sample normalization and the calculation of protein intensities. It estimates quantities via aligning samples and ion traces by shifting them on top of each other in logarithmic space. Importantly, directLFQ scales linearly with the number of samples, allowing analyses of large studies to finish in minutes instead of days or months. We quantify 10,000 proteomes in 10 min and 100,000 proteomes in less than 2 h, a 1000-fold faster than some implementations of the popular LFQ algorithm MaxLFQ. In-depth characterization of directLFQ reveals excellent normalization properties and benchmark results, comparing favorably to MaxLFQ for both data-dependent acquisition and data-independent acquisition. In addition, directLFQ provides normalized peptide intensity estimates for peptide-level comparisons. It is an important part of an overall quantitative proteomic pipeline that also needs to include high sensitive statistical analysis leading to proteoform resolution. Available as an open-source Python package and a graphical user interface with a one-click installer, it can be used in the AlphaPept ecosystem as well as downstream of most common computational proteomics pipelines. American Society for Biochemistry and Molecular Biology 2023-05-22 /pmc/articles/PMC10315922/ /pubmed/37225017 http://dx.doi.org/10.1016/j.mcpro.2023.100581 Text en © 2023 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Technological Innovation and Resources
Ammar, Constantin
Schessner, Julia Patricia
Willems, Sander
Michaelis, André C.
Mann, Matthias
Accurate Label-Free Quantification by directLFQ to Compare Unlimited Numbers of Proteomes
title Accurate Label-Free Quantification by directLFQ to Compare Unlimited Numbers of Proteomes
title_full Accurate Label-Free Quantification by directLFQ to Compare Unlimited Numbers of Proteomes
title_fullStr Accurate Label-Free Quantification by directLFQ to Compare Unlimited Numbers of Proteomes
title_full_unstemmed Accurate Label-Free Quantification by directLFQ to Compare Unlimited Numbers of Proteomes
title_short Accurate Label-Free Quantification by directLFQ to Compare Unlimited Numbers of Proteomes
title_sort accurate label-free quantification by directlfq to compare unlimited numbers of proteomes
topic Technological Innovation and Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10315922/
https://www.ncbi.nlm.nih.gov/pubmed/37225017
http://dx.doi.org/10.1016/j.mcpro.2023.100581
work_keys_str_mv AT ammarconstantin accuratelabelfreequantificationbydirectlfqtocompareunlimitednumbersofproteomes
AT schessnerjuliapatricia accuratelabelfreequantificationbydirectlfqtocompareunlimitednumbersofproteomes
AT willemssander accuratelabelfreequantificationbydirectlfqtocompareunlimitednumbersofproteomes
AT michaelisandrec accuratelabelfreequantificationbydirectlfqtocompareunlimitednumbersofproteomes
AT mannmatthias accuratelabelfreequantificationbydirectlfqtocompareunlimitednumbersofproteomes