Cargando…
Achieving quantitative reproducibility in label-free multisite DIA experiments through multirun alignment
DIA is a mainstream method for quantitative proteomics, but consistent quantification across multiple LC-MS/MS instruments remains a bottleneck in parallelizing data acquisition. One reason for this inconsistency and missing quantification is the retention time shift which current software does not...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10616189/ https://www.ncbi.nlm.nih.gov/pubmed/37903988 http://dx.doi.org/10.1038/s42003-023-05437-2 |
_version_ | 1785129338320977920 |
---|---|
author | Gupta, Shubham Sing, Justin C. Röst, Hannes L. |
author_facet | Gupta, Shubham Sing, Justin C. Röst, Hannes L. |
author_sort | Gupta, Shubham |
collection | PubMed |
description | DIA is a mainstream method for quantitative proteomics, but consistent quantification across multiple LC-MS/MS instruments remains a bottleneck in parallelizing data acquisition. One reason for this inconsistency and missing quantification is the retention time shift which current software does not adequately address for runs from multiple sites. We present multirun chromatogram alignment strategies to map peaks across columns, including the traditional reference-based Star method, and two novel approaches: MST and Progressive alignment. These reference-free strategies produce a quantitatively accurate data-matrix, even from heterogeneous multi-column studies. Progressive alignment also generates merged chromatograms from all runs which has not been previously achieved for LC-MS/MS data. First, we demonstrate the effectiveness of multirun alignment strategies on a gold-standard annotated dataset, resulting in a threefold reduction in quantitation error-rate compared to non-aligned DIA results. Subsequently, on a multi-species dataset that DIAlignR effectively controls the quantitative error rate, improves precision in protein measurements, and exhibits conservative peak alignment. We next show that the MST alignment reduces cross-site CV by 50% for highly abundant proteins when applied to a dataset from 11 different LC-MS/MS setups. Finally, the reanalysis of 949 plasma runs with multirun alignment revealed a more than 50% increase in insulin resistance (IR) and respiratory viral infection (RVI) proteins, identifying 11 and 13 proteins respectively, compared to prior analysis without it. The three strategies are implemented in our DIAlignR workflow (>2.3) and can be combined with linear, non-linear, or hybrid pairwise alignment. |
format | Online Article Text |
id | pubmed-10616189 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-106161892023-11-01 Achieving quantitative reproducibility in label-free multisite DIA experiments through multirun alignment Gupta, Shubham Sing, Justin C. Röst, Hannes L. Commun Biol Article DIA is a mainstream method for quantitative proteomics, but consistent quantification across multiple LC-MS/MS instruments remains a bottleneck in parallelizing data acquisition. One reason for this inconsistency and missing quantification is the retention time shift which current software does not adequately address for runs from multiple sites. We present multirun chromatogram alignment strategies to map peaks across columns, including the traditional reference-based Star method, and two novel approaches: MST and Progressive alignment. These reference-free strategies produce a quantitatively accurate data-matrix, even from heterogeneous multi-column studies. Progressive alignment also generates merged chromatograms from all runs which has not been previously achieved for LC-MS/MS data. First, we demonstrate the effectiveness of multirun alignment strategies on a gold-standard annotated dataset, resulting in a threefold reduction in quantitation error-rate compared to non-aligned DIA results. Subsequently, on a multi-species dataset that DIAlignR effectively controls the quantitative error rate, improves precision in protein measurements, and exhibits conservative peak alignment. We next show that the MST alignment reduces cross-site CV by 50% for highly abundant proteins when applied to a dataset from 11 different LC-MS/MS setups. Finally, the reanalysis of 949 plasma runs with multirun alignment revealed a more than 50% increase in insulin resistance (IR) and respiratory viral infection (RVI) proteins, identifying 11 and 13 proteins respectively, compared to prior analysis without it. The three strategies are implemented in our DIAlignR workflow (>2.3) and can be combined with linear, non-linear, or hybrid pairwise alignment. Nature Publishing Group UK 2023-10-30 /pmc/articles/PMC10616189/ /pubmed/37903988 http://dx.doi.org/10.1038/s42003-023-05437-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Gupta, Shubham Sing, Justin C. Röst, Hannes L. Achieving quantitative reproducibility in label-free multisite DIA experiments through multirun alignment |
title | Achieving quantitative reproducibility in label-free multisite DIA experiments through multirun alignment |
title_full | Achieving quantitative reproducibility in label-free multisite DIA experiments through multirun alignment |
title_fullStr | Achieving quantitative reproducibility in label-free multisite DIA experiments through multirun alignment |
title_full_unstemmed | Achieving quantitative reproducibility in label-free multisite DIA experiments through multirun alignment |
title_short | Achieving quantitative reproducibility in label-free multisite DIA experiments through multirun alignment |
title_sort | achieving quantitative reproducibility in label-free multisite dia experiments through multirun alignment |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10616189/ https://www.ncbi.nlm.nih.gov/pubmed/37903988 http://dx.doi.org/10.1038/s42003-023-05437-2 |
work_keys_str_mv | AT guptashubham achievingquantitativereproducibilityinlabelfreemultisitediaexperimentsthroughmultirunalignment AT singjustinc achievingquantitativereproducibilityinlabelfreemultisitediaexperimentsthroughmultirunalignment AT rosthannesl achievingquantitativereproducibilityinlabelfreemultisitediaexperimentsthroughmultirunalignment |