Cargando…

Alignstein: Optimal transport for improved LC-MS retention time alignment

BACKGROUND: Reproducibility of liquid chromatography separation is limited by retention time drift. As a result, measured signals lack correspondence over replicates of the liquid chromatography–mass spectrometry (LC-MS) experiments. Correction of these errors is named retention time alignment and n...

Descripción completa

Detalles Bibliográficos
Autores principales: Skoraczyński, Grzegorz, Gambin, Anna, Miasojedow, Błażej
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9633278/
https://www.ncbi.nlm.nih.gov/pubmed/36329619
http://dx.doi.org/10.1093/gigascience/giac101
_version_ 1784824229140627456
author Skoraczyński, Grzegorz
Gambin, Anna
Miasojedow, Błażej
author_facet Skoraczyński, Grzegorz
Gambin, Anna
Miasojedow, Błażej
author_sort Skoraczyński, Grzegorz
collection PubMed
description BACKGROUND: Reproducibility of liquid chromatography separation is limited by retention time drift. As a result, measured signals lack correspondence over replicates of the liquid chromatography–mass spectrometry (LC-MS) experiments. Correction of these errors is named retention time alignment and needs to be performed before further quantitative analysis. Despite the availability of numerous alignment algorithms, their accuracy is limited (e.g., for retention time drift that swaps analytes’ elution order). RESULTS: We present the Alignstein, an algorithm for LC-MS retention time alignment. It correctly finds correspondence even for swapped signals. To achieve this, we implemented the generalization of the Wasserstein distance to compare multidimensional features without any reduction of the information or dimension of the analyzed data. Moreover, Alignstein by design requires neither a reference sample nor prior signal identification. We validate the algorithm on publicly available benchmark datasets obtaining competitive results. Finally, we show that it can detect the information contained in the tandem mass spectrum by the spatial properties of chromatograms. CONCLUSIONS: We show that the use of optimal transport effectively overcomes the limitations of existing algorithms for statistical analysis of mass spectrometry datasets. The algorithm’s source code is available at https://github.com/grzsko/Alignstein.
format Online
Article
Text
id pubmed-9633278
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-96332782022-11-04 Alignstein: Optimal transport for improved LC-MS retention time alignment Skoraczyński, Grzegorz Gambin, Anna Miasojedow, Błażej Gigascience Technical Note BACKGROUND: Reproducibility of liquid chromatography separation is limited by retention time drift. As a result, measured signals lack correspondence over replicates of the liquid chromatography–mass spectrometry (LC-MS) experiments. Correction of these errors is named retention time alignment and needs to be performed before further quantitative analysis. Despite the availability of numerous alignment algorithms, their accuracy is limited (e.g., for retention time drift that swaps analytes’ elution order). RESULTS: We present the Alignstein, an algorithm for LC-MS retention time alignment. It correctly finds correspondence even for swapped signals. To achieve this, we implemented the generalization of the Wasserstein distance to compare multidimensional features without any reduction of the information or dimension of the analyzed data. Moreover, Alignstein by design requires neither a reference sample nor prior signal identification. We validate the algorithm on publicly available benchmark datasets obtaining competitive results. Finally, we show that it can detect the information contained in the tandem mass spectrum by the spatial properties of chromatograms. CONCLUSIONS: We show that the use of optimal transport effectively overcomes the limitations of existing algorithms for statistical analysis of mass spectrometry datasets. The algorithm’s source code is available at https://github.com/grzsko/Alignstein. Oxford University Press 2022-11-03 /pmc/articles/PMC9633278/ /pubmed/36329619 http://dx.doi.org/10.1093/gigascience/giac101 Text en © The Author(s) 2022. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Skoraczyński, Grzegorz
Gambin, Anna
Miasojedow, Błażej
Alignstein: Optimal transport for improved LC-MS retention time alignment
title Alignstein: Optimal transport for improved LC-MS retention time alignment
title_full Alignstein: Optimal transport for improved LC-MS retention time alignment
title_fullStr Alignstein: Optimal transport for improved LC-MS retention time alignment
title_full_unstemmed Alignstein: Optimal transport for improved LC-MS retention time alignment
title_short Alignstein: Optimal transport for improved LC-MS retention time alignment
title_sort alignstein: optimal transport for improved lc-ms retention time alignment
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9633278/
https://www.ncbi.nlm.nih.gov/pubmed/36329619
http://dx.doi.org/10.1093/gigascience/giac101
work_keys_str_mv AT skoraczynskigrzegorz alignsteinoptimaltransportforimprovedlcmsretentiontimealignment
AT gambinanna alignsteinoptimaltransportforimprovedlcmsretentiontimealignment
AT miasojedowbłazej alignsteinoptimaltransportforimprovedlcmsretentiontimealignment