Cargando…

G-Aligner: a graph-based feature alignment method for untargeted LC–MS-based metabolomics

BACKGROUND: Liquid chromatography–mass spectrometry is widely used in untargeted metabolomics for composition profiling. In multi-run analysis scenarios, features of each run are aligned into consensus features by feature alignment algorithms to observe the intensity variations across runs. However,...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Ruimin, Lu, Miaoshan, An, Shaowei, Wang, Jinyin, Yu, Changbin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10644574/
https://www.ncbi.nlm.nih.gov/pubmed/37964228
http://dx.doi.org/10.1186/s12859-023-05525-4
_version_ 1785147257526419456
author Wang, Ruimin
Lu, Miaoshan
An, Shaowei
Wang, Jinyin
Yu, Changbin
author_facet Wang, Ruimin
Lu, Miaoshan
An, Shaowei
Wang, Jinyin
Yu, Changbin
author_sort Wang, Ruimin
collection PubMed
description BACKGROUND: Liquid chromatography–mass spectrometry is widely used in untargeted metabolomics for composition profiling. In multi-run analysis scenarios, features of each run are aligned into consensus features by feature alignment algorithms to observe the intensity variations across runs. However, most of the existing feature alignment methods focus more on accurate retention time correction, while underestimating the importance of feature matching. None of the existing methods can comprehensively consider feature correspondences among all runs and achieve optimal matching. RESULTS: To comprehensively analyze feature correspondences among runs, we propose G-Aligner, a graph-based feature alignment method for untargeted LC–MS data. In the feature matching stage, G-Aligner treats features and potential correspondences as nodes and edges in a multipartite graph, considers the multi-run feature matching problem an unbalanced multidimensional assignment problem, and provides three combinatorial optimization algorithms to find optimal matching solutions. In comparison with the feature alignment methods in OpenMS, MZmine2 and XCMS on three public metabolomics benchmark datasets, G-Aligner achieved the best feature alignment performance on all the three datasets with up to 9.8% and 26.6% increase in accurately aligned features and analytes, and helped all comparison software obtain more accurate results on their self-extracted features by integrating G-Aligner to their analysis workflow. G-Aligner is open-source and freely available at https://github.com/CSi-Studio/G-Aligner under a permissive license. Benchmark datasets, manual annotation results, evaluation methods and results are available at https://doi.org/10.5281/zenodo.8313034 CONCLUSIONS: In this study, we proposed G-Aligner to improve feature matching accuracy for untargeted metabolomics LC–MS data. G-Aligner comprehensively considered potential feature correspondences between all runs, converting the feature matching problem as a multidimensional assignment problem (MAP). In evaluations on three public metabolomics benchmark datasets, G-Aligner achieved the highest alignment accuracy on manual annotated and popular software extracted features, proving the effectiveness and robustness of the algorithm. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05525-4.
format Online
Article
Text
id pubmed-10644574
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-106445742023-11-14 G-Aligner: a graph-based feature alignment method for untargeted LC–MS-based metabolomics Wang, Ruimin Lu, Miaoshan An, Shaowei Wang, Jinyin Yu, Changbin BMC Bioinformatics Research BACKGROUND: Liquid chromatography–mass spectrometry is widely used in untargeted metabolomics for composition profiling. In multi-run analysis scenarios, features of each run are aligned into consensus features by feature alignment algorithms to observe the intensity variations across runs. However, most of the existing feature alignment methods focus more on accurate retention time correction, while underestimating the importance of feature matching. None of the existing methods can comprehensively consider feature correspondences among all runs and achieve optimal matching. RESULTS: To comprehensively analyze feature correspondences among runs, we propose G-Aligner, a graph-based feature alignment method for untargeted LC–MS data. In the feature matching stage, G-Aligner treats features and potential correspondences as nodes and edges in a multipartite graph, considers the multi-run feature matching problem an unbalanced multidimensional assignment problem, and provides three combinatorial optimization algorithms to find optimal matching solutions. In comparison with the feature alignment methods in OpenMS, MZmine2 and XCMS on three public metabolomics benchmark datasets, G-Aligner achieved the best feature alignment performance on all the three datasets with up to 9.8% and 26.6% increase in accurately aligned features and analytes, and helped all comparison software obtain more accurate results on their self-extracted features by integrating G-Aligner to their analysis workflow. G-Aligner is open-source and freely available at https://github.com/CSi-Studio/G-Aligner under a permissive license. Benchmark datasets, manual annotation results, evaluation methods and results are available at https://doi.org/10.5281/zenodo.8313034 CONCLUSIONS: In this study, we proposed G-Aligner to improve feature matching accuracy for untargeted metabolomics LC–MS data. G-Aligner comprehensively considered potential feature correspondences between all runs, converting the feature matching problem as a multidimensional assignment problem (MAP). In evaluations on three public metabolomics benchmark datasets, G-Aligner achieved the highest alignment accuracy on manual annotated and popular software extracted features, proving the effectiveness and robustness of the algorithm. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05525-4. BioMed Central 2023-11-14 /pmc/articles/PMC10644574/ /pubmed/37964228 http://dx.doi.org/10.1186/s12859-023-05525-4 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Wang, Ruimin
Lu, Miaoshan
An, Shaowei
Wang, Jinyin
Yu, Changbin
G-Aligner: a graph-based feature alignment method for untargeted LC–MS-based metabolomics
title G-Aligner: a graph-based feature alignment method for untargeted LC–MS-based metabolomics
title_full G-Aligner: a graph-based feature alignment method for untargeted LC–MS-based metabolomics
title_fullStr G-Aligner: a graph-based feature alignment method for untargeted LC–MS-based metabolomics
title_full_unstemmed G-Aligner: a graph-based feature alignment method for untargeted LC–MS-based metabolomics
title_short G-Aligner: a graph-based feature alignment method for untargeted LC–MS-based metabolomics
title_sort g-aligner: a graph-based feature alignment method for untargeted lc–ms-based metabolomics
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10644574/
https://www.ncbi.nlm.nih.gov/pubmed/37964228
http://dx.doi.org/10.1186/s12859-023-05525-4
work_keys_str_mv AT wangruimin galigneragraphbasedfeaturealignmentmethodforuntargetedlcmsbasedmetabolomics
AT lumiaoshan galigneragraphbasedfeaturealignmentmethodforuntargetedlcmsbasedmetabolomics
AT anshaowei galigneragraphbasedfeaturealignmentmethodforuntargetedlcmsbasedmetabolomics
AT wangjinyin galigneragraphbasedfeaturealignmentmethodforuntargetedlcmsbasedmetabolomics
AT yuchangbin galigneragraphbasedfeaturealignmentmethodforuntargetedlcmsbasedmetabolomics