Cargando…
G-Aligner: a graph-based feature alignment method for untargeted LC–MS-based metabolomics
BACKGROUND: Liquid chromatography–mass spectrometry is widely used in untargeted metabolomics for composition profiling. In multi-run analysis scenarios, features of each run are aligned into consensus features by feature alignment algorithms to observe the intensity variations across runs. However,...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10644574/ https://www.ncbi.nlm.nih.gov/pubmed/37964228 http://dx.doi.org/10.1186/s12859-023-05525-4 |
_version_ | 1785147257526419456 |
---|---|
author | Wang, Ruimin Lu, Miaoshan An, Shaowei Wang, Jinyin Yu, Changbin |
author_facet | Wang, Ruimin Lu, Miaoshan An, Shaowei Wang, Jinyin Yu, Changbin |
author_sort | Wang, Ruimin |
collection | PubMed |
description | BACKGROUND: Liquid chromatography–mass spectrometry is widely used in untargeted metabolomics for composition profiling. In multi-run analysis scenarios, features of each run are aligned into consensus features by feature alignment algorithms to observe the intensity variations across runs. However, most of the existing feature alignment methods focus more on accurate retention time correction, while underestimating the importance of feature matching. None of the existing methods can comprehensively consider feature correspondences among all runs and achieve optimal matching. RESULTS: To comprehensively analyze feature correspondences among runs, we propose G-Aligner, a graph-based feature alignment method for untargeted LC–MS data. In the feature matching stage, G-Aligner treats features and potential correspondences as nodes and edges in a multipartite graph, considers the multi-run feature matching problem an unbalanced multidimensional assignment problem, and provides three combinatorial optimization algorithms to find optimal matching solutions. In comparison with the feature alignment methods in OpenMS, MZmine2 and XCMS on three public metabolomics benchmark datasets, G-Aligner achieved the best feature alignment performance on all the three datasets with up to 9.8% and 26.6% increase in accurately aligned features and analytes, and helped all comparison software obtain more accurate results on their self-extracted features by integrating G-Aligner to their analysis workflow. G-Aligner is open-source and freely available at https://github.com/CSi-Studio/G-Aligner under a permissive license. Benchmark datasets, manual annotation results, evaluation methods and results are available at https://doi.org/10.5281/zenodo.8313034 CONCLUSIONS: In this study, we proposed G-Aligner to improve feature matching accuracy for untargeted metabolomics LC–MS data. G-Aligner comprehensively considered potential feature correspondences between all runs, converting the feature matching problem as a multidimensional assignment problem (MAP). In evaluations on three public metabolomics benchmark datasets, G-Aligner achieved the highest alignment accuracy on manual annotated and popular software extracted features, proving the effectiveness and robustness of the algorithm. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05525-4. |
format | Online Article Text |
id | pubmed-10644574 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-106445742023-11-14 G-Aligner: a graph-based feature alignment method for untargeted LC–MS-based metabolomics Wang, Ruimin Lu, Miaoshan An, Shaowei Wang, Jinyin Yu, Changbin BMC Bioinformatics Research BACKGROUND: Liquid chromatography–mass spectrometry is widely used in untargeted metabolomics for composition profiling. In multi-run analysis scenarios, features of each run are aligned into consensus features by feature alignment algorithms to observe the intensity variations across runs. However, most of the existing feature alignment methods focus more on accurate retention time correction, while underestimating the importance of feature matching. None of the existing methods can comprehensively consider feature correspondences among all runs and achieve optimal matching. RESULTS: To comprehensively analyze feature correspondences among runs, we propose G-Aligner, a graph-based feature alignment method for untargeted LC–MS data. In the feature matching stage, G-Aligner treats features and potential correspondences as nodes and edges in a multipartite graph, considers the multi-run feature matching problem an unbalanced multidimensional assignment problem, and provides three combinatorial optimization algorithms to find optimal matching solutions. In comparison with the feature alignment methods in OpenMS, MZmine2 and XCMS on three public metabolomics benchmark datasets, G-Aligner achieved the best feature alignment performance on all the three datasets with up to 9.8% and 26.6% increase in accurately aligned features and analytes, and helped all comparison software obtain more accurate results on their self-extracted features by integrating G-Aligner to their analysis workflow. G-Aligner is open-source and freely available at https://github.com/CSi-Studio/G-Aligner under a permissive license. Benchmark datasets, manual annotation results, evaluation methods and results are available at https://doi.org/10.5281/zenodo.8313034 CONCLUSIONS: In this study, we proposed G-Aligner to improve feature matching accuracy for untargeted metabolomics LC–MS data. G-Aligner comprehensively considered potential feature correspondences between all runs, converting the feature matching problem as a multidimensional assignment problem (MAP). In evaluations on three public metabolomics benchmark datasets, G-Aligner achieved the highest alignment accuracy on manual annotated and popular software extracted features, proving the effectiveness and robustness of the algorithm. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05525-4. BioMed Central 2023-11-14 /pmc/articles/PMC10644574/ /pubmed/37964228 http://dx.doi.org/10.1186/s12859-023-05525-4 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Wang, Ruimin Lu, Miaoshan An, Shaowei Wang, Jinyin Yu, Changbin G-Aligner: a graph-based feature alignment method for untargeted LC–MS-based metabolomics |
title | G-Aligner: a graph-based feature alignment method for untargeted LC–MS-based metabolomics |
title_full | G-Aligner: a graph-based feature alignment method for untargeted LC–MS-based metabolomics |
title_fullStr | G-Aligner: a graph-based feature alignment method for untargeted LC–MS-based metabolomics |
title_full_unstemmed | G-Aligner: a graph-based feature alignment method for untargeted LC–MS-based metabolomics |
title_short | G-Aligner: a graph-based feature alignment method for untargeted LC–MS-based metabolomics |
title_sort | g-aligner: a graph-based feature alignment method for untargeted lc–ms-based metabolomics |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10644574/ https://www.ncbi.nlm.nih.gov/pubmed/37964228 http://dx.doi.org/10.1186/s12859-023-05525-4 |
work_keys_str_mv | AT wangruimin galigneragraphbasedfeaturealignmentmethodforuntargetedlcmsbasedmetabolomics AT lumiaoshan galigneragraphbasedfeaturealignmentmethodforuntargetedlcmsbasedmetabolomics AT anshaowei galigneragraphbasedfeaturealignmentmethodforuntargetedlcmsbasedmetabolomics AT wangjinyin galigneragraphbasedfeaturealignmentmethodforuntargetedlcmsbasedmetabolomics AT yuchangbin galigneragraphbasedfeaturealignmentmethodforuntargetedlcmsbasedmetabolomics |