Cargando…

Finding Correspondence between Metabolomic Features in Untargeted Liquid Chromatography–Mass Spectrometry Metabolomics Datasets

[Image: see text] Integration of multiple datasets can greatly enhance bioanalytical studies, for example, by increasing power to discover and validate biomarkers. In liquid chromatography–mass spectrometry (LC–MS) metabolomics, it is especially hard to combine untargeted datasets since the majority...

Descripción completa

Detalles Bibliográficos
Autores principales: Climaco Pinto, Rui, Karaman, Ibrahim, Lewis, Matthew R., Hällqvist, Jenny, Kaluarachchi, Manuja, Graça, Gonçalo, Chekmeneva, Elena, Durainayagam, Brenan, Ghanbari, Mohsen, Ikram, M. Arfan, Zetterberg, Henrik, Griffin, Julian, Elliott, Paul, Tzoulaki, Ioanna, Dehghan, Abbas, Herrington, David, Ebbels, Timothy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9008693/
https://www.ncbi.nlm.nih.gov/pubmed/35360896
http://dx.doi.org/10.1021/acs.analchem.1c03592
_version_ 1784687110263930880
author Climaco Pinto, Rui
Karaman, Ibrahim
Lewis, Matthew R.
Hällqvist, Jenny
Kaluarachchi, Manuja
Graça, Gonçalo
Chekmeneva, Elena
Durainayagam, Brenan
Ghanbari, Mohsen
Ikram, M. Arfan
Zetterberg, Henrik
Griffin, Julian
Elliott, Paul
Tzoulaki, Ioanna
Dehghan, Abbas
Herrington, David
Ebbels, Timothy
author_facet Climaco Pinto, Rui
Karaman, Ibrahim
Lewis, Matthew R.
Hällqvist, Jenny
Kaluarachchi, Manuja
Graça, Gonçalo
Chekmeneva, Elena
Durainayagam, Brenan
Ghanbari, Mohsen
Ikram, M. Arfan
Zetterberg, Henrik
Griffin, Julian
Elliott, Paul
Tzoulaki, Ioanna
Dehghan, Abbas
Herrington, David
Ebbels, Timothy
author_sort Climaco Pinto, Rui
collection PubMed
description [Image: see text] Integration of multiple datasets can greatly enhance bioanalytical studies, for example, by increasing power to discover and validate biomarkers. In liquid chromatography–mass spectrometry (LC–MS) metabolomics, it is especially hard to combine untargeted datasets since the majority of metabolomic features are not annotated and thus cannot be matched by chemical identity. Typically, the information available for each feature is retention time (RT), mass-to-charge ratio (m/z), and feature intensity (FI). Pairs of features from the same metabolite in separate datasets can exhibit small but significant differences, making matching very challenging. Current methods to address this issue are too simple or rely on assumptions that cannot be met in all cases. We present a method to find feature correspondence between two similar LC–MS metabolomics experiments or batches using only the features’ RT, m/z, and FI. We demonstrate the method on both real and synthetic datasets, using six orthogonal validation strategies to gauge the matching quality. In our main example, 4953 features were uniquely matched, of which 585 (96.8%) of 604 manually annotated features were correct. In a second example, 2324 features could be uniquely matched, with 79 (90.8%) out of 87 annotated features correctly matched. Most of the missed annotated matches are between features that behave very differently from modeled inter-dataset shifts of RT, MZ, and FI. In a third example with simulated data with 4755 features per dataset, 99.6% of the matches were correct. Finally, the results of matching three other dataset pairs using our method are compared with a published alternative method, metabCombiner, showing the advantages of our approach. The method can be applied using M2S (Match 2 Sets), a free, open-source MATLAB toolbox, available at https://github.com/rjdossan/M2S.
format Online
Article
Text
id pubmed-9008693
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-90086932022-04-14 Finding Correspondence between Metabolomic Features in Untargeted Liquid Chromatography–Mass Spectrometry Metabolomics Datasets Climaco Pinto, Rui Karaman, Ibrahim Lewis, Matthew R. Hällqvist, Jenny Kaluarachchi, Manuja Graça, Gonçalo Chekmeneva, Elena Durainayagam, Brenan Ghanbari, Mohsen Ikram, M. Arfan Zetterberg, Henrik Griffin, Julian Elliott, Paul Tzoulaki, Ioanna Dehghan, Abbas Herrington, David Ebbels, Timothy Anal Chem [Image: see text] Integration of multiple datasets can greatly enhance bioanalytical studies, for example, by increasing power to discover and validate biomarkers. In liquid chromatography–mass spectrometry (LC–MS) metabolomics, it is especially hard to combine untargeted datasets since the majority of metabolomic features are not annotated and thus cannot be matched by chemical identity. Typically, the information available for each feature is retention time (RT), mass-to-charge ratio (m/z), and feature intensity (FI). Pairs of features from the same metabolite in separate datasets can exhibit small but significant differences, making matching very challenging. Current methods to address this issue are too simple or rely on assumptions that cannot be met in all cases. We present a method to find feature correspondence between two similar LC–MS metabolomics experiments or batches using only the features’ RT, m/z, and FI. We demonstrate the method on both real and synthetic datasets, using six orthogonal validation strategies to gauge the matching quality. In our main example, 4953 features were uniquely matched, of which 585 (96.8%) of 604 manually annotated features were correct. In a second example, 2324 features could be uniquely matched, with 79 (90.8%) out of 87 annotated features correctly matched. Most of the missed annotated matches are between features that behave very differently from modeled inter-dataset shifts of RT, MZ, and FI. In a third example with simulated data with 4755 features per dataset, 99.6% of the matches were correct. Finally, the results of matching three other dataset pairs using our method are compared with a published alternative method, metabCombiner, showing the advantages of our approach. The method can be applied using M2S (Match 2 Sets), a free, open-source MATLAB toolbox, available at https://github.com/rjdossan/M2S. American Chemical Society 2022-03-31 2022-04-12 /pmc/articles/PMC9008693/ /pubmed/35360896 http://dx.doi.org/10.1021/acs.analchem.1c03592 Text en © 2022 American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Climaco Pinto, Rui
Karaman, Ibrahim
Lewis, Matthew R.
Hällqvist, Jenny
Kaluarachchi, Manuja
Graça, Gonçalo
Chekmeneva, Elena
Durainayagam, Brenan
Ghanbari, Mohsen
Ikram, M. Arfan
Zetterberg, Henrik
Griffin, Julian
Elliott, Paul
Tzoulaki, Ioanna
Dehghan, Abbas
Herrington, David
Ebbels, Timothy
Finding Correspondence between Metabolomic Features in Untargeted Liquid Chromatography–Mass Spectrometry Metabolomics Datasets
title Finding Correspondence between Metabolomic Features in Untargeted Liquid Chromatography–Mass Spectrometry Metabolomics Datasets
title_full Finding Correspondence between Metabolomic Features in Untargeted Liquid Chromatography–Mass Spectrometry Metabolomics Datasets
title_fullStr Finding Correspondence between Metabolomic Features in Untargeted Liquid Chromatography–Mass Spectrometry Metabolomics Datasets
title_full_unstemmed Finding Correspondence between Metabolomic Features in Untargeted Liquid Chromatography–Mass Spectrometry Metabolomics Datasets
title_short Finding Correspondence between Metabolomic Features in Untargeted Liquid Chromatography–Mass Spectrometry Metabolomics Datasets
title_sort finding correspondence between metabolomic features in untargeted liquid chromatography–mass spectrometry metabolomics datasets
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9008693/
https://www.ncbi.nlm.nih.gov/pubmed/35360896
http://dx.doi.org/10.1021/acs.analchem.1c03592
work_keys_str_mv AT climacopintorui findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets
AT karamanibrahim findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets
AT lewismatthewr findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets
AT hallqvistjenny findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets
AT kaluarachchimanuja findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets
AT gracagoncalo findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets
AT chekmenevaelena findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets
AT durainayagambrenan findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets
AT ghanbarimohsen findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets
AT ikrammarfan findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets
AT zetterberghenrik findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets
AT griffinjulian findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets
AT elliottpaul findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets
AT tzoulakiioanna findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets
AT dehghanabbas findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets
AT herringtondavid findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets
AT ebbelstimothy findingcorrespondencebetweenmetabolomicfeaturesinuntargetedliquidchromatographymassspectrometrymetabolomicsdatasets