Cargando…

Predicting retention time in hydrophilic interaction liquid chromatography mass spectrometry and its use for peak annotation in metabolomics

Liquid chromatography coupled to mass spectrometry (LCMS) is widely used in metabolomics due to its sensitivity, reproducibility, speed and versatility. Metabolites are detected as peaks which are characterised by mass-over-charge ratio (m/z) and retention time (rt), and one of the most critical but...

Descripción completa

Detalles Bibliográficos
Autores principales: Cao, Mingshu, Fraser, Karl, Huege, Jan, Featonby, Tom, Rasmussen, Susanne, Jones, Chris
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4419193/
https://www.ncbi.nlm.nih.gov/pubmed/25972771
http://dx.doi.org/10.1007/s11306-014-0727-x
_version_ 1782369539933077504
author Cao, Mingshu
Fraser, Karl
Huege, Jan
Featonby, Tom
Rasmussen, Susanne
Jones, Chris
author_facet Cao, Mingshu
Fraser, Karl
Huege, Jan
Featonby, Tom
Rasmussen, Susanne
Jones, Chris
author_sort Cao, Mingshu
collection PubMed
description Liquid chromatography coupled to mass spectrometry (LCMS) is widely used in metabolomics due to its sensitivity, reproducibility, speed and versatility. Metabolites are detected as peaks which are characterised by mass-over-charge ratio (m/z) and retention time (rt), and one of the most critical but also the most challenging tasks in metabolomics is to annotate the large number of peaks detected in biological samples. Accurate m/z measurements enable the prediction of molecular formulae which provide clues to the chemical identity of peaks, but often a number of metabolites have identical molecular formulae. Chromatographic behaviour, reflecting the physicochemical properties of metabolites, should also provide structural information. However, the variation in rt between analytical runs, and the complicating factors underlying the observed time shifts, make the use of such information for peak annotation a non-trivial task. To this end, we conducted Quantitative Structure–Retention Relationship (QSRR) modelling between the calculated molecular descriptors (MDs) and the experimental retention times (rts) of 93 authentic compounds analysed using hydrophilic interaction liquid chromatography (HILIC) coupled to high resolution MS. A predictive QSRR model based on Random Forests algorithm outperformed a Multiple Linear Regression based model, and achieved a high correlation between predicted rts and experimental rts (Pearson’s correlation coefficient = 0.97), with mean and median absolute error of 0.52 min and 0.34 min (corresponding to 5.1 and 3.2 % error), respectively. We demonstrate that rt prediction with the precision achieved enables the systematic utilisation of rts for annotating unknown peaks detected in a metabolomics study. The application of the QSRR model with the strategy we outlined enhanced the peak annotation process by reducing the number of false positives resulting from database queries by matching accurate mass alone, and enriching the reference library. The predicted rts were validated using either authentic compounds or ion fragmentation patterns. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-014-0727-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4419193
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-44191932015-05-11 Predicting retention time in hydrophilic interaction liquid chromatography mass spectrometry and its use for peak annotation in metabolomics Cao, Mingshu Fraser, Karl Huege, Jan Featonby, Tom Rasmussen, Susanne Jones, Chris Metabolomics Original Article Liquid chromatography coupled to mass spectrometry (LCMS) is widely used in metabolomics due to its sensitivity, reproducibility, speed and versatility. Metabolites are detected as peaks which are characterised by mass-over-charge ratio (m/z) and retention time (rt), and one of the most critical but also the most challenging tasks in metabolomics is to annotate the large number of peaks detected in biological samples. Accurate m/z measurements enable the prediction of molecular formulae which provide clues to the chemical identity of peaks, but often a number of metabolites have identical molecular formulae. Chromatographic behaviour, reflecting the physicochemical properties of metabolites, should also provide structural information. However, the variation in rt between analytical runs, and the complicating factors underlying the observed time shifts, make the use of such information for peak annotation a non-trivial task. To this end, we conducted Quantitative Structure–Retention Relationship (QSRR) modelling between the calculated molecular descriptors (MDs) and the experimental retention times (rts) of 93 authentic compounds analysed using hydrophilic interaction liquid chromatography (HILIC) coupled to high resolution MS. A predictive QSRR model based on Random Forests algorithm outperformed a Multiple Linear Regression based model, and achieved a high correlation between predicted rts and experimental rts (Pearson’s correlation coefficient = 0.97), with mean and median absolute error of 0.52 min and 0.34 min (corresponding to 5.1 and 3.2 % error), respectively. We demonstrate that rt prediction with the precision achieved enables the systematic utilisation of rts for annotating unknown peaks detected in a metabolomics study. The application of the QSRR model with the strategy we outlined enhanced the peak annotation process by reducing the number of false positives resulting from database queries by matching accurate mass alone, and enriching the reference library. The predicted rts were validated using either authentic compounds or ion fragmentation patterns. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-014-0727-x) contains supplementary material, which is available to authorized users. Springer US 2014-09-07 2015 /pmc/articles/PMC4419193/ /pubmed/25972771 http://dx.doi.org/10.1007/s11306-014-0727-x Text en © The Author(s) 2014 https://creativecommons.org/licenses/by/4.0/ Open AccessThis article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.
spellingShingle Original Article
Cao, Mingshu
Fraser, Karl
Huege, Jan
Featonby, Tom
Rasmussen, Susanne
Jones, Chris
Predicting retention time in hydrophilic interaction liquid chromatography mass spectrometry and its use for peak annotation in metabolomics
title Predicting retention time in hydrophilic interaction liquid chromatography mass spectrometry and its use for peak annotation in metabolomics
title_full Predicting retention time in hydrophilic interaction liquid chromatography mass spectrometry and its use for peak annotation in metabolomics
title_fullStr Predicting retention time in hydrophilic interaction liquid chromatography mass spectrometry and its use for peak annotation in metabolomics
title_full_unstemmed Predicting retention time in hydrophilic interaction liquid chromatography mass spectrometry and its use for peak annotation in metabolomics
title_short Predicting retention time in hydrophilic interaction liquid chromatography mass spectrometry and its use for peak annotation in metabolomics
title_sort predicting retention time in hydrophilic interaction liquid chromatography mass spectrometry and its use for peak annotation in metabolomics
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4419193/
https://www.ncbi.nlm.nih.gov/pubmed/25972771
http://dx.doi.org/10.1007/s11306-014-0727-x
work_keys_str_mv AT caomingshu predictingretentiontimeinhydrophilicinteractionliquidchromatographymassspectrometryanditsuseforpeakannotationinmetabolomics
AT fraserkarl predictingretentiontimeinhydrophilicinteractionliquidchromatographymassspectrometryanditsuseforpeakannotationinmetabolomics
AT huegejan predictingretentiontimeinhydrophilicinteractionliquidchromatographymassspectrometryanditsuseforpeakannotationinmetabolomics
AT featonbytom predictingretentiontimeinhydrophilicinteractionliquidchromatographymassspectrometryanditsuseforpeakannotationinmetabolomics
AT rasmussensusanne predictingretentiontimeinhydrophilicinteractionliquidchromatographymassspectrometryanditsuseforpeakannotationinmetabolomics
AT joneschris predictingretentiontimeinhydrophilicinteractionliquidchromatographymassspectrometryanditsuseforpeakannotationinmetabolomics