Cargando…

Metabolite signal identification in accurate mass metabolomics data with MZedDB, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules'

BACKGROUND: Metabolomics experiments using Mass Spectrometry (MS) technology measure the mass to charge ratio (m/z) and intensity of ionised molecules in crude extracts of complex biological samples to generate high dimensional metabolite 'fingerprint' or metabolite 'profile' dat...

Descripción completa

Detalles Bibliográficos
Autores principales: Draper, John, Enot, David P, Parker, David, Beckmann, Manfred, Snowdon, Stuart, Lin, Wanchang, Zubair, Hassan
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2721842/
https://www.ncbi.nlm.nih.gov/pubmed/19622150
http://dx.doi.org/10.1186/1471-2105-10-227
_version_ 1782170240416743424
author Draper, John
Enot, David P
Parker, David
Beckmann, Manfred
Snowdon, Stuart
Lin, Wanchang
Zubair, Hassan
author_facet Draper, John
Enot, David P
Parker, David
Beckmann, Manfred
Snowdon, Stuart
Lin, Wanchang
Zubair, Hassan
author_sort Draper, John
collection PubMed
description BACKGROUND: Metabolomics experiments using Mass Spectrometry (MS) technology measure the mass to charge ratio (m/z) and intensity of ionised molecules in crude extracts of complex biological samples to generate high dimensional metabolite 'fingerprint' or metabolite 'profile' data. High resolution MS instruments perform routinely with a mass accuracy of < 5 ppm (parts per million) thus providing potentially a direct method for signal putative annotation using databases containing metabolite mass information. Most database interfaces support only simple queries with the default assumption that molecules either gain or lose a single proton when ionised. In reality the annotation process is confounded by the fact that many ionisation products will be not only molecular isotopes but also salt/solvent adducts and neutral loss fragments of original metabolites. This report describes an annotation strategy that will allow searching based on all potential ionisation products predicted to form during electrospray ionisation (ESI). RESULTS: Metabolite 'structures' harvested from publicly accessible databases were converted into a common format to generate a comprehensive archive in MZedDB. 'Rules' were derived from chemical information that allowed MZedDB to generate a list of adducts and neutral loss fragments putatively able to form for each structure and calculate, on the fly, the exact molecular weight of every potential ionisation product to provide targets for annotation searches based on accurate mass. We demonstrate that data matrices representing populations of ionisation products generated from different biological matrices contain a large proportion (sometimes > 50%) of molecular isotopes, salt adducts and neutral loss fragments. Correlation analysis of ESI-MS data features confirmed the predicted relationships of m/z signals. An integrated isotope enumerator in MZedDB allowed verification of exact isotopic pattern distributions to corroborate experimental data. CONCLUSION: We conclude that although ultra-high accurate mass instruments provide major insight into the chemical diversity of biological extracts, the facile annotation of a large proportion of signals is not possible by simple, automated query of current databases using computed molecular formulae. Parameterising MZedDB to take into account predicted ionisation behaviour and the biological source of any sample improves greatly both the frequency and accuracy of potential annotation 'hits' in ESI-MS data.
format Text
id pubmed-2721842
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27218422009-08-06 Metabolite signal identification in accurate mass metabolomics data with MZedDB, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules' Draper, John Enot, David P Parker, David Beckmann, Manfred Snowdon, Stuart Lin, Wanchang Zubair, Hassan BMC Bioinformatics Research Article BACKGROUND: Metabolomics experiments using Mass Spectrometry (MS) technology measure the mass to charge ratio (m/z) and intensity of ionised molecules in crude extracts of complex biological samples to generate high dimensional metabolite 'fingerprint' or metabolite 'profile' data. High resolution MS instruments perform routinely with a mass accuracy of < 5 ppm (parts per million) thus providing potentially a direct method for signal putative annotation using databases containing metabolite mass information. Most database interfaces support only simple queries with the default assumption that molecules either gain or lose a single proton when ionised. In reality the annotation process is confounded by the fact that many ionisation products will be not only molecular isotopes but also salt/solvent adducts and neutral loss fragments of original metabolites. This report describes an annotation strategy that will allow searching based on all potential ionisation products predicted to form during electrospray ionisation (ESI). RESULTS: Metabolite 'structures' harvested from publicly accessible databases were converted into a common format to generate a comprehensive archive in MZedDB. 'Rules' were derived from chemical information that allowed MZedDB to generate a list of adducts and neutral loss fragments putatively able to form for each structure and calculate, on the fly, the exact molecular weight of every potential ionisation product to provide targets for annotation searches based on accurate mass. We demonstrate that data matrices representing populations of ionisation products generated from different biological matrices contain a large proportion (sometimes > 50%) of molecular isotopes, salt adducts and neutral loss fragments. Correlation analysis of ESI-MS data features confirmed the predicted relationships of m/z signals. An integrated isotope enumerator in MZedDB allowed verification of exact isotopic pattern distributions to corroborate experimental data. CONCLUSION: We conclude that although ultra-high accurate mass instruments provide major insight into the chemical diversity of biological extracts, the facile annotation of a large proportion of signals is not possible by simple, automated query of current databases using computed molecular formulae. Parameterising MZedDB to take into account predicted ionisation behaviour and the biological source of any sample improves greatly both the frequency and accuracy of potential annotation 'hits' in ESI-MS data. BioMed Central 2009-07-21 /pmc/articles/PMC2721842/ /pubmed/19622150 http://dx.doi.org/10.1186/1471-2105-10-227 Text en Copyright © 2009 Draper et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Draper, John
Enot, David P
Parker, David
Beckmann, Manfred
Snowdon, Stuart
Lin, Wanchang
Zubair, Hassan
Metabolite signal identification in accurate mass metabolomics data with MZedDB, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules'
title Metabolite signal identification in accurate mass metabolomics data with MZedDB, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules'
title_full Metabolite signal identification in accurate mass metabolomics data with MZedDB, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules'
title_fullStr Metabolite signal identification in accurate mass metabolomics data with MZedDB, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules'
title_full_unstemmed Metabolite signal identification in accurate mass metabolomics data with MZedDB, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules'
title_short Metabolite signal identification in accurate mass metabolomics data with MZedDB, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules'
title_sort metabolite signal identification in accurate mass metabolomics data with mzeddb, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules'
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2721842/
https://www.ncbi.nlm.nih.gov/pubmed/19622150
http://dx.doi.org/10.1186/1471-2105-10-227
work_keys_str_mv AT draperjohn metabolitesignalidentificationinaccuratemassmetabolomicsdatawithmzeddbaninteractivemzannotationtoolutilisingpredictedionisationbehaviourrules
AT enotdavidp metabolitesignalidentificationinaccuratemassmetabolomicsdatawithmzeddbaninteractivemzannotationtoolutilisingpredictedionisationbehaviourrules
AT parkerdavid metabolitesignalidentificationinaccuratemassmetabolomicsdatawithmzeddbaninteractivemzannotationtoolutilisingpredictedionisationbehaviourrules
AT beckmannmanfred metabolitesignalidentificationinaccuratemassmetabolomicsdatawithmzeddbaninteractivemzannotationtoolutilisingpredictedionisationbehaviourrules
AT snowdonstuart metabolitesignalidentificationinaccuratemassmetabolomicsdatawithmzeddbaninteractivemzannotationtoolutilisingpredictedionisationbehaviourrules
AT linwanchang metabolitesignalidentificationinaccuratemassmetabolomicsdatawithmzeddbaninteractivemzannotationtoolutilisingpredictedionisationbehaviourrules
AT zubairhassan metabolitesignalidentificationinaccuratemassmetabolomicsdatawithmzeddbaninteractivemzannotationtoolutilisingpredictedionisationbehaviourrules