Cargando…

New Approach Combining Molecular Fingerprints and Machine Learning to Estimate Relative Ionization Efficiency in Electrospray Ionization

[Image: see text] Electrospray ionization (ESI) is widely used as an ionization source for the analysis of complex mixtures by mass spectrometry. However, different compounds ionize more or less effectively in the ESI source, meaning instrument responses can vary by orders of magnitude, often in har...

Descripción completa

Detalles Bibliográficos
Autores principales: Mayhew, Alfred W., Topping, David O., Hamilton, Jacqueline F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2020
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7191837/
https://www.ncbi.nlm.nih.gov/pubmed/32363303
http://dx.doi.org/10.1021/acsomega.0c00732
_version_ 1783527924868055040
author Mayhew, Alfred W.
Topping, David O.
Hamilton, Jacqueline F.
author_facet Mayhew, Alfred W.
Topping, David O.
Hamilton, Jacqueline F.
author_sort Mayhew, Alfred W.
collection PubMed
description [Image: see text] Electrospray ionization (ESI) is widely used as an ionization source for the analysis of complex mixtures by mass spectrometry. However, different compounds ionize more or less effectively in the ESI source, meaning instrument responses can vary by orders of magnitude, often in hard-to-predict ways. This precludes the use of ESI for quantitative analysis where authentic standards are not available. Relative ionization efficiency (RIE) scales have been proposed as a route to predict the response of compounds in ESI. In this work, a scale of RIEs was constructed for 51 carboxylic acids, spanning a wide range of additional functionalities, to produce a model for predicting the RIE of unknown compounds. While using a limited number of compounds, we explore the usefulness of building a predictor using popular supervised regression techniques, encoding the compounds as combinations of different structural features using a range of common “fingerprints”. It was found that Bayesian ridge regression gives the best predictive model, encoding compounds using features designed for activity coefficient models. This produced a predictive model with an R(2) score of 0.62 and a root-mean-square error (RMSE) of 0.362. Such scores are comparable to those obtained in previous studies but without the requirement to first measure or predict the physical properties of the compounds, potentially reducing the time required to make predictions.
format Online
Article
Text
id pubmed-7191837
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-71918372020-05-01 New Approach Combining Molecular Fingerprints and Machine Learning to Estimate Relative Ionization Efficiency in Electrospray Ionization Mayhew, Alfred W. Topping, David O. Hamilton, Jacqueline F. ACS Omega [Image: see text] Electrospray ionization (ESI) is widely used as an ionization source for the analysis of complex mixtures by mass spectrometry. However, different compounds ionize more or less effectively in the ESI source, meaning instrument responses can vary by orders of magnitude, often in hard-to-predict ways. This precludes the use of ESI for quantitative analysis where authentic standards are not available. Relative ionization efficiency (RIE) scales have been proposed as a route to predict the response of compounds in ESI. In this work, a scale of RIEs was constructed for 51 carboxylic acids, spanning a wide range of additional functionalities, to produce a model for predicting the RIE of unknown compounds. While using a limited number of compounds, we explore the usefulness of building a predictor using popular supervised regression techniques, encoding the compounds as combinations of different structural features using a range of common “fingerprints”. It was found that Bayesian ridge regression gives the best predictive model, encoding compounds using features designed for activity coefficient models. This produced a predictive model with an R(2) score of 0.62 and a root-mean-square error (RMSE) of 0.362. Such scores are comparable to those obtained in previous studies but without the requirement to first measure or predict the physical properties of the compounds, potentially reducing the time required to make predictions. American Chemical Society 2020-04-14 /pmc/articles/PMC7191837/ /pubmed/32363303 http://dx.doi.org/10.1021/acsomega.0c00732 Text en Copyright © 2020 American Chemical Society This is an open access article published under a Creative Commons Attribution (CC-BY) License (http://pubs.acs.org/page/policy/authorchoice_ccby_termsofuse.html) , which permits unrestricted use, distribution and reproduction in any medium, provided the author and source are cited.
spellingShingle Mayhew, Alfred W.
Topping, David O.
Hamilton, Jacqueline F.
New Approach Combining Molecular Fingerprints and Machine Learning to Estimate Relative Ionization Efficiency in Electrospray Ionization
title New Approach Combining Molecular Fingerprints and Machine Learning to Estimate Relative Ionization Efficiency in Electrospray Ionization
title_full New Approach Combining Molecular Fingerprints and Machine Learning to Estimate Relative Ionization Efficiency in Electrospray Ionization
title_fullStr New Approach Combining Molecular Fingerprints and Machine Learning to Estimate Relative Ionization Efficiency in Electrospray Ionization
title_full_unstemmed New Approach Combining Molecular Fingerprints and Machine Learning to Estimate Relative Ionization Efficiency in Electrospray Ionization
title_short New Approach Combining Molecular Fingerprints and Machine Learning to Estimate Relative Ionization Efficiency in Electrospray Ionization
title_sort new approach combining molecular fingerprints and machine learning to estimate relative ionization efficiency in electrospray ionization
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7191837/
https://www.ncbi.nlm.nih.gov/pubmed/32363303
http://dx.doi.org/10.1021/acsomega.0c00732
work_keys_str_mv AT mayhewalfredw newapproachcombiningmolecularfingerprintsandmachinelearningtoestimaterelativeionizationefficiencyinelectrosprayionization
AT toppingdavido newapproachcombiningmolecularfingerprintsandmachinelearningtoestimaterelativeionizationefficiencyinelectrosprayionization
AT hamiltonjacquelinef newapproachcombiningmolecularfingerprintsandmachinelearningtoestimaterelativeionizationefficiencyinelectrosprayionization