Cargando…

Prediction of Retention Time and Collision Cross Section (CCS(H+), CCS(H–), and CCS(Na+)) of Emerging Contaminants Using Multiple Adaptive Regression Splines

[Image: see text] Ultra-high performance liquid chromatography coupled to ion mobility separation and high-resolution mass spectrometry instruments have proven very valuable for screening of emerging contaminants in the aquatic environment. However, when applying suspect or nontarget approaches (i.e...

Descripción completa

Detalles Bibliográficos
Autores principales: Celma, Alberto, Bade, Richard, Sancho, Juan Vicente, Hernandez, Félix, Humphries, Melissa, Bijlsma, Lubertus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9709913/
https://www.ncbi.nlm.nih.gov/pubmed/36280383
http://dx.doi.org/10.1021/acs.jcim.2c00847
_version_ 1784841263001894912
author Celma, Alberto
Bade, Richard
Sancho, Juan Vicente
Hernandez, Félix
Humphries, Melissa
Bijlsma, Lubertus
author_facet Celma, Alberto
Bade, Richard
Sancho, Juan Vicente
Hernandez, Félix
Humphries, Melissa
Bijlsma, Lubertus
author_sort Celma, Alberto
collection PubMed
description [Image: see text] Ultra-high performance liquid chromatography coupled to ion mobility separation and high-resolution mass spectrometry instruments have proven very valuable for screening of emerging contaminants in the aquatic environment. However, when applying suspect or nontarget approaches (i.e., when no reference standards are available), there is no information on retention time (RT) and collision cross-section (CCS) values to facilitate identification. In silico prediction tools of RT and CCS can therefore be of great utility to decrease the number of candidates to investigate. In this work, Multiple Adaptive Regression Splines (MARS) were evaluated for the prediction of both RT and CCS. MARS prediction models were developed and validated using a database of 477 protonated molecules, 169 deprotonated molecules, and 249 sodium adducts. Multivariate and univariate models were evaluated showing a better fit for univariate models to the experimental data. The RT model (R(2) = 0.855) showed a deviation between predicted and experimental data of ±2.32 min (95% confidence intervals). The deviation observed for CCS data of protonated molecules using the CCS(H) model (R(2) = 0.966) was ±4.05% with 95% confidence intervals. The CCS(H) model was also tested for the prediction of deprotonated molecules, resulting in deviations below ±5.86% for the 95% of the cases. Finally, a third model was developed for sodium adducts (CCS(Na), R(2) = 0.954) with deviation below ±5.25% for 95% of the cases. The developed models have been incorporated in an open-access and user-friendly online platform which represents a great advantage for third-party research laboratories for predicting both RT and CCS data.
format Online
Article
Text
id pubmed-9709913
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-97099132022-12-01 Prediction of Retention Time and Collision Cross Section (CCS(H+), CCS(H–), and CCS(Na+)) of Emerging Contaminants Using Multiple Adaptive Regression Splines Celma, Alberto Bade, Richard Sancho, Juan Vicente Hernandez, Félix Humphries, Melissa Bijlsma, Lubertus J Chem Inf Model [Image: see text] Ultra-high performance liquid chromatography coupled to ion mobility separation and high-resolution mass spectrometry instruments have proven very valuable for screening of emerging contaminants in the aquatic environment. However, when applying suspect or nontarget approaches (i.e., when no reference standards are available), there is no information on retention time (RT) and collision cross-section (CCS) values to facilitate identification. In silico prediction tools of RT and CCS can therefore be of great utility to decrease the number of candidates to investigate. In this work, Multiple Adaptive Regression Splines (MARS) were evaluated for the prediction of both RT and CCS. MARS prediction models were developed and validated using a database of 477 protonated molecules, 169 deprotonated molecules, and 249 sodium adducts. Multivariate and univariate models were evaluated showing a better fit for univariate models to the experimental data. The RT model (R(2) = 0.855) showed a deviation between predicted and experimental data of ±2.32 min (95% confidence intervals). The deviation observed for CCS data of protonated molecules using the CCS(H) model (R(2) = 0.966) was ±4.05% with 95% confidence intervals. The CCS(H) model was also tested for the prediction of deprotonated molecules, resulting in deviations below ±5.86% for the 95% of the cases. Finally, a third model was developed for sodium adducts (CCS(Na), R(2) = 0.954) with deviation below ±5.25% for 95% of the cases. The developed models have been incorporated in an open-access and user-friendly online platform which represents a great advantage for third-party research laboratories for predicting both RT and CCS data. American Chemical Society 2022-10-24 2022-11-28 /pmc/articles/PMC9709913/ /pubmed/36280383 http://dx.doi.org/10.1021/acs.jcim.2c00847 Text en © 2022 American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Celma, Alberto
Bade, Richard
Sancho, Juan Vicente
Hernandez, Félix
Humphries, Melissa
Bijlsma, Lubertus
Prediction of Retention Time and Collision Cross Section (CCS(H+), CCS(H–), and CCS(Na+)) of Emerging Contaminants Using Multiple Adaptive Regression Splines
title Prediction of Retention Time and Collision Cross Section (CCS(H+), CCS(H–), and CCS(Na+)) of Emerging Contaminants Using Multiple Adaptive Regression Splines
title_full Prediction of Retention Time and Collision Cross Section (CCS(H+), CCS(H–), and CCS(Na+)) of Emerging Contaminants Using Multiple Adaptive Regression Splines
title_fullStr Prediction of Retention Time and Collision Cross Section (CCS(H+), CCS(H–), and CCS(Na+)) of Emerging Contaminants Using Multiple Adaptive Regression Splines
title_full_unstemmed Prediction of Retention Time and Collision Cross Section (CCS(H+), CCS(H–), and CCS(Na+)) of Emerging Contaminants Using Multiple Adaptive Regression Splines
title_short Prediction of Retention Time and Collision Cross Section (CCS(H+), CCS(H–), and CCS(Na+)) of Emerging Contaminants Using Multiple Adaptive Regression Splines
title_sort prediction of retention time and collision cross section (ccs(h+), ccs(h–), and ccs(na+)) of emerging contaminants using multiple adaptive regression splines
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9709913/
https://www.ncbi.nlm.nih.gov/pubmed/36280383
http://dx.doi.org/10.1021/acs.jcim.2c00847
work_keys_str_mv AT celmaalberto predictionofretentiontimeandcollisioncrosssectionccshccshandccsnaofemergingcontaminantsusingmultipleadaptiveregressionsplines
AT baderichard predictionofretentiontimeandcollisioncrosssectionccshccshandccsnaofemergingcontaminantsusingmultipleadaptiveregressionsplines
AT sanchojuanvicente predictionofretentiontimeandcollisioncrosssectionccshccshandccsnaofemergingcontaminantsusingmultipleadaptiveregressionsplines
AT hernandezfelix predictionofretentiontimeandcollisioncrosssectionccshccshandccsnaofemergingcontaminantsusingmultipleadaptiveregressionsplines
AT humphriesmelissa predictionofretentiontimeandcollisioncrosssectionccshccshandccsnaofemergingcontaminantsusingmultipleadaptiveregressionsplines
AT bijlsmalubertus predictionofretentiontimeandcollisioncrosssectionccshccshandccsnaofemergingcontaminantsusingmultipleadaptiveregressionsplines