Cargando…
Prediction of Retention Time and Collision Cross Section (CCS(H+), CCS(H–), and CCS(Na+)) of Emerging Contaminants Using Multiple Adaptive Regression Splines
[Image: see text] Ultra-high performance liquid chromatography coupled to ion mobility separation and high-resolution mass spectrometry instruments have proven very valuable for screening of emerging contaminants in the aquatic environment. However, when applying suspect or nontarget approaches (i.e...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2022
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9709913/ https://www.ncbi.nlm.nih.gov/pubmed/36280383 http://dx.doi.org/10.1021/acs.jcim.2c00847 |
_version_ | 1784841263001894912 |
---|---|
author | Celma, Alberto Bade, Richard Sancho, Juan Vicente Hernandez, Félix Humphries, Melissa Bijlsma, Lubertus |
author_facet | Celma, Alberto Bade, Richard Sancho, Juan Vicente Hernandez, Félix Humphries, Melissa Bijlsma, Lubertus |
author_sort | Celma, Alberto |
collection | PubMed |
description | [Image: see text] Ultra-high performance liquid chromatography coupled to ion mobility separation and high-resolution mass spectrometry instruments have proven very valuable for screening of emerging contaminants in the aquatic environment. However, when applying suspect or nontarget approaches (i.e., when no reference standards are available), there is no information on retention time (RT) and collision cross-section (CCS) values to facilitate identification. In silico prediction tools of RT and CCS can therefore be of great utility to decrease the number of candidates to investigate. In this work, Multiple Adaptive Regression Splines (MARS) were evaluated for the prediction of both RT and CCS. MARS prediction models were developed and validated using a database of 477 protonated molecules, 169 deprotonated molecules, and 249 sodium adducts. Multivariate and univariate models were evaluated showing a better fit for univariate models to the experimental data. The RT model (R(2) = 0.855) showed a deviation between predicted and experimental data of ±2.32 min (95% confidence intervals). The deviation observed for CCS data of protonated molecules using the CCS(H) model (R(2) = 0.966) was ±4.05% with 95% confidence intervals. The CCS(H) model was also tested for the prediction of deprotonated molecules, resulting in deviations below ±5.86% for the 95% of the cases. Finally, a third model was developed for sodium adducts (CCS(Na), R(2) = 0.954) with deviation below ±5.25% for 95% of the cases. The developed models have been incorporated in an open-access and user-friendly online platform which represents a great advantage for third-party research laboratories for predicting both RT and CCS data. |
format | Online Article Text |
id | pubmed-9709913 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-97099132022-12-01 Prediction of Retention Time and Collision Cross Section (CCS(H+), CCS(H–), and CCS(Na+)) of Emerging Contaminants Using Multiple Adaptive Regression Splines Celma, Alberto Bade, Richard Sancho, Juan Vicente Hernandez, Félix Humphries, Melissa Bijlsma, Lubertus J Chem Inf Model [Image: see text] Ultra-high performance liquid chromatography coupled to ion mobility separation and high-resolution mass spectrometry instruments have proven very valuable for screening of emerging contaminants in the aquatic environment. However, when applying suspect or nontarget approaches (i.e., when no reference standards are available), there is no information on retention time (RT) and collision cross-section (CCS) values to facilitate identification. In silico prediction tools of RT and CCS can therefore be of great utility to decrease the number of candidates to investigate. In this work, Multiple Adaptive Regression Splines (MARS) were evaluated for the prediction of both RT and CCS. MARS prediction models were developed and validated using a database of 477 protonated molecules, 169 deprotonated molecules, and 249 sodium adducts. Multivariate and univariate models were evaluated showing a better fit for univariate models to the experimental data. The RT model (R(2) = 0.855) showed a deviation between predicted and experimental data of ±2.32 min (95% confidence intervals). The deviation observed for CCS data of protonated molecules using the CCS(H) model (R(2) = 0.966) was ±4.05% with 95% confidence intervals. The CCS(H) model was also tested for the prediction of deprotonated molecules, resulting in deviations below ±5.86% for the 95% of the cases. Finally, a third model was developed for sodium adducts (CCS(Na), R(2) = 0.954) with deviation below ±5.25% for 95% of the cases. The developed models have been incorporated in an open-access and user-friendly online platform which represents a great advantage for third-party research laboratories for predicting both RT and CCS data. American Chemical Society 2022-10-24 2022-11-28 /pmc/articles/PMC9709913/ /pubmed/36280383 http://dx.doi.org/10.1021/acs.jcim.2c00847 Text en © 2022 American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Celma, Alberto Bade, Richard Sancho, Juan Vicente Hernandez, Félix Humphries, Melissa Bijlsma, Lubertus Prediction of Retention Time and Collision Cross Section (CCS(H+), CCS(H–), and CCS(Na+)) of Emerging Contaminants Using Multiple Adaptive Regression Splines |
title | Prediction of Retention
Time and Collision Cross Section
(CCS(H+), CCS(H–), and CCS(Na+))
of Emerging Contaminants Using Multiple Adaptive Regression Splines |
title_full | Prediction of Retention
Time and Collision Cross Section
(CCS(H+), CCS(H–), and CCS(Na+))
of Emerging Contaminants Using Multiple Adaptive Regression Splines |
title_fullStr | Prediction of Retention
Time and Collision Cross Section
(CCS(H+), CCS(H–), and CCS(Na+))
of Emerging Contaminants Using Multiple Adaptive Regression Splines |
title_full_unstemmed | Prediction of Retention
Time and Collision Cross Section
(CCS(H+), CCS(H–), and CCS(Na+))
of Emerging Contaminants Using Multiple Adaptive Regression Splines |
title_short | Prediction of Retention
Time and Collision Cross Section
(CCS(H+), CCS(H–), and CCS(Na+))
of Emerging Contaminants Using Multiple Adaptive Regression Splines |
title_sort | prediction of retention
time and collision cross section
(ccs(h+), ccs(h–), and ccs(na+))
of emerging contaminants using multiple adaptive regression splines |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9709913/ https://www.ncbi.nlm.nih.gov/pubmed/36280383 http://dx.doi.org/10.1021/acs.jcim.2c00847 |
work_keys_str_mv | AT celmaalberto predictionofretentiontimeandcollisioncrosssectionccshccshandccsnaofemergingcontaminantsusingmultipleadaptiveregressionsplines AT baderichard predictionofretentiontimeandcollisioncrosssectionccshccshandccsnaofemergingcontaminantsusingmultipleadaptiveregressionsplines AT sanchojuanvicente predictionofretentiontimeandcollisioncrosssectionccshccshandccsnaofemergingcontaminantsusingmultipleadaptiveregressionsplines AT hernandezfelix predictionofretentiontimeandcollisioncrosssectionccshccshandccsnaofemergingcontaminantsusingmultipleadaptiveregressionsplines AT humphriesmelissa predictionofretentiontimeandcollisioncrosssectionccshccshandccsnaofemergingcontaminantsusingmultipleadaptiveregressionsplines AT bijlsmalubertus predictionofretentiontimeandcollisioncrosssectionccshccshandccsnaofemergingcontaminantsusingmultipleadaptiveregressionsplines |