Cargando…

MS2Tox Machine Learning Tool for Predicting the Ecotoxicity of Unidentified Chemicals in Water by Nontarget LC-HRMS

[Image: see text] To achieve water quality objectives of the zero pollution action plan in Europe, rapid methods are needed to identify the presence of toxic substances in complex water samples. However, only a small fraction of chemicals detected with nontarget high-resolution mass spectrometry can...

Descripción completa

Detalles Bibliográficos
Autores principales: Peets, Pilleriin, Wang, Wei-Chieh, MacLeod, Matthew, Breitholtz, Magnus, Martin, Jonathan W., Kruve, Anneli
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9670854/
https://www.ncbi.nlm.nih.gov/pubmed/36269851
http://dx.doi.org/10.1021/acs.est.2c02536
_version_ 1784832421176279040
author Peets, Pilleriin
Wang, Wei-Chieh
MacLeod, Matthew
Breitholtz, Magnus
Martin, Jonathan W.
Kruve, Anneli
author_facet Peets, Pilleriin
Wang, Wei-Chieh
MacLeod, Matthew
Breitholtz, Magnus
Martin, Jonathan W.
Kruve, Anneli
author_sort Peets, Pilleriin
collection PubMed
description [Image: see text] To achieve water quality objectives of the zero pollution action plan in Europe, rapid methods are needed to identify the presence of toxic substances in complex water samples. However, only a small fraction of chemicals detected with nontarget high-resolution mass spectrometry can be identified, and fewer have ecotoxicological data available. We hypothesized that ecotoxicological data could be predicted for unknown molecular features in data-rich high-resolution mass spectrometry (HRMS) spectra, thereby circumventing time-consuming steps of molecular identification and rapidly flagging molecules of potentially high toxicity in complex samples. Here, we present MS2Tox, a machine learning method, to predict the toxicity of unidentified chemicals based on high-resolution accurate mass tandem mass spectra (MS(2)). The MS2Tox model for fish toxicity was trained and tested on 647 lethal concentration (LC(50)) values from the CompTox database and validated for 219 chemicals and 420 MS(2) spectra from MassBank. The root mean square error (RMSE) of MS2Tox predictions was below 0.89 log-mM, while the experimental repeatability of LC(50) values in CompTox was 0.44 log-mM. MS2Tox allowed accurate prediction of fish LC(50) values for 22 chemicals detected in water samples, and empirical evidence suggested the right directionality for another 68 chemicals. Moreover, by incorporating structural information, e.g., the presence of carbonyl-benzene, amide moieties, or hydroxyl groups, MS2Tox outperforms baseline models that use only the exact mass or log K(OW).
format Online
Article
Text
id pubmed-9670854
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-96708542022-11-18 MS2Tox Machine Learning Tool for Predicting the Ecotoxicity of Unidentified Chemicals in Water by Nontarget LC-HRMS Peets, Pilleriin Wang, Wei-Chieh MacLeod, Matthew Breitholtz, Magnus Martin, Jonathan W. Kruve, Anneli Environ Sci Technol [Image: see text] To achieve water quality objectives of the zero pollution action plan in Europe, rapid methods are needed to identify the presence of toxic substances in complex water samples. However, only a small fraction of chemicals detected with nontarget high-resolution mass spectrometry can be identified, and fewer have ecotoxicological data available. We hypothesized that ecotoxicological data could be predicted for unknown molecular features in data-rich high-resolution mass spectrometry (HRMS) spectra, thereby circumventing time-consuming steps of molecular identification and rapidly flagging molecules of potentially high toxicity in complex samples. Here, we present MS2Tox, a machine learning method, to predict the toxicity of unidentified chemicals based on high-resolution accurate mass tandem mass spectra (MS(2)). The MS2Tox model for fish toxicity was trained and tested on 647 lethal concentration (LC(50)) values from the CompTox database and validated for 219 chemicals and 420 MS(2) spectra from MassBank. The root mean square error (RMSE) of MS2Tox predictions was below 0.89 log-mM, while the experimental repeatability of LC(50) values in CompTox was 0.44 log-mM. MS2Tox allowed accurate prediction of fish LC(50) values for 22 chemicals detected in water samples, and empirical evidence suggested the right directionality for another 68 chemicals. Moreover, by incorporating structural information, e.g., the presence of carbonyl-benzene, amide moieties, or hydroxyl groups, MS2Tox outperforms baseline models that use only the exact mass or log K(OW). American Chemical Society 2022-10-21 2022-11-15 /pmc/articles/PMC9670854/ /pubmed/36269851 http://dx.doi.org/10.1021/acs.est.2c02536 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Peets, Pilleriin
Wang, Wei-Chieh
MacLeod, Matthew
Breitholtz, Magnus
Martin, Jonathan W.
Kruve, Anneli
MS2Tox Machine Learning Tool for Predicting the Ecotoxicity of Unidentified Chemicals in Water by Nontarget LC-HRMS
title MS2Tox Machine Learning Tool for Predicting the Ecotoxicity of Unidentified Chemicals in Water by Nontarget LC-HRMS
title_full MS2Tox Machine Learning Tool for Predicting the Ecotoxicity of Unidentified Chemicals in Water by Nontarget LC-HRMS
title_fullStr MS2Tox Machine Learning Tool for Predicting the Ecotoxicity of Unidentified Chemicals in Water by Nontarget LC-HRMS
title_full_unstemmed MS2Tox Machine Learning Tool for Predicting the Ecotoxicity of Unidentified Chemicals in Water by Nontarget LC-HRMS
title_short MS2Tox Machine Learning Tool for Predicting the Ecotoxicity of Unidentified Chemicals in Water by Nontarget LC-HRMS
title_sort ms2tox machine learning tool for predicting the ecotoxicity of unidentified chemicals in water by nontarget lc-hrms
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9670854/
https://www.ncbi.nlm.nih.gov/pubmed/36269851
http://dx.doi.org/10.1021/acs.est.2c02536
work_keys_str_mv AT peetspilleriin ms2toxmachinelearningtoolforpredictingtheecotoxicityofunidentifiedchemicalsinwaterbynontargetlchrms
AT wangweichieh ms2toxmachinelearningtoolforpredictingtheecotoxicityofunidentifiedchemicalsinwaterbynontargetlchrms
AT macleodmatthew ms2toxmachinelearningtoolforpredictingtheecotoxicityofunidentifiedchemicalsinwaterbynontargetlchrms
AT breitholtzmagnus ms2toxmachinelearningtoolforpredictingtheecotoxicityofunidentifiedchemicalsinwaterbynontargetlchrms
AT martinjonathanw ms2toxmachinelearningtoolforpredictingtheecotoxicityofunidentifiedchemicalsinwaterbynontargetlchrms
AT kruveanneli ms2toxmachinelearningtoolforpredictingtheecotoxicityofunidentifiedchemicalsinwaterbynontargetlchrms