Cargando…
MS2Tox Machine Learning Tool for Predicting the Ecotoxicity of Unidentified Chemicals in Water by Nontarget LC-HRMS
[Image: see text] To achieve water quality objectives of the zero pollution action plan in Europe, rapid methods are needed to identify the presence of toxic substances in complex water samples. However, only a small fraction of chemicals detected with nontarget high-resolution mass spectrometry can...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2022
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9670854/ https://www.ncbi.nlm.nih.gov/pubmed/36269851 http://dx.doi.org/10.1021/acs.est.2c02536 |
_version_ | 1784832421176279040 |
---|---|
author | Peets, Pilleriin Wang, Wei-Chieh MacLeod, Matthew Breitholtz, Magnus Martin, Jonathan W. Kruve, Anneli |
author_facet | Peets, Pilleriin Wang, Wei-Chieh MacLeod, Matthew Breitholtz, Magnus Martin, Jonathan W. Kruve, Anneli |
author_sort | Peets, Pilleriin |
collection | PubMed |
description | [Image: see text] To achieve water quality objectives of the zero pollution action plan in Europe, rapid methods are needed to identify the presence of toxic substances in complex water samples. However, only a small fraction of chemicals detected with nontarget high-resolution mass spectrometry can be identified, and fewer have ecotoxicological data available. We hypothesized that ecotoxicological data could be predicted for unknown molecular features in data-rich high-resolution mass spectrometry (HRMS) spectra, thereby circumventing time-consuming steps of molecular identification and rapidly flagging molecules of potentially high toxicity in complex samples. Here, we present MS2Tox, a machine learning method, to predict the toxicity of unidentified chemicals based on high-resolution accurate mass tandem mass spectra (MS(2)). The MS2Tox model for fish toxicity was trained and tested on 647 lethal concentration (LC(50)) values from the CompTox database and validated for 219 chemicals and 420 MS(2) spectra from MassBank. The root mean square error (RMSE) of MS2Tox predictions was below 0.89 log-mM, while the experimental repeatability of LC(50) values in CompTox was 0.44 log-mM. MS2Tox allowed accurate prediction of fish LC(50) values for 22 chemicals detected in water samples, and empirical evidence suggested the right directionality for another 68 chemicals. Moreover, by incorporating structural information, e.g., the presence of carbonyl-benzene, amide moieties, or hydroxyl groups, MS2Tox outperforms baseline models that use only the exact mass or log K(OW). |
format | Online Article Text |
id | pubmed-9670854 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-96708542022-11-18 MS2Tox Machine Learning Tool for Predicting the Ecotoxicity of Unidentified Chemicals in Water by Nontarget LC-HRMS Peets, Pilleriin Wang, Wei-Chieh MacLeod, Matthew Breitholtz, Magnus Martin, Jonathan W. Kruve, Anneli Environ Sci Technol [Image: see text] To achieve water quality objectives of the zero pollution action plan in Europe, rapid methods are needed to identify the presence of toxic substances in complex water samples. However, only a small fraction of chemicals detected with nontarget high-resolution mass spectrometry can be identified, and fewer have ecotoxicological data available. We hypothesized that ecotoxicological data could be predicted for unknown molecular features in data-rich high-resolution mass spectrometry (HRMS) spectra, thereby circumventing time-consuming steps of molecular identification and rapidly flagging molecules of potentially high toxicity in complex samples. Here, we present MS2Tox, a machine learning method, to predict the toxicity of unidentified chemicals based on high-resolution accurate mass tandem mass spectra (MS(2)). The MS2Tox model for fish toxicity was trained and tested on 647 lethal concentration (LC(50)) values from the CompTox database and validated for 219 chemicals and 420 MS(2) spectra from MassBank. The root mean square error (RMSE) of MS2Tox predictions was below 0.89 log-mM, while the experimental repeatability of LC(50) values in CompTox was 0.44 log-mM. MS2Tox allowed accurate prediction of fish LC(50) values for 22 chemicals detected in water samples, and empirical evidence suggested the right directionality for another 68 chemicals. Moreover, by incorporating structural information, e.g., the presence of carbonyl-benzene, amide moieties, or hydroxyl groups, MS2Tox outperforms baseline models that use only the exact mass or log K(OW). American Chemical Society 2022-10-21 2022-11-15 /pmc/articles/PMC9670854/ /pubmed/36269851 http://dx.doi.org/10.1021/acs.est.2c02536 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Peets, Pilleriin Wang, Wei-Chieh MacLeod, Matthew Breitholtz, Magnus Martin, Jonathan W. Kruve, Anneli MS2Tox Machine Learning Tool for Predicting the Ecotoxicity of Unidentified Chemicals in Water by Nontarget LC-HRMS |
title | MS2Tox Machine Learning
Tool for Predicting the Ecotoxicity
of Unidentified Chemicals in Water by Nontarget LC-HRMS |
title_full | MS2Tox Machine Learning
Tool for Predicting the Ecotoxicity
of Unidentified Chemicals in Water by Nontarget LC-HRMS |
title_fullStr | MS2Tox Machine Learning
Tool for Predicting the Ecotoxicity
of Unidentified Chemicals in Water by Nontarget LC-HRMS |
title_full_unstemmed | MS2Tox Machine Learning
Tool for Predicting the Ecotoxicity
of Unidentified Chemicals in Water by Nontarget LC-HRMS |
title_short | MS2Tox Machine Learning
Tool for Predicting the Ecotoxicity
of Unidentified Chemicals in Water by Nontarget LC-HRMS |
title_sort | ms2tox machine learning
tool for predicting the ecotoxicity
of unidentified chemicals in water by nontarget lc-hrms |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9670854/ https://www.ncbi.nlm.nih.gov/pubmed/36269851 http://dx.doi.org/10.1021/acs.est.2c02536 |
work_keys_str_mv | AT peetspilleriin ms2toxmachinelearningtoolforpredictingtheecotoxicityofunidentifiedchemicalsinwaterbynontargetlchrms AT wangweichieh ms2toxmachinelearningtoolforpredictingtheecotoxicityofunidentifiedchemicalsinwaterbynontargetlchrms AT macleodmatthew ms2toxmachinelearningtoolforpredictingtheecotoxicityofunidentifiedchemicalsinwaterbynontargetlchrms AT breitholtzmagnus ms2toxmachinelearningtoolforpredictingtheecotoxicityofunidentifiedchemicalsinwaterbynontargetlchrms AT martinjonathanw ms2toxmachinelearningtoolforpredictingtheecotoxicityofunidentifiedchemicalsinwaterbynontargetlchrms AT kruveanneli ms2toxmachinelearningtoolforpredictingtheecotoxicityofunidentifiedchemicalsinwaterbynontargetlchrms |