Cargando…

Potential for Machine Learning to Address Data Gaps in Human Toxicity and Ecotoxicity Characterization

[Image: see text] Machine Learning (ML) is increasingly applied to fill data gaps in assessments to quantify impacts associated with chemical emissions and chemicals in products. However, the systematic application of ML-based approaches to fill chemical data gaps is still limited, and their potenti...

Descripción completa

Detalles Bibliográficos
Autores principales: von Borries, Kerstin, Holmquist, Hanna, Kosnik, Marissa, Beckwith, Katie V., Jolliet, Olivier, Goodman, Jonathan M., Fantke, Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10666540/
https://www.ncbi.nlm.nih.gov/pubmed/37914529
http://dx.doi.org/10.1021/acs.est.3c05300
_version_ 1785148973834567680
author von Borries, Kerstin
Holmquist, Hanna
Kosnik, Marissa
Beckwith, Katie V.
Jolliet, Olivier
Goodman, Jonathan M.
Fantke, Peter
author_facet von Borries, Kerstin
Holmquist, Hanna
Kosnik, Marissa
Beckwith, Katie V.
Jolliet, Olivier
Goodman, Jonathan M.
Fantke, Peter
author_sort von Borries, Kerstin
collection PubMed
description [Image: see text] Machine Learning (ML) is increasingly applied to fill data gaps in assessments to quantify impacts associated with chemical emissions and chemicals in products. However, the systematic application of ML-based approaches to fill chemical data gaps is still limited, and their potential for addressing a wide range of chemicals is unknown. We prioritized chemical-related parameters for chemical toxicity characterization to inform ML model development based on two criteria: (1) each parameter’s relevance to robustly characterize chemical toxicity described by the uncertainty in characterization results attributable to each parameter and (2) the potential for ML-based approaches to predict parameter values for a wide range of chemicals described by the availability of chemicals with measured parameter data. We prioritized 13 out of 38 parameters for developing ML-based approaches, while flagging another nine with critical data gaps. For all prioritized parameters, we performed a chemical space analysis to assess further the potential for ML-based approaches to predict data for diverse chemicals considering the structural diversity of available measured data, showing that ML-based approaches can potentially predict 8–46% of marketed chemicals based on 1–10% with available measured data. Our results can systematically inform future ML model development efforts to address data gaps in chemical toxicity characterization.
format Online
Article
Text
id pubmed-10666540
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-106665402023-11-23 Potential for Machine Learning to Address Data Gaps in Human Toxicity and Ecotoxicity Characterization von Borries, Kerstin Holmquist, Hanna Kosnik, Marissa Beckwith, Katie V. Jolliet, Olivier Goodman, Jonathan M. Fantke, Peter Environ Sci Technol [Image: see text] Machine Learning (ML) is increasingly applied to fill data gaps in assessments to quantify impacts associated with chemical emissions and chemicals in products. However, the systematic application of ML-based approaches to fill chemical data gaps is still limited, and their potential for addressing a wide range of chemicals is unknown. We prioritized chemical-related parameters for chemical toxicity characterization to inform ML model development based on two criteria: (1) each parameter’s relevance to robustly characterize chemical toxicity described by the uncertainty in characterization results attributable to each parameter and (2) the potential for ML-based approaches to predict parameter values for a wide range of chemicals described by the availability of chemicals with measured parameter data. We prioritized 13 out of 38 parameters for developing ML-based approaches, while flagging another nine with critical data gaps. For all prioritized parameters, we performed a chemical space analysis to assess further the potential for ML-based approaches to predict data for diverse chemicals considering the structural diversity of available measured data, showing that ML-based approaches can potentially predict 8–46% of marketed chemicals based on 1–10% with available measured data. Our results can systematically inform future ML model development efforts to address data gaps in chemical toxicity characterization. American Chemical Society 2023-11-01 /pmc/articles/PMC10666540/ /pubmed/37914529 http://dx.doi.org/10.1021/acs.est.3c05300 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle von Borries, Kerstin
Holmquist, Hanna
Kosnik, Marissa
Beckwith, Katie V.
Jolliet, Olivier
Goodman, Jonathan M.
Fantke, Peter
Potential for Machine Learning to Address Data Gaps in Human Toxicity and Ecotoxicity Characterization
title Potential for Machine Learning to Address Data Gaps in Human Toxicity and Ecotoxicity Characterization
title_full Potential for Machine Learning to Address Data Gaps in Human Toxicity and Ecotoxicity Characterization
title_fullStr Potential for Machine Learning to Address Data Gaps in Human Toxicity and Ecotoxicity Characterization
title_full_unstemmed Potential for Machine Learning to Address Data Gaps in Human Toxicity and Ecotoxicity Characterization
title_short Potential for Machine Learning to Address Data Gaps in Human Toxicity and Ecotoxicity Characterization
title_sort potential for machine learning to address data gaps in human toxicity and ecotoxicity characterization
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10666540/
https://www.ncbi.nlm.nih.gov/pubmed/37914529
http://dx.doi.org/10.1021/acs.est.3c05300
work_keys_str_mv AT vonborrieskerstin potentialformachinelearningtoaddressdatagapsinhumantoxicityandecotoxicitycharacterization
AT holmquisthanna potentialformachinelearningtoaddressdatagapsinhumantoxicityandecotoxicitycharacterization
AT kosnikmarissa potentialformachinelearningtoaddressdatagapsinhumantoxicityandecotoxicitycharacterization
AT beckwithkatiev potentialformachinelearningtoaddressdatagapsinhumantoxicityandecotoxicitycharacterization
AT jollietolivier potentialformachinelearningtoaddressdatagapsinhumantoxicityandecotoxicitycharacterization
AT goodmanjonathanm potentialformachinelearningtoaddressdatagapsinhumantoxicityandecotoxicitycharacterization
AT fantkepeter potentialformachinelearningtoaddressdatagapsinhumantoxicityandecotoxicitycharacterization