Cargando…
Potential for Machine Learning to Address Data Gaps in Human Toxicity and Ecotoxicity Characterization
[Image: see text] Machine Learning (ML) is increasingly applied to fill data gaps in assessments to quantify impacts associated with chemical emissions and chemicals in products. However, the systematic application of ML-based approaches to fill chemical data gaps is still limited, and their potenti...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2023
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10666540/ https://www.ncbi.nlm.nih.gov/pubmed/37914529 http://dx.doi.org/10.1021/acs.est.3c05300 |
_version_ | 1785148973834567680 |
---|---|
author | von Borries, Kerstin Holmquist, Hanna Kosnik, Marissa Beckwith, Katie V. Jolliet, Olivier Goodman, Jonathan M. Fantke, Peter |
author_facet | von Borries, Kerstin Holmquist, Hanna Kosnik, Marissa Beckwith, Katie V. Jolliet, Olivier Goodman, Jonathan M. Fantke, Peter |
author_sort | von Borries, Kerstin |
collection | PubMed |
description | [Image: see text] Machine Learning (ML) is increasingly applied to fill data gaps in assessments to quantify impacts associated with chemical emissions and chemicals in products. However, the systematic application of ML-based approaches to fill chemical data gaps is still limited, and their potential for addressing a wide range of chemicals is unknown. We prioritized chemical-related parameters for chemical toxicity characterization to inform ML model development based on two criteria: (1) each parameter’s relevance to robustly characterize chemical toxicity described by the uncertainty in characterization results attributable to each parameter and (2) the potential for ML-based approaches to predict parameter values for a wide range of chemicals described by the availability of chemicals with measured parameter data. We prioritized 13 out of 38 parameters for developing ML-based approaches, while flagging another nine with critical data gaps. For all prioritized parameters, we performed a chemical space analysis to assess further the potential for ML-based approaches to predict data for diverse chemicals considering the structural diversity of available measured data, showing that ML-based approaches can potentially predict 8–46% of marketed chemicals based on 1–10% with available measured data. Our results can systematically inform future ML model development efforts to address data gaps in chemical toxicity characterization. |
format | Online Article Text |
id | pubmed-10666540 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-106665402023-11-23 Potential for Machine Learning to Address Data Gaps in Human Toxicity and Ecotoxicity Characterization von Borries, Kerstin Holmquist, Hanna Kosnik, Marissa Beckwith, Katie V. Jolliet, Olivier Goodman, Jonathan M. Fantke, Peter Environ Sci Technol [Image: see text] Machine Learning (ML) is increasingly applied to fill data gaps in assessments to quantify impacts associated with chemical emissions and chemicals in products. However, the systematic application of ML-based approaches to fill chemical data gaps is still limited, and their potential for addressing a wide range of chemicals is unknown. We prioritized chemical-related parameters for chemical toxicity characterization to inform ML model development based on two criteria: (1) each parameter’s relevance to robustly characterize chemical toxicity described by the uncertainty in characterization results attributable to each parameter and (2) the potential for ML-based approaches to predict parameter values for a wide range of chemicals described by the availability of chemicals with measured parameter data. We prioritized 13 out of 38 parameters for developing ML-based approaches, while flagging another nine with critical data gaps. For all prioritized parameters, we performed a chemical space analysis to assess further the potential for ML-based approaches to predict data for diverse chemicals considering the structural diversity of available measured data, showing that ML-based approaches can potentially predict 8–46% of marketed chemicals based on 1–10% with available measured data. Our results can systematically inform future ML model development efforts to address data gaps in chemical toxicity characterization. American Chemical Society 2023-11-01 /pmc/articles/PMC10666540/ /pubmed/37914529 http://dx.doi.org/10.1021/acs.est.3c05300 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | von Borries, Kerstin Holmquist, Hanna Kosnik, Marissa Beckwith, Katie V. Jolliet, Olivier Goodman, Jonathan M. Fantke, Peter Potential for Machine Learning to Address Data Gaps in Human Toxicity and Ecotoxicity Characterization |
title | Potential
for Machine Learning to Address Data Gaps
in Human Toxicity and Ecotoxicity Characterization |
title_full | Potential
for Machine Learning to Address Data Gaps
in Human Toxicity and Ecotoxicity Characterization |
title_fullStr | Potential
for Machine Learning to Address Data Gaps
in Human Toxicity and Ecotoxicity Characterization |
title_full_unstemmed | Potential
for Machine Learning to Address Data Gaps
in Human Toxicity and Ecotoxicity Characterization |
title_short | Potential
for Machine Learning to Address Data Gaps
in Human Toxicity and Ecotoxicity Characterization |
title_sort | potential
for machine learning to address data gaps
in human toxicity and ecotoxicity characterization |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10666540/ https://www.ncbi.nlm.nih.gov/pubmed/37914529 http://dx.doi.org/10.1021/acs.est.3c05300 |
work_keys_str_mv | AT vonborrieskerstin potentialformachinelearningtoaddressdatagapsinhumantoxicityandecotoxicitycharacterization AT holmquisthanna potentialformachinelearningtoaddressdatagapsinhumantoxicityandecotoxicitycharacterization AT kosnikmarissa potentialformachinelearningtoaddressdatagapsinhumantoxicityandecotoxicitycharacterization AT beckwithkatiev potentialformachinelearningtoaddressdatagapsinhumantoxicityandecotoxicitycharacterization AT jollietolivier potentialformachinelearningtoaddressdatagapsinhumantoxicityandecotoxicitycharacterization AT goodmanjonathanm potentialformachinelearningtoaddressdatagapsinhumantoxicityandecotoxicitycharacterization AT fantkepeter potentialformachinelearningtoaddressdatagapsinhumantoxicityandecotoxicitycharacterization |