Cargando…
The Bigger Fish: A Comparison of Meta-Learning QSAR Models on Low-Resourced Aquatic Toxicity Regression Tasks
[Image: see text] Toxicological information as needed for risk assessments of chemical compounds is often sparse. Unfortunately, gathering new toxicological information experimentally often involves animal testing. Simulated alternatives, e.g., quantitative structure–activity relationship (QSAR) mod...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2023
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10666535/ https://www.ncbi.nlm.nih.gov/pubmed/37315216 http://dx.doi.org/10.1021/acs.est.3c00334 |
_version_ | 1785148972708397056 |
---|---|
author | Schlender, Thalea Viljanen, Markus van Rijn, Jan N. Mohr, Felix Peijnenburg, Willie JGM. Hoos, Holger H. Rorije, Emiel Wong, Albert |
author_facet | Schlender, Thalea Viljanen, Markus van Rijn, Jan N. Mohr, Felix Peijnenburg, Willie JGM. Hoos, Holger H. Rorije, Emiel Wong, Albert |
author_sort | Schlender, Thalea |
collection | PubMed |
description | [Image: see text] Toxicological information as needed for risk assessments of chemical compounds is often sparse. Unfortunately, gathering new toxicological information experimentally often involves animal testing. Simulated alternatives, e.g., quantitative structure–activity relationship (QSAR) models, are preferred to infer the toxicity of new compounds. Aquatic toxicity data collections consist of many related tasks—each predicting the toxicity of new compounds on a given species. Since many of these tasks are inherently low-resource, i.e., involve few associated compounds, this is challenging. Meta-learning is a subfield of artificial intelligence that can lead to more accurate models by enabling the utilization of information across tasks. In our work, we benchmark various state-of-the-art meta-learning techniques for building QSAR models, focusing on knowledge sharing between species. Specifically, we employ and compare transformational machine learning, model-agnostic meta-learning, fine-tuning, and multi-task models. Our experiments show that established knowledge-sharing techniques outperform single-task approaches. We recommend the use of multi-task random forest models for aquatic toxicity modeling, which matched or exceeded the performance of other approaches and robustly produced good results in the low-resource settings we studied. This model functions on a species level, predicting toxicity for multiple species across various phyla, with flexible exposure duration and on a large chemical applicability domain. |
format | Online Article Text |
id | pubmed-10666535 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-106665352023-11-23 The Bigger Fish: A Comparison of Meta-Learning QSAR Models on Low-Resourced Aquatic Toxicity Regression Tasks Schlender, Thalea Viljanen, Markus van Rijn, Jan N. Mohr, Felix Peijnenburg, Willie JGM. Hoos, Holger H. Rorije, Emiel Wong, Albert Environ Sci Technol [Image: see text] Toxicological information as needed for risk assessments of chemical compounds is often sparse. Unfortunately, gathering new toxicological information experimentally often involves animal testing. Simulated alternatives, e.g., quantitative structure–activity relationship (QSAR) models, are preferred to infer the toxicity of new compounds. Aquatic toxicity data collections consist of many related tasks—each predicting the toxicity of new compounds on a given species. Since many of these tasks are inherently low-resource, i.e., involve few associated compounds, this is challenging. Meta-learning is a subfield of artificial intelligence that can lead to more accurate models by enabling the utilization of information across tasks. In our work, we benchmark various state-of-the-art meta-learning techniques for building QSAR models, focusing on knowledge sharing between species. Specifically, we employ and compare transformational machine learning, model-agnostic meta-learning, fine-tuning, and multi-task models. Our experiments show that established knowledge-sharing techniques outperform single-task approaches. We recommend the use of multi-task random forest models for aquatic toxicity modeling, which matched or exceeded the performance of other approaches and robustly produced good results in the low-resource settings we studied. This model functions on a species level, predicting toxicity for multiple species across various phyla, with flexible exposure duration and on a large chemical applicability domain. American Chemical Society 2023-06-14 /pmc/articles/PMC10666535/ /pubmed/37315216 http://dx.doi.org/10.1021/acs.est.3c00334 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Schlender, Thalea Viljanen, Markus van Rijn, Jan N. Mohr, Felix Peijnenburg, Willie JGM. Hoos, Holger H. Rorije, Emiel Wong, Albert The Bigger Fish: A Comparison of Meta-Learning QSAR Models on Low-Resourced Aquatic Toxicity Regression Tasks |
title | The
Bigger Fish:
A Comparison of Meta-Learning QSAR
Models on Low-Resourced Aquatic Toxicity Regression Tasks |
title_full | The
Bigger Fish:
A Comparison of Meta-Learning QSAR
Models on Low-Resourced Aquatic Toxicity Regression Tasks |
title_fullStr | The
Bigger Fish:
A Comparison of Meta-Learning QSAR
Models on Low-Resourced Aquatic Toxicity Regression Tasks |
title_full_unstemmed | The
Bigger Fish:
A Comparison of Meta-Learning QSAR
Models on Low-Resourced Aquatic Toxicity Regression Tasks |
title_short | The
Bigger Fish:
A Comparison of Meta-Learning QSAR
Models on Low-Resourced Aquatic Toxicity Regression Tasks |
title_sort | the
bigger fish:
a comparison of meta-learning qsar
models on low-resourced aquatic toxicity regression tasks |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10666535/ https://www.ncbi.nlm.nih.gov/pubmed/37315216 http://dx.doi.org/10.1021/acs.est.3c00334 |
work_keys_str_mv | AT schlenderthalea thebiggerfishacomparisonofmetalearningqsarmodelsonlowresourcedaquatictoxicityregressiontasks AT viljanenmarkus thebiggerfishacomparisonofmetalearningqsarmodelsonlowresourcedaquatictoxicityregressiontasks AT vanrijnjann thebiggerfishacomparisonofmetalearningqsarmodelsonlowresourcedaquatictoxicityregressiontasks AT mohrfelix thebiggerfishacomparisonofmetalearningqsarmodelsonlowresourcedaquatictoxicityregressiontasks AT peijnenburgwilliejgm thebiggerfishacomparisonofmetalearningqsarmodelsonlowresourcedaquatictoxicityregressiontasks AT hoosholgerh thebiggerfishacomparisonofmetalearningqsarmodelsonlowresourcedaquatictoxicityregressiontasks AT rorijeemiel thebiggerfishacomparisonofmetalearningqsarmodelsonlowresourcedaquatictoxicityregressiontasks AT wongalbert thebiggerfishacomparisonofmetalearningqsarmodelsonlowresourcedaquatictoxicityregressiontasks |