Cargando…
KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development
Risk assessment of newly synthesised chemicals is a prerequisite for regulatory approval. In this context, in silico methods have great potential to reduce time, cost, and ultimately animal testing as they make use of the ever-growing amount of available toxicity data. Here, KnowTox is presented, a...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7157991/ https://www.ncbi.nlm.nih.gov/pubmed/33431007 http://dx.doi.org/10.1186/s13321-020-00422-x |
_version_ | 1783522446509342720 |
---|---|
author | Morger, Andrea Mathea, Miriam Achenbach, Janosch H. Wolf, Antje Buesen, Roland Schleifer, Klaus-Juergen Landsiedel, Robert Volkamer, Andrea |
author_facet | Morger, Andrea Mathea, Miriam Achenbach, Janosch H. Wolf, Antje Buesen, Roland Schleifer, Klaus-Juergen Landsiedel, Robert Volkamer, Andrea |
author_sort | Morger, Andrea |
collection | PubMed |
description | Risk assessment of newly synthesised chemicals is a prerequisite for regulatory approval. In this context, in silico methods have great potential to reduce time, cost, and ultimately animal testing as they make use of the ever-growing amount of available toxicity data. Here, KnowTox is presented, a novel pipeline that combines three different in silico toxicology approaches to allow for confident prediction of potentially toxic effects of query compounds, i.e. machine learning models for 88 endpoints, alerts for 919 toxic substructures, and computational support for read-across. It is mainly based on the ToxCast dataset, containing after preprocessing a sparse matrix of 7912 compounds tested against 985 endpoints. When applying machine learning models, applicability and reliability of predictions for new chemicals are of utmost importance. Therefore, first, the conformal prediction technique was deployed, comprising an additional calibration step and per definition creating internally valid predictors at a given significance level. Second, to further improve validity and information efficiency, two adaptations are suggested, exemplified at the androgen receptor antagonism endpoint. An absolute increase in validity of 23% on the in-house dataset of 534 compounds could be achieved by introducing KNNRegressor normalisation. This increase in validity comes at the cost of efficiency, which could again be improved by 20% for the initial ToxCast model by balancing the dataset during model training. Finally, the value of the developed pipeline for risk assessment is discussed using two in-house triazole molecules. Compared to a single toxicity prediction method, complementing the outputs of different approaches can have a higher impact on guiding toxicity testing and de-selecting most likely harmful development-candidate compounds early in the development process. |
format | Online Article Text |
id | pubmed-7157991 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-71579912020-04-20 KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development Morger, Andrea Mathea, Miriam Achenbach, Janosch H. Wolf, Antje Buesen, Roland Schleifer, Klaus-Juergen Landsiedel, Robert Volkamer, Andrea J Cheminform Research Article Risk assessment of newly synthesised chemicals is a prerequisite for regulatory approval. In this context, in silico methods have great potential to reduce time, cost, and ultimately animal testing as they make use of the ever-growing amount of available toxicity data. Here, KnowTox is presented, a novel pipeline that combines three different in silico toxicology approaches to allow for confident prediction of potentially toxic effects of query compounds, i.e. machine learning models for 88 endpoints, alerts for 919 toxic substructures, and computational support for read-across. It is mainly based on the ToxCast dataset, containing after preprocessing a sparse matrix of 7912 compounds tested against 985 endpoints. When applying machine learning models, applicability and reliability of predictions for new chemicals are of utmost importance. Therefore, first, the conformal prediction technique was deployed, comprising an additional calibration step and per definition creating internally valid predictors at a given significance level. Second, to further improve validity and information efficiency, two adaptations are suggested, exemplified at the androgen receptor antagonism endpoint. An absolute increase in validity of 23% on the in-house dataset of 534 compounds could be achieved by introducing KNNRegressor normalisation. This increase in validity comes at the cost of efficiency, which could again be improved by 20% for the initial ToxCast model by balancing the dataset during model training. Finally, the value of the developed pipeline for risk assessment is discussed using two in-house triazole molecules. Compared to a single toxicity prediction method, complementing the outputs of different approaches can have a higher impact on guiding toxicity testing and de-selecting most likely harmful development-candidate compounds early in the development process. Springer International Publishing 2020-04-14 /pmc/articles/PMC7157991/ /pubmed/33431007 http://dx.doi.org/10.1186/s13321-020-00422-x Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Morger, Andrea Mathea, Miriam Achenbach, Janosch H. Wolf, Antje Buesen, Roland Schleifer, Klaus-Juergen Landsiedel, Robert Volkamer, Andrea KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development |
title | KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development |
title_full | KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development |
title_fullStr | KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development |
title_full_unstemmed | KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development |
title_short | KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development |
title_sort | knowtox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7157991/ https://www.ncbi.nlm.nih.gov/pubmed/33431007 http://dx.doi.org/10.1186/s13321-020-00422-x |
work_keys_str_mv | AT morgerandrea knowtoxpipelineandcasestudyforconfidentpredictionofpotentialtoxiceffectsofcompoundsinearlyphasesofdevelopment AT matheamiriam knowtoxpipelineandcasestudyforconfidentpredictionofpotentialtoxiceffectsofcompoundsinearlyphasesofdevelopment AT achenbachjanoschh knowtoxpipelineandcasestudyforconfidentpredictionofpotentialtoxiceffectsofcompoundsinearlyphasesofdevelopment AT wolfantje knowtoxpipelineandcasestudyforconfidentpredictionofpotentialtoxiceffectsofcompoundsinearlyphasesofdevelopment AT buesenroland knowtoxpipelineandcasestudyforconfidentpredictionofpotentialtoxiceffectsofcompoundsinearlyphasesofdevelopment AT schleiferklausjuergen knowtoxpipelineandcasestudyforconfidentpredictionofpotentialtoxiceffectsofcompoundsinearlyphasesofdevelopment AT landsiedelrobert knowtoxpipelineandcasestudyforconfidentpredictionofpotentialtoxiceffectsofcompoundsinearlyphasesofdevelopment AT volkamerandrea knowtoxpipelineandcasestudyforconfidentpredictionofpotentialtoxiceffectsofcompoundsinearlyphasesofdevelopment |