Cargando…

KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development

Risk assessment of newly synthesised chemicals is a prerequisite for regulatory approval. In this context, in silico methods have great potential to reduce time, cost, and ultimately animal testing as they make use of the ever-growing amount of available toxicity data. Here, KnowTox is presented, a...

Descripción completa

Detalles Bibliográficos
Autores principales: Morger, Andrea, Mathea, Miriam, Achenbach, Janosch H., Wolf, Antje, Buesen, Roland, Schleifer, Klaus-Juergen, Landsiedel, Robert, Volkamer, Andrea
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7157991/
https://www.ncbi.nlm.nih.gov/pubmed/33431007
http://dx.doi.org/10.1186/s13321-020-00422-x
_version_ 1783522446509342720
author Morger, Andrea
Mathea, Miriam
Achenbach, Janosch H.
Wolf, Antje
Buesen, Roland
Schleifer, Klaus-Juergen
Landsiedel, Robert
Volkamer, Andrea
author_facet Morger, Andrea
Mathea, Miriam
Achenbach, Janosch H.
Wolf, Antje
Buesen, Roland
Schleifer, Klaus-Juergen
Landsiedel, Robert
Volkamer, Andrea
author_sort Morger, Andrea
collection PubMed
description Risk assessment of newly synthesised chemicals is a prerequisite for regulatory approval. In this context, in silico methods have great potential to reduce time, cost, and ultimately animal testing as they make use of the ever-growing amount of available toxicity data. Here, KnowTox is presented, a novel pipeline that combines three different in silico toxicology approaches to allow for confident prediction of potentially toxic effects of query compounds, i.e. machine learning models for 88 endpoints, alerts for 919 toxic substructures, and computational support for read-across. It is mainly based on the ToxCast dataset, containing after preprocessing a sparse matrix of 7912 compounds tested against 985 endpoints. When applying machine learning models, applicability and reliability of predictions for new chemicals are of utmost importance. Therefore, first, the conformal prediction technique was deployed, comprising an additional calibration step and per definition creating internally valid predictors at a given significance level. Second, to further improve validity and information efficiency, two adaptations are suggested, exemplified at the androgen receptor antagonism endpoint. An absolute increase in validity of 23% on the in-house dataset of 534 compounds could be achieved by introducing KNNRegressor normalisation. This increase in validity comes at the cost of efficiency, which could again be improved by 20% for the initial ToxCast model by balancing the dataset during model training. Finally, the value of the developed pipeline for risk assessment is discussed using two in-house triazole molecules. Compared to a single toxicity prediction method, complementing the outputs of different approaches can have a higher impact on guiding toxicity testing and de-selecting most likely harmful development-candidate compounds early in the development process.
format Online
Article
Text
id pubmed-7157991
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-71579912020-04-20 KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development Morger, Andrea Mathea, Miriam Achenbach, Janosch H. Wolf, Antje Buesen, Roland Schleifer, Klaus-Juergen Landsiedel, Robert Volkamer, Andrea J Cheminform Research Article Risk assessment of newly synthesised chemicals is a prerequisite for regulatory approval. In this context, in silico methods have great potential to reduce time, cost, and ultimately animal testing as they make use of the ever-growing amount of available toxicity data. Here, KnowTox is presented, a novel pipeline that combines three different in silico toxicology approaches to allow for confident prediction of potentially toxic effects of query compounds, i.e. machine learning models for 88 endpoints, alerts for 919 toxic substructures, and computational support for read-across. It is mainly based on the ToxCast dataset, containing after preprocessing a sparse matrix of 7912 compounds tested against 985 endpoints. When applying machine learning models, applicability and reliability of predictions for new chemicals are of utmost importance. Therefore, first, the conformal prediction technique was deployed, comprising an additional calibration step and per definition creating internally valid predictors at a given significance level. Second, to further improve validity and information efficiency, two adaptations are suggested, exemplified at the androgen receptor antagonism endpoint. An absolute increase in validity of 23% on the in-house dataset of 534 compounds could be achieved by introducing KNNRegressor normalisation. This increase in validity comes at the cost of efficiency, which could again be improved by 20% for the initial ToxCast model by balancing the dataset during model training. Finally, the value of the developed pipeline for risk assessment is discussed using two in-house triazole molecules. Compared to a single toxicity prediction method, complementing the outputs of different approaches can have a higher impact on guiding toxicity testing and de-selecting most likely harmful development-candidate compounds early in the development process. Springer International Publishing 2020-04-14 /pmc/articles/PMC7157991/ /pubmed/33431007 http://dx.doi.org/10.1186/s13321-020-00422-x Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Morger, Andrea
Mathea, Miriam
Achenbach, Janosch H.
Wolf, Antje
Buesen, Roland
Schleifer, Klaus-Juergen
Landsiedel, Robert
Volkamer, Andrea
KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development
title KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development
title_full KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development
title_fullStr KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development
title_full_unstemmed KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development
title_short KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development
title_sort knowtox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7157991/
https://www.ncbi.nlm.nih.gov/pubmed/33431007
http://dx.doi.org/10.1186/s13321-020-00422-x
work_keys_str_mv AT morgerandrea knowtoxpipelineandcasestudyforconfidentpredictionofpotentialtoxiceffectsofcompoundsinearlyphasesofdevelopment
AT matheamiriam knowtoxpipelineandcasestudyforconfidentpredictionofpotentialtoxiceffectsofcompoundsinearlyphasesofdevelopment
AT achenbachjanoschh knowtoxpipelineandcasestudyforconfidentpredictionofpotentialtoxiceffectsofcompoundsinearlyphasesofdevelopment
AT wolfantje knowtoxpipelineandcasestudyforconfidentpredictionofpotentialtoxiceffectsofcompoundsinearlyphasesofdevelopment
AT buesenroland knowtoxpipelineandcasestudyforconfidentpredictionofpotentialtoxiceffectsofcompoundsinearlyphasesofdevelopment
AT schleiferklausjuergen knowtoxpipelineandcasestudyforconfidentpredictionofpotentialtoxiceffectsofcompoundsinearlyphasesofdevelopment
AT landsiedelrobert knowtoxpipelineandcasestudyforconfidentpredictionofpotentialtoxiceffectsofcompoundsinearlyphasesofdevelopment
AT volkamerandrea knowtoxpipelineandcasestudyforconfidentpredictionofpotentialtoxiceffectsofcompoundsinearlyphasesofdevelopment