ChemBioSim: Enhancing Conformal Prediction of In Vivo Toxicity by Use of Predicted Bioactivities

Computational methods such as machine learning approaches have a strong track record of success in predicting the outcomes of in vitro assays. In contrast, their ability to predict in vivo endpoints is more limited due to the high number of parameters and processes that may influen...

Bibliographic Details
Main authors: Garcia de Lomana, Marina; Morger, Andrea; Norinder, Ulf; Buesen, Roland; Landsiedel, Robert; Volkamer, Andrea; Kirchmair, Johannes; Mathea, Miriam
Format: Online Article Text
Language: English
Published: American Chemical Society 2021
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8317154/
https://www.ncbi.nlm.nih.gov/pubmed/34153183
http://dx.doi.org/10.1021/acs.jcim.1c00451
author Garcia de Lomana, Marina
Morger, Andrea
Norinder, Ulf
Buesen, Roland
Landsiedel, Robert
Volkamer, Andrea
Kirchmair, Johannes
Mathea, Miriam
collection PubMed
description Computational methods such as machine learning approaches have a strong track record of success in predicting the outcomes of in vitro assays. In contrast, their ability to predict in vivo endpoints is more limited due to the high number of parameters and processes that may influence the outcome. Recent studies have shown that the combination of chemical and biological data can yield better models for in vivo endpoints. The ChemBioSim approach presented in this work aims to enhance the performance of conformal prediction models for in vivo endpoints by combining chemical information with (predicted) bioactivity assay outcomes. Three in vivo toxicological endpoints, capturing genotoxic (MNT), hepatic (DILI), and cardiological (DICC) issues, were selected for this study due to their high relevance for the registration and authorization of new compounds. Since the sparsity of available biological assay data is challenging for predictive modeling, predicted bioactivity descriptors were introduced instead. Thus, a machine learning model for each of the 373 collected biological assays was trained and applied to the compounds of the in vivo toxicity data sets. Besides the chemical descriptors (molecular fingerprints and physicochemical properties), these predicted bioactivities served as descriptors for the models of the three in vivo endpoints. For this study, a workflow based on a conformal prediction framework (a method for confidence estimation) built on random forest models was developed. Furthermore, the most relevant chemical and bioactivity descriptors for each in vivo endpoint were preselected with lasso models. The incorporation of bioactivity descriptors increased the mean F1 score of the MNT model from 0.61 to 0.70 and that of the DICC model from 0.72 to 0.82, while the mean efficiencies increased by roughly 0.10 for both endpoints. In contrast, for the DILI endpoint, no significant improvement in model performance was observed. Besides pure performance improvements, an analysis of the most important bioactivity features allowed detection of novel and less intuitive relationships between the predicted biological assay outcomes used as descriptors and the in vivo endpoints. This study presents how the prediction of in vivo toxicity endpoints can be improved by the incorporation of biological information, which is not necessarily captured by chemical descriptors, in an automated workflow. Because predicted outcomes of bioactivity assays were used, generating the bioactivity descriptors adds no experimental workload. All bioactivity CP models for deriving the predicted bioactivities, as well as the in vivo toxicity CP models, can be freely downloaded from https://doi.org/10.5281/zenodo.4761225.
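The abstract describes conformal prediction only as "a method for confidence estimation" and reports "efficiency" without defining it. The sketch below shows one common reading and is not the authors' implementation. It assumes an inductive, class-conditional (Mondrian) setup, a nonconformity score of 1 minus the predicted class probability (for example from a random forest), and efficiency measured as the share of single-label prediction sets. All function names and numbers are hypothetical.

```python
from typing import Dict, List, Set


def mondrian_p_values(cal_scores: Dict[int, List[float]],
                      test_scores: Dict[int, float]) -> Dict[int, float]:
    """Return one p-value per candidate class for a single test compound.

    cal_scores: nonconformity scores of calibration compounds, grouped by
        their true class (assumed here: 1 - predicted probability of that class).
    test_scores: nonconformity score of the test compound under each class.
    """
    p_values = {}
    for label, score in test_scores.items():
        cal = cal_scores[label]
        # Count calibration compounds of this class that conform no better.
        n_at_least = sum(1 for s in cal if s >= score)
        p_values[label] = (n_at_least + 1) / (len(cal) + 1)
    return p_values


def prediction_set(p_values: Dict[int, float], significance: float) -> Set[int]:
    """Keep every class whose p-value exceeds the chosen significance level."""
    return {label for label, p in p_values.items() if p > significance}


def efficiency(sets: List[Set[int]]) -> float:
    """Share of prediction sets containing exactly one class (assumed definition)."""
    return sum(1 for s in sets if len(s) == 1) / len(sets)


# Hypothetical calibration scores for inactive (0) and active (1) compounds.
cal = {0: [0.1, 0.2, 0.3, 0.6], 1: [0.2, 0.4, 0.5, 0.7]}
# Hypothetical test compound with predicted probability 0.8 for class 1.
p = mondrian_p_values(cal, {0: 0.8, 1: 0.2})
print(p, prediction_set(p, significance=0.2))
```

Lowering the significance level makes the sets larger: at 0.1 the same compound would receive both labels, which counts against efficiency under the definition assumed here.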
format Online
Article
Text
id pubmed-8317154
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-8317154 2021-07-28 ChemBioSim: Enhancing Conformal Prediction of In Vivo Toxicity by Use of Predicted Bioactivities Garcia de Lomana, Marina; Morger, Andrea; Norinder, Ulf; Buesen, Roland; Landsiedel, Robert; Volkamer, Andrea; Kirchmair, Johannes; Mathea, Miriam J Chem Inf Model American Chemical Society 2021-06-21 2021-07-26 /pmc/articles/PMC8317154/ /pubmed/34153183 http://dx.doi.org/10.1021/acs.jcim.1c00451 Text en © 2021 The Authors. Published by American Chemical Society. Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
title ChemBioSim: Enhancing Conformal Prediction of In Vivo Toxicity by Use of Predicted Bioactivities