Cargando…

Improving the prediction of organism-level toxicity through integration of chemical, protein target and cytotoxicity qHTS data

Prediction of compound toxicity is essential because covering the vast chemical space requiring safety assessment using traditional experimentally-based, resource-intensive techniques is impossible. However, such prediction is nontrivial due to the complex causal relationship between compound struct...

Descripción completa

Detalles Bibliográficos
Autores principales: Allen, Chad H. G., Koutsoukas, Alexios, Cortés-Ciriano, Isidro, Murrell, Daniel S., Malliavin, Thérèse E., Glen, Robert C., Bender, Andreas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Royal Society of Chemistry 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6062365/
https://www.ncbi.nlm.nih.gov/pubmed/30090397
http://dx.doi.org/10.1039/c5tx00406c
_version_ 1783342372100243456
author Allen, Chad H. G.
Koutsoukas, Alexios
Cortés-Ciriano, Isidro
Murrell, Daniel S.
Malliavin, Thérèse E.
Glen, Robert C.
Bender, Andreas
author_facet Allen, Chad H. G.
Koutsoukas, Alexios
Cortés-Ciriano, Isidro
Murrell, Daniel S.
Malliavin, Thérèse E.
Glen, Robert C.
Bender, Andreas
author_sort Allen, Chad H. G.
collection PubMed
description Prediction of compound toxicity is essential because covering the vast chemical space requiring safety assessment using traditional experimentally-based, resource-intensive techniques is impossible. However, such prediction is nontrivial due to the complex causal relationship between compound structure and in vivo harm. Protein target annotations and in vitro experimental outcomes encode relevant bioactivity information complementary to chemicals’ structures. This work tests the hypothesis that utilizing three complementary types of data will afford predictive models that outperform traditional models built using fewer data types. A tripartite, heterogeneous descriptor set for 367 compounds was comprised of (a) chemical descriptors, (b) protein target descriptors generated using an algorithm trained on 190 000 ligand–protein interactions from ChEMBL, and (c) descriptors derived from in vitro cell cytotoxicity dose–response data from a panel of human cell lines. 100 random forests classification models for predicting rat LD(50) were built using every combination of descriptors. Successive integration of data types improved predictive performance; models built using the full dataset had an average external correct classification rate of 0.82, compared to 0.73–0.80 for models built using two data types and 0.67–0.78 for models built using one. Pairwise comparisons of models trained on the same data showed that including a third data domain on top of chemistry improved average correct classification rate by 1.4–2.4 points, with p-values <0.01. Additionally, the approach enhanced the models’ applicability domains and proved useful for generating novel mechanism hypotheses. The use of tripartite heterogeneous bioactivity datasets is a useful technique for improving toxicity prediction. Both protein target descriptors – which have the practical value of being derived in silico – and cytotoxicity descriptors derived from experiment are suitable contributors to such datasets.
format Online
Article
Text
id pubmed-6062365
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-60623652018-08-08 Improving the prediction of organism-level toxicity through integration of chemical, protein target and cytotoxicity qHTS data Allen, Chad H. G. Koutsoukas, Alexios Cortés-Ciriano, Isidro Murrell, Daniel S. Malliavin, Thérèse E. Glen, Robert C. Bender, Andreas Toxicol Res (Camb) Chemistry Prediction of compound toxicity is essential because covering the vast chemical space requiring safety assessment using traditional experimentally-based, resource-intensive techniques is impossible. However, such prediction is nontrivial due to the complex causal relationship between compound structure and in vivo harm. Protein target annotations and in vitro experimental outcomes encode relevant bioactivity information complementary to chemicals’ structures. This work tests the hypothesis that utilizing three complementary types of data will afford predictive models that outperform traditional models built using fewer data types. A tripartite, heterogeneous descriptor set for 367 compounds was comprised of (a) chemical descriptors, (b) protein target descriptors generated using an algorithm trained on 190 000 ligand–protein interactions from ChEMBL, and (c) descriptors derived from in vitro cell cytotoxicity dose–response data from a panel of human cell lines. 100 random forests classification models for predicting rat LD(50) were built using every combination of descriptors. Successive integration of data types improved predictive performance; models built using the full dataset had an average external correct classification rate of 0.82, compared to 0.73–0.80 for models built using two data types and 0.67–0.78 for models built using one. Pairwise comparisons of models trained on the same data showed that including a third data domain on top of chemistry improved average correct classification rate by 1.4–2.4 points, with p-values <0.01. Additionally, the approach enhanced the models’ applicability domains and proved useful for generating novel mechanism hypotheses. The use of tripartite heterogeneous bioactivity datasets is a useful technique for improving toxicity prediction. Both protein target descriptors – which have the practical value of being derived in silico – and cytotoxicity descriptors derived from experiment are suitable contributors to such datasets. Royal Society of Chemistry 2016-03-03 /pmc/articles/PMC6062365/ /pubmed/30090397 http://dx.doi.org/10.1039/c5tx00406c Text en This journal is © The Royal Society of Chemistry 2016 http://creativecommons.org/licenses/by/3.0/ This article is freely available. This article is licensed under a Creative Commons Attribution 3.0 Unported Licence (CC BY 3.0)
spellingShingle Chemistry
Allen, Chad H. G.
Koutsoukas, Alexios
Cortés-Ciriano, Isidro
Murrell, Daniel S.
Malliavin, Thérèse E.
Glen, Robert C.
Bender, Andreas
Improving the prediction of organism-level toxicity through integration of chemical, protein target and cytotoxicity qHTS data
title Improving the prediction of organism-level toxicity through integration of chemical, protein target and cytotoxicity qHTS data
title_full Improving the prediction of organism-level toxicity through integration of chemical, protein target and cytotoxicity qHTS data
title_fullStr Improving the prediction of organism-level toxicity through integration of chemical, protein target and cytotoxicity qHTS data
title_full_unstemmed Improving the prediction of organism-level toxicity through integration of chemical, protein target and cytotoxicity qHTS data
title_short Improving the prediction of organism-level toxicity through integration of chemical, protein target and cytotoxicity qHTS data
title_sort improving the prediction of organism-level toxicity through integration of chemical, protein target and cytotoxicity qhts data
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6062365/
https://www.ncbi.nlm.nih.gov/pubmed/30090397
http://dx.doi.org/10.1039/c5tx00406c
work_keys_str_mv AT allenchadhg improvingthepredictionoforganismleveltoxicitythroughintegrationofchemicalproteintargetandcytotoxicityqhtsdata
AT koutsoukasalexios improvingthepredictionoforganismleveltoxicitythroughintegrationofchemicalproteintargetandcytotoxicityqhtsdata
AT cortescirianoisidro improvingthepredictionoforganismleveltoxicitythroughintegrationofchemicalproteintargetandcytotoxicityqhtsdata
AT murrelldaniels improvingthepredictionoforganismleveltoxicitythroughintegrationofchemicalproteintargetandcytotoxicityqhtsdata
AT malliavintheresee improvingthepredictionoforganismleveltoxicitythroughintegrationofchemicalproteintargetandcytotoxicityqhtsdata
AT glenrobertc improvingthepredictionoforganismleveltoxicitythroughintegrationofchemicalproteintargetandcytotoxicityqhtsdata
AT benderandreas improvingthepredictionoforganismleveltoxicitythroughintegrationofchemicalproteintargetandcytotoxicityqhtsdata