Cargando…

Accurate clinical toxicity prediction using multi-task deep neural nets and contrastive molecular explanations

Explainable machine learning for molecular toxicity prediction is a promising approach for efficient drug development and chemical safety. A predictive ML model of toxicity can reduce experimental cost and time while mitigating ethical concerns by significantly reducing animal and clinical testing....

Descripción completa

Detalles Bibliográficos
Autores principales:	Sharma, Bhanushee, Chenthamarakshan, Vijil, Dhurandhar, Amit, Pereira, Shiranee, Hendler, James A., Dordick, Jonathan S., Das, Payel
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10039880/ https://www.ncbi.nlm.nih.gov/pubmed/36966203 http://dx.doi.org/10.1038/s41598-023-31169-8

_version_	1784912363141464064
author	Sharma, Bhanushee Chenthamarakshan, Vijil Dhurandhar, Amit Pereira, Shiranee Hendler, James A. Dordick, Jonathan S. Das, Payel
author_facet	Sharma, Bhanushee Chenthamarakshan, Vijil Dhurandhar, Amit Pereira, Shiranee Hendler, James A. Dordick, Jonathan S. Das, Payel
author_sort	Sharma, Bhanushee
collection	PubMed
description	Explainable machine learning for molecular toxicity prediction is a promising approach for efficient drug development and chemical safety. A predictive ML model of toxicity can reduce experimental cost and time while mitigating ethical concerns by significantly reducing animal and clinical testing. Herein, we use a deep learning framework for simultaneously modeling in vitro, in vivo, and clinical toxicity data. Two different molecular input representations are used; Morgan fingerprints and pre-trained SMILES embeddings. A multi-task deep learning model accurately predicts toxicity for all endpoints, including clinical, as indicated by the area under the Receiver Operator Characteristic curve and balanced accuracy. In particular, pre-trained molecular SMILES embeddings as input to the multi-task model improved clinical toxicity predictions compared to existing models in MoleculeNet benchmark. Additionally, our multitask approach is comprehensive in the sense that it is comparable to state-of-the-art approaches for specific endpoints in in vitro, in vivo and clinical platforms. Through both the multi-task model and transfer learning, we were able to indicate the minimal need of in vivo data for clinical toxicity predictions. To provide confidence and explain the model’s predictions, we adapt a post-hoc contrastive explanation method that returns pertinent positive and negative features, which correspond well to known mutagenic and reactive toxicophores, such as unsubstituted bonded heteroatoms, aromatic amines, and Michael receptors. Furthermore, toxicophore recovery by pertinent feature analysis captures more of the in vitro (53%) and in vivo (56%), rather than of the clinical (8%), endpoints, and indeed uncovers a preference in known toxicophore data towards in vitro and in vivo experimental data. To our knowledge, this is the first contrastive explanation, using both present and absent substructures, for predictions of clinical and in vivo molecular toxicity.
format	Online Article Text
id	pubmed-10039880
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-100398802023-03-27 Accurate clinical toxicity prediction using multi-task deep neural nets and contrastive molecular explanations Sharma, Bhanushee Chenthamarakshan, Vijil Dhurandhar, Amit Pereira, Shiranee Hendler, James A. Dordick, Jonathan S. Das, Payel Sci Rep Article Explainable machine learning for molecular toxicity prediction is a promising approach for efficient drug development and chemical safety. A predictive ML model of toxicity can reduce experimental cost and time while mitigating ethical concerns by significantly reducing animal and clinical testing. Herein, we use a deep learning framework for simultaneously modeling in vitro, in vivo, and clinical toxicity data. Two different molecular input representations are used; Morgan fingerprints and pre-trained SMILES embeddings. A multi-task deep learning model accurately predicts toxicity for all endpoints, including clinical, as indicated by the area under the Receiver Operator Characteristic curve and balanced accuracy. In particular, pre-trained molecular SMILES embeddings as input to the multi-task model improved clinical toxicity predictions compared to existing models in MoleculeNet benchmark. Additionally, our multitask approach is comprehensive in the sense that it is comparable to state-of-the-art approaches for specific endpoints in in vitro, in vivo and clinical platforms. Through both the multi-task model and transfer learning, we were able to indicate the minimal need of in vivo data for clinical toxicity predictions. To provide confidence and explain the model’s predictions, we adapt a post-hoc contrastive explanation method that returns pertinent positive and negative features, which correspond well to known mutagenic and reactive toxicophores, such as unsubstituted bonded heteroatoms, aromatic amines, and Michael receptors. Furthermore, toxicophore recovery by pertinent feature analysis captures more of the in vitro (53%) and in vivo (56%), rather than of the clinical (8%), endpoints, and indeed uncovers a preference in known toxicophore data towards in vitro and in vivo experimental data. To our knowledge, this is the first contrastive explanation, using both present and absent substructures, for predictions of clinical and in vivo molecular toxicity. Nature Publishing Group UK 2023-03-25 /pmc/articles/PMC10039880/ /pubmed/36966203 http://dx.doi.org/10.1038/s41598-023-31169-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Sharma, Bhanushee Chenthamarakshan, Vijil Dhurandhar, Amit Pereira, Shiranee Hendler, James A. Dordick, Jonathan S. Das, Payel Accurate clinical toxicity prediction using multi-task deep neural nets and contrastive molecular explanations
title	Accurate clinical toxicity prediction using multi-task deep neural nets and contrastive molecular explanations
title_full	Accurate clinical toxicity prediction using multi-task deep neural nets and contrastive molecular explanations
title_fullStr	Accurate clinical toxicity prediction using multi-task deep neural nets and contrastive molecular explanations
title_full_unstemmed	Accurate clinical toxicity prediction using multi-task deep neural nets and contrastive molecular explanations
title_short	Accurate clinical toxicity prediction using multi-task deep neural nets and contrastive molecular explanations
title_sort	accurate clinical toxicity prediction using multi-task deep neural nets and contrastive molecular explanations
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10039880/ https://www.ncbi.nlm.nih.gov/pubmed/36966203 http://dx.doi.org/10.1038/s41598-023-31169-8
work_keys_str_mv	AT sharmabhanushee accurateclinicaltoxicitypredictionusingmultitaskdeepneuralnetsandcontrastivemolecularexplanations AT chenthamarakshanvijil accurateclinicaltoxicitypredictionusingmultitaskdeepneuralnetsandcontrastivemolecularexplanations AT dhurandharamit accurateclinicaltoxicitypredictionusingmultitaskdeepneuralnetsandcontrastivemolecularexplanations AT pereirashiranee accurateclinicaltoxicitypredictionusingmultitaskdeepneuralnetsandcontrastivemolecularexplanations AT hendlerjamesa accurateclinicaltoxicitypredictionusingmultitaskdeepneuralnetsandcontrastivemolecularexplanations AT dordickjonathans accurateclinicaltoxicitypredictionusingmultitaskdeepneuralnetsandcontrastivemolecularexplanations AT daspayel accurateclinicaltoxicitypredictionusingmultitaskdeepneuralnetsandcontrastivemolecularexplanations

Accurate clinical toxicity prediction using multi-task deep neural nets and contrastive molecular explanations

Ejemplares similares