Cargando…

Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants

[Image: see text] Drug development has a high failure rate, with safety properties constituting a considerable challenge. To reduce risk, in silico tools, including various machine learning methods, have been applied for toxicity prediction. However, these approaches often confront a serious problem...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yiwei, Wang, Binyou, Jiang, Jie, Guo, Jianmin, Lai, Jia, Lian, Xiao-Yuan, Wu, Jianming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2021
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8515573/
https://www.ncbi.nlm.nih.gov/pubmed/34661009
http://dx.doi.org/10.1021/acsomega.1c03842
_version_ 1784583637897838592
author Wang, Yiwei
Wang, Binyou
Jiang, Jie
Guo, Jianmin
Lai, Jia
Lian, Xiao-Yuan
Wu, Jianming
author_facet Wang, Yiwei
Wang, Binyou
Jiang, Jie
Guo, Jianmin
Lai, Jia
Lian, Xiao-Yuan
Wu, Jianming
author_sort Wang, Yiwei
collection PubMed
description [Image: see text] Drug development has a high failure rate, with safety properties constituting a considerable challenge. To reduce risk, in silico tools, including various machine learning methods, have been applied for toxicity prediction. However, these approaches often confront a serious problem: the training data sets are usually biased (imbalanced positive and negative samples), which would result in model training difficulty and unsatisfactory prediction accuracy. Multitask networks obtained significantly better predictive accuracies than single-task methods, and capsule neural networks showed excellent performance in sparse data sets in previous studies. In this study, we developed a new multitask framework based on a capsule neural network (multitask CapsNet) to measure 12 different toxic effects simultaneously. We found that multitask CapsNet excelled in toxicity prediction and outperformed many other computational approaches using the multitask strategy. Only after training on biased data sets did multitask CapsNet achieve significantly improved prediction accuracy on the Tox21 Data Challenge, which gave the largest ratio of highest accuracy (8/12) among compared models. Our model gave a prediction accuracy of 96.6% for the target NR.PPAR.gamma, whose ratio of negative to positive samples was up to 36:1. These results suggested that multitask CapsNet could overcome the bias problems and would provide a novel, accurate, and efficient approach for predicting the toxicities of compounds.
format Online
Article
Text
id pubmed-8515573
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-85155732021-10-15 Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants Wang, Yiwei Wang, Binyou Jiang, Jie Guo, Jianmin Lai, Jia Lian, Xiao-Yuan Wu, Jianming ACS Omega [Image: see text] Drug development has a high failure rate, with safety properties constituting a considerable challenge. To reduce risk, in silico tools, including various machine learning methods, have been applied for toxicity prediction. However, these approaches often confront a serious problem: the training data sets are usually biased (imbalanced positive and negative samples), which would result in model training difficulty and unsatisfactory prediction accuracy. Multitask networks obtained significantly better predictive accuracies than single-task methods, and capsule neural networks showed excellent performance in sparse data sets in previous studies. In this study, we developed a new multitask framework based on a capsule neural network (multitask CapsNet) to measure 12 different toxic effects simultaneously. We found that multitask CapsNet excelled in toxicity prediction and outperformed many other computational approaches using the multitask strategy. Only after training on biased data sets did multitask CapsNet achieve significantly improved prediction accuracy on the Tox21 Data Challenge, which gave the largest ratio of highest accuracy (8/12) among compared models. Our model gave a prediction accuracy of 96.6% for the target NR.PPAR.gamma, whose ratio of negative to positive samples was up to 36:1. These results suggested that multitask CapsNet could overcome the bias problems and would provide a novel, accurate, and efficient approach for predicting the toxicities of compounds. American Chemical Society 2021-09-29 /pmc/articles/PMC8515573/ /pubmed/34661009 http://dx.doi.org/10.1021/acsomega.1c03842 Text en © 2021 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Wang, Yiwei
Wang, Binyou
Jiang, Jie
Guo, Jianmin
Lai, Jia
Lian, Xiao-Yuan
Wu, Jianming
Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants
title Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants
title_full Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants
title_fullStr Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants
title_full_unstemmed Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants
title_short Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants
title_sort multitask capsnet: an imbalanced data deep learning method for predicting toxicants
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8515573/
https://www.ncbi.nlm.nih.gov/pubmed/34661009
http://dx.doi.org/10.1021/acsomega.1c03842
work_keys_str_mv AT wangyiwei multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants
AT wangbinyou multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants
AT jiangjie multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants
AT guojianmin multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants
AT laijia multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants
AT lianxiaoyuan multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants
AT wujianming multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants