Cargando…
Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants
[Image: see text] Drug development has a high failure rate, with safety properties constituting a considerable challenge. To reduce risk, in silico tools, including various machine learning methods, have been applied for toxicity prediction. However, these approaches often confront a serious problem...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2021
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8515573/ https://www.ncbi.nlm.nih.gov/pubmed/34661009 http://dx.doi.org/10.1021/acsomega.1c03842 |
_version_ | 1784583637897838592 |
---|---|
author | Wang, Yiwei Wang, Binyou Jiang, Jie Guo, Jianmin Lai, Jia Lian, Xiao-Yuan Wu, Jianming |
author_facet | Wang, Yiwei Wang, Binyou Jiang, Jie Guo, Jianmin Lai, Jia Lian, Xiao-Yuan Wu, Jianming |
author_sort | Wang, Yiwei |
collection | PubMed |
description | [Image: see text] Drug development has a high failure rate, with safety properties constituting a considerable challenge. To reduce risk, in silico tools, including various machine learning methods, have been applied for toxicity prediction. However, these approaches often confront a serious problem: the training data sets are usually biased (imbalanced positive and negative samples), which would result in model training difficulty and unsatisfactory prediction accuracy. Multitask networks obtained significantly better predictive accuracies than single-task methods, and capsule neural networks showed excellent performance in sparse data sets in previous studies. In this study, we developed a new multitask framework based on a capsule neural network (multitask CapsNet) to measure 12 different toxic effects simultaneously. We found that multitask CapsNet excelled in toxicity prediction and outperformed many other computational approaches using the multitask strategy. Only after training on biased data sets did multitask CapsNet achieve significantly improved prediction accuracy on the Tox21 Data Challenge, which gave the largest ratio of highest accuracy (8/12) among compared models. Our model gave a prediction accuracy of 96.6% for the target NR.PPAR.gamma, whose ratio of negative to positive samples was up to 36:1. These results suggested that multitask CapsNet could overcome the bias problems and would provide a novel, accurate, and efficient approach for predicting the toxicities of compounds. |
format | Online Article Text |
id | pubmed-8515573 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-85155732021-10-15 Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants Wang, Yiwei Wang, Binyou Jiang, Jie Guo, Jianmin Lai, Jia Lian, Xiao-Yuan Wu, Jianming ACS Omega [Image: see text] Drug development has a high failure rate, with safety properties constituting a considerable challenge. To reduce risk, in silico tools, including various machine learning methods, have been applied for toxicity prediction. However, these approaches often confront a serious problem: the training data sets are usually biased (imbalanced positive and negative samples), which would result in model training difficulty and unsatisfactory prediction accuracy. Multitask networks obtained significantly better predictive accuracies than single-task methods, and capsule neural networks showed excellent performance in sparse data sets in previous studies. In this study, we developed a new multitask framework based on a capsule neural network (multitask CapsNet) to measure 12 different toxic effects simultaneously. We found that multitask CapsNet excelled in toxicity prediction and outperformed many other computational approaches using the multitask strategy. Only after training on biased data sets did multitask CapsNet achieve significantly improved prediction accuracy on the Tox21 Data Challenge, which gave the largest ratio of highest accuracy (8/12) among compared models. Our model gave a prediction accuracy of 96.6% for the target NR.PPAR.gamma, whose ratio of negative to positive samples was up to 36:1. These results suggested that multitask CapsNet could overcome the bias problems and would provide a novel, accurate, and efficient approach for predicting the toxicities of compounds. American Chemical Society 2021-09-29 /pmc/articles/PMC8515573/ /pubmed/34661009 http://dx.doi.org/10.1021/acsomega.1c03842 Text en © 2021 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Wang, Yiwei Wang, Binyou Jiang, Jie Guo, Jianmin Lai, Jia Lian, Xiao-Yuan Wu, Jianming Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants |
title | Multitask CapsNet: An Imbalanced Data Deep Learning
Method for Predicting Toxicants |
title_full | Multitask CapsNet: An Imbalanced Data Deep Learning
Method for Predicting Toxicants |
title_fullStr | Multitask CapsNet: An Imbalanced Data Deep Learning
Method for Predicting Toxicants |
title_full_unstemmed | Multitask CapsNet: An Imbalanced Data Deep Learning
Method for Predicting Toxicants |
title_short | Multitask CapsNet: An Imbalanced Data Deep Learning
Method for Predicting Toxicants |
title_sort | multitask capsnet: an imbalanced data deep learning
method for predicting toxicants |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8515573/ https://www.ncbi.nlm.nih.gov/pubmed/34661009 http://dx.doi.org/10.1021/acsomega.1c03842 |
work_keys_str_mv | AT wangyiwei multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants AT wangbinyou multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants AT jiangjie multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants AT guojianmin multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants AT laijia multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants AT lianxiaoyuan multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants AT wujianming multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants |