Cargando…

Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants

[Image: see text] Drug development has a high failure rate, with safety properties constituting a considerable challenge. To reduce risk, in silico tools, including various machine learning methods, have been applied for toxicity prediction. However, these approaches often confront a serious problem...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wang, Yiwei, Wang, Binyou, Jiang, Jie, Guo, Jianmin, Lai, Jia, Lian, Xiao-Yuan, Wu, Jianming
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	American Chemical Society 2021
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8515573/ https://www.ncbi.nlm.nih.gov/pubmed/34661009 http://dx.doi.org/10.1021/acsomega.1c03842

_version_	1784583637897838592
author	Wang, Yiwei Wang, Binyou Jiang, Jie Guo, Jianmin Lai, Jia Lian, Xiao-Yuan Wu, Jianming
author_facet	Wang, Yiwei Wang, Binyou Jiang, Jie Guo, Jianmin Lai, Jia Lian, Xiao-Yuan Wu, Jianming
author_sort	Wang, Yiwei
collection	PubMed
description	[Image: see text] Drug development has a high failure rate, with safety properties constituting a considerable challenge. To reduce risk, in silico tools, including various machine learning methods, have been applied for toxicity prediction. However, these approaches often confront a serious problem: the training data sets are usually biased (imbalanced positive and negative samples), which would result in model training difficulty and unsatisfactory prediction accuracy. Multitask networks obtained significantly better predictive accuracies than single-task methods, and capsule neural networks showed excellent performance in sparse data sets in previous studies. In this study, we developed a new multitask framework based on a capsule neural network (multitask CapsNet) to measure 12 different toxic effects simultaneously. We found that multitask CapsNet excelled in toxicity prediction and outperformed many other computational approaches using the multitask strategy. Only after training on biased data sets did multitask CapsNet achieve significantly improved prediction accuracy on the Tox21 Data Challenge, which gave the largest ratio of highest accuracy (8/12) among compared models. Our model gave a prediction accuracy of 96.6% for the target NR.PPAR.gamma, whose ratio of negative to positive samples was up to 36:1. These results suggested that multitask CapsNet could overcome the bias problems and would provide a novel, accurate, and efficient approach for predicting the toxicities of compounds.
format	Online Article Text
id	pubmed-8515573
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	American Chemical Society
record_format	MEDLINE/PubMed
spelling	pubmed-85155732021-10-15 Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants Wang, Yiwei Wang, Binyou Jiang, Jie Guo, Jianmin Lai, Jia Lian, Xiao-Yuan Wu, Jianming ACS Omega [Image: see text] Drug development has a high failure rate, with safety properties constituting a considerable challenge. To reduce risk, in silico tools, including various machine learning methods, have been applied for toxicity prediction. However, these approaches often confront a serious problem: the training data sets are usually biased (imbalanced positive and negative samples), which would result in model training difficulty and unsatisfactory prediction accuracy. Multitask networks obtained significantly better predictive accuracies than single-task methods, and capsule neural networks showed excellent performance in sparse data sets in previous studies. In this study, we developed a new multitask framework based on a capsule neural network (multitask CapsNet) to measure 12 different toxic effects simultaneously. We found that multitask CapsNet excelled in toxicity prediction and outperformed many other computational approaches using the multitask strategy. Only after training on biased data sets did multitask CapsNet achieve significantly improved prediction accuracy on the Tox21 Data Challenge, which gave the largest ratio of highest accuracy (8/12) among compared models. Our model gave a prediction accuracy of 96.6% for the target NR.PPAR.gamma, whose ratio of negative to positive samples was up to 36:1. These results suggested that multitask CapsNet could overcome the bias problems and would provide a novel, accurate, and efficient approach for predicting the toxicities of compounds. American Chemical Society 2021-09-29 /pmc/articles/PMC8515573/ /pubmed/34661009 http://dx.doi.org/10.1021/acsomega.1c03842 Text en © 2021 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle	Wang, Yiwei Wang, Binyou Jiang, Jie Guo, Jianmin Lai, Jia Lian, Xiao-Yuan Wu, Jianming Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants
title	Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants
title_full	Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants
title_fullStr	Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants
title_full_unstemmed	Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants
title_short	Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants
title_sort	multitask capsnet: an imbalanced data deep learning method for predicting toxicants
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8515573/ https://www.ncbi.nlm.nih.gov/pubmed/34661009 http://dx.doi.org/10.1021/acsomega.1c03842
work_keys_str_mv	AT wangyiwei multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants AT wangbinyou multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants AT jiangjie multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants AT guojianmin multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants AT laijia multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants AT lianxiaoyuan multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants AT wujianming multitaskcapsnetanimbalanceddatadeeplearningmethodforpredictingtoxicants

Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants

Ejemplares similares