Cargando…

A novel adaptive ensemble classification framework for ADME prediction

It has now become clear that in silico prediction of ADME (absorption, distribution, metabolism, and elimination) characteristics is an important component of the drug discovery process. Therefore, there has been considerable interest in the development of in silico modeling of ADME prediction in re...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Ming, Chen, Jialei, Xu, Liwen, Shi, Xiufeng, Zhou, Xin, Xi, Zhijun, An, Rui, Wang, Xinhong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9079056/
https://www.ncbi.nlm.nih.gov/pubmed/35542768
http://dx.doi.org/10.1039/c8ra01206g
_version_ 1784702477556252672
author Yang, Ming
Chen, Jialei
Xu, Liwen
Shi, Xiufeng
Zhou, Xin
Xi, Zhijun
An, Rui
Wang, Xinhong
author_facet Yang, Ming
Chen, Jialei
Xu, Liwen
Shi, Xiufeng
Zhou, Xin
Xi, Zhijun
An, Rui
Wang, Xinhong
author_sort Yang, Ming
collection PubMed
description It has now become clear that in silico prediction of ADME (absorption, distribution, metabolism, and elimination) characteristics is an important component of the drug discovery process. Therefore, there has been considerable interest in the development of in silico modeling of ADME prediction in recent years. Despite the advances in this field, there remains challenges when facing the unbalanced and high dimensionality problems simultaneously. In this work, we introduce a novel adaptive ensemble classification framework named as AECF to deal with the above issues. AECF includes four components which are (1) data balancing, (2) generating individual models, (3) combining individual models, and (4) optimizing the ensemble. We considered five sampling methods, seven base modeling techniques, and ten ensemble rules to build a choice pool. The proper route of constructing predictive models was determined automatically according to the imbalance ratio (IR). With the adaptive characteristics of AECF, it can be used to work on the different kinds of ADME data, and the balanced data is a special case in AECF. We evaluated the performance of our approach using five extensive ADME datasets concerning Caco-2 cell permeability (CacoP), human intestinal absorption (HIA), oral bioavailability (OB), and P-glycoprotein (P-gp) binders (substrates/inhibitors, PS/PI). The performance of AECF was evaluated on two independent datasets, and the average AUC values were 0.8574–0.8602, 0.8968–0.9182, 0.7821–0.7981, 0.8139–0.8311, and 0.8874–0.8898 for CacoP, HIA, OB, PS and PI, respectively. Our results show that AECF can provide better performance and generality compared with individual models and two representative ensemble methods bagging and boosting. Furthermore, the degree of complementarity among the AECF ensemble members was investigated for the purpose of elucidating the potential advantages of our framework. We found that AECF can effectively select complementary members to construct predictive models by our auto-adaptive optimization approach, and the additional diversity in both sample and feature space mainly contribute to the complementarity of ensemble members.
format Online
Article
Text
id pubmed-9079056
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-90790562022-05-09 A novel adaptive ensemble classification framework for ADME prediction Yang, Ming Chen, Jialei Xu, Liwen Shi, Xiufeng Zhou, Xin Xi, Zhijun An, Rui Wang, Xinhong RSC Adv Chemistry It has now become clear that in silico prediction of ADME (absorption, distribution, metabolism, and elimination) characteristics is an important component of the drug discovery process. Therefore, there has been considerable interest in the development of in silico modeling of ADME prediction in recent years. Despite the advances in this field, there remains challenges when facing the unbalanced and high dimensionality problems simultaneously. In this work, we introduce a novel adaptive ensemble classification framework named as AECF to deal with the above issues. AECF includes four components which are (1) data balancing, (2) generating individual models, (3) combining individual models, and (4) optimizing the ensemble. We considered five sampling methods, seven base modeling techniques, and ten ensemble rules to build a choice pool. The proper route of constructing predictive models was determined automatically according to the imbalance ratio (IR). With the adaptive characteristics of AECF, it can be used to work on the different kinds of ADME data, and the balanced data is a special case in AECF. We evaluated the performance of our approach using five extensive ADME datasets concerning Caco-2 cell permeability (CacoP), human intestinal absorption (HIA), oral bioavailability (OB), and P-glycoprotein (P-gp) binders (substrates/inhibitors, PS/PI). The performance of AECF was evaluated on two independent datasets, and the average AUC values were 0.8574–0.8602, 0.8968–0.9182, 0.7821–0.7981, 0.8139–0.8311, and 0.8874–0.8898 for CacoP, HIA, OB, PS and PI, respectively. Our results show that AECF can provide better performance and generality compared with individual models and two representative ensemble methods bagging and boosting. Furthermore, the degree of complementarity among the AECF ensemble members was investigated for the purpose of elucidating the potential advantages of our framework. We found that AECF can effectively select complementary members to construct predictive models by our auto-adaptive optimization approach, and the additional diversity in both sample and feature space mainly contribute to the complementarity of ensemble members. The Royal Society of Chemistry 2018-03-26 /pmc/articles/PMC9079056/ /pubmed/35542768 http://dx.doi.org/10.1039/c8ra01206g Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Chemistry
Yang, Ming
Chen, Jialei
Xu, Liwen
Shi, Xiufeng
Zhou, Xin
Xi, Zhijun
An, Rui
Wang, Xinhong
A novel adaptive ensemble classification framework for ADME prediction
title A novel adaptive ensemble classification framework for ADME prediction
title_full A novel adaptive ensemble classification framework for ADME prediction
title_fullStr A novel adaptive ensemble classification framework for ADME prediction
title_full_unstemmed A novel adaptive ensemble classification framework for ADME prediction
title_short A novel adaptive ensemble classification framework for ADME prediction
title_sort novel adaptive ensemble classification framework for adme prediction
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9079056/
https://www.ncbi.nlm.nih.gov/pubmed/35542768
http://dx.doi.org/10.1039/c8ra01206g
work_keys_str_mv AT yangming anoveladaptiveensembleclassificationframeworkforadmeprediction
AT chenjialei anoveladaptiveensembleclassificationframeworkforadmeprediction
AT xuliwen anoveladaptiveensembleclassificationframeworkforadmeprediction
AT shixiufeng anoveladaptiveensembleclassificationframeworkforadmeprediction
AT zhouxin anoveladaptiveensembleclassificationframeworkforadmeprediction
AT xizhijun anoveladaptiveensembleclassificationframeworkforadmeprediction
AT anrui anoveladaptiveensembleclassificationframeworkforadmeprediction
AT wangxinhong anoveladaptiveensembleclassificationframeworkforadmeprediction
AT yangming noveladaptiveensembleclassificationframeworkforadmeprediction
AT chenjialei noveladaptiveensembleclassificationframeworkforadmeprediction
AT xuliwen noveladaptiveensembleclassificationframeworkforadmeprediction
AT shixiufeng noveladaptiveensembleclassificationframeworkforadmeprediction
AT zhouxin noveladaptiveensembleclassificationframeworkforadmeprediction
AT xizhijun noveladaptiveensembleclassificationframeworkforadmeprediction
AT anrui noveladaptiveensembleclassificationframeworkforadmeprediction
AT wangxinhong noveladaptiveensembleclassificationframeworkforadmeprediction