Cargando…
A novel adaptive ensemble classification framework for ADME prediction
It has now become clear that in silico prediction of ADME (absorption, distribution, metabolism, and elimination) characteristics is an important component of the drug discovery process. Therefore, there has been considerable interest in the development of in silico modeling of ADME prediction in re...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society of Chemistry
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9079056/ https://www.ncbi.nlm.nih.gov/pubmed/35542768 http://dx.doi.org/10.1039/c8ra01206g |
_version_ | 1784702477556252672 |
---|---|
author | Yang, Ming Chen, Jialei Xu, Liwen Shi, Xiufeng Zhou, Xin Xi, Zhijun An, Rui Wang, Xinhong |
author_facet | Yang, Ming Chen, Jialei Xu, Liwen Shi, Xiufeng Zhou, Xin Xi, Zhijun An, Rui Wang, Xinhong |
author_sort | Yang, Ming |
collection | PubMed |
description | It has now become clear that in silico prediction of ADME (absorption, distribution, metabolism, and elimination) characteristics is an important component of the drug discovery process. Therefore, there has been considerable interest in the development of in silico modeling of ADME prediction in recent years. Despite the advances in this field, there remains challenges when facing the unbalanced and high dimensionality problems simultaneously. In this work, we introduce a novel adaptive ensemble classification framework named as AECF to deal with the above issues. AECF includes four components which are (1) data balancing, (2) generating individual models, (3) combining individual models, and (4) optimizing the ensemble. We considered five sampling methods, seven base modeling techniques, and ten ensemble rules to build a choice pool. The proper route of constructing predictive models was determined automatically according to the imbalance ratio (IR). With the adaptive characteristics of AECF, it can be used to work on the different kinds of ADME data, and the balanced data is a special case in AECF. We evaluated the performance of our approach using five extensive ADME datasets concerning Caco-2 cell permeability (CacoP), human intestinal absorption (HIA), oral bioavailability (OB), and P-glycoprotein (P-gp) binders (substrates/inhibitors, PS/PI). The performance of AECF was evaluated on two independent datasets, and the average AUC values were 0.8574–0.8602, 0.8968–0.9182, 0.7821–0.7981, 0.8139–0.8311, and 0.8874–0.8898 for CacoP, HIA, OB, PS and PI, respectively. Our results show that AECF can provide better performance and generality compared with individual models and two representative ensemble methods bagging and boosting. Furthermore, the degree of complementarity among the AECF ensemble members was investigated for the purpose of elucidating the potential advantages of our framework. We found that AECF can effectively select complementary members to construct predictive models by our auto-adaptive optimization approach, and the additional diversity in both sample and feature space mainly contribute to the complementarity of ensemble members. |
format | Online Article Text |
id | pubmed-9079056 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | The Royal Society of Chemistry |
record_format | MEDLINE/PubMed |
spelling | pubmed-90790562022-05-09 A novel adaptive ensemble classification framework for ADME prediction Yang, Ming Chen, Jialei Xu, Liwen Shi, Xiufeng Zhou, Xin Xi, Zhijun An, Rui Wang, Xinhong RSC Adv Chemistry It has now become clear that in silico prediction of ADME (absorption, distribution, metabolism, and elimination) characteristics is an important component of the drug discovery process. Therefore, there has been considerable interest in the development of in silico modeling of ADME prediction in recent years. Despite the advances in this field, there remains challenges when facing the unbalanced and high dimensionality problems simultaneously. In this work, we introduce a novel adaptive ensemble classification framework named as AECF to deal with the above issues. AECF includes four components which are (1) data balancing, (2) generating individual models, (3) combining individual models, and (4) optimizing the ensemble. We considered five sampling methods, seven base modeling techniques, and ten ensemble rules to build a choice pool. The proper route of constructing predictive models was determined automatically according to the imbalance ratio (IR). With the adaptive characteristics of AECF, it can be used to work on the different kinds of ADME data, and the balanced data is a special case in AECF. We evaluated the performance of our approach using five extensive ADME datasets concerning Caco-2 cell permeability (CacoP), human intestinal absorption (HIA), oral bioavailability (OB), and P-glycoprotein (P-gp) binders (substrates/inhibitors, PS/PI). The performance of AECF was evaluated on two independent datasets, and the average AUC values were 0.8574–0.8602, 0.8968–0.9182, 0.7821–0.7981, 0.8139–0.8311, and 0.8874–0.8898 for CacoP, HIA, OB, PS and PI, respectively. Our results show that AECF can provide better performance and generality compared with individual models and two representative ensemble methods bagging and boosting. Furthermore, the degree of complementarity among the AECF ensemble members was investigated for the purpose of elucidating the potential advantages of our framework. We found that AECF can effectively select complementary members to construct predictive models by our auto-adaptive optimization approach, and the additional diversity in both sample and feature space mainly contribute to the complementarity of ensemble members. The Royal Society of Chemistry 2018-03-26 /pmc/articles/PMC9079056/ /pubmed/35542768 http://dx.doi.org/10.1039/c8ra01206g Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/ |
spellingShingle | Chemistry Yang, Ming Chen, Jialei Xu, Liwen Shi, Xiufeng Zhou, Xin Xi, Zhijun An, Rui Wang, Xinhong A novel adaptive ensemble classification framework for ADME prediction |
title | A novel adaptive ensemble classification framework for ADME prediction |
title_full | A novel adaptive ensemble classification framework for ADME prediction |
title_fullStr | A novel adaptive ensemble classification framework for ADME prediction |
title_full_unstemmed | A novel adaptive ensemble classification framework for ADME prediction |
title_short | A novel adaptive ensemble classification framework for ADME prediction |
title_sort | novel adaptive ensemble classification framework for adme prediction |
topic | Chemistry |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9079056/ https://www.ncbi.nlm.nih.gov/pubmed/35542768 http://dx.doi.org/10.1039/c8ra01206g |
work_keys_str_mv | AT yangming anoveladaptiveensembleclassificationframeworkforadmeprediction AT chenjialei anoveladaptiveensembleclassificationframeworkforadmeprediction AT xuliwen anoveladaptiveensembleclassificationframeworkforadmeprediction AT shixiufeng anoveladaptiveensembleclassificationframeworkforadmeprediction AT zhouxin anoveladaptiveensembleclassificationframeworkforadmeprediction AT xizhijun anoveladaptiveensembleclassificationframeworkforadmeprediction AT anrui anoveladaptiveensembleclassificationframeworkforadmeprediction AT wangxinhong anoveladaptiveensembleclassificationframeworkforadmeprediction AT yangming noveladaptiveensembleclassificationframeworkforadmeprediction AT chenjialei noveladaptiveensembleclassificationframeworkforadmeprediction AT xuliwen noveladaptiveensembleclassificationframeworkforadmeprediction AT shixiufeng noveladaptiveensembleclassificationframeworkforadmeprediction AT zhouxin noveladaptiveensembleclassificationframeworkforadmeprediction AT xizhijun noveladaptiveensembleclassificationframeworkforadmeprediction AT anrui noveladaptiveensembleclassificationframeworkforadmeprediction AT wangxinhong noveladaptiveensembleclassificationframeworkforadmeprediction |