Cargando…
Optimization of drug–target affinity prediction methods through feature processing schemes
MOTIVATION: Numerous high-accuracy drug–target affinity (DTA) prediction models, whose performance is heavily reliant on the drug and target feature information, are developed at the expense of complexity and interpretability. Feature extraction and optimization constitute a critical step that signi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10636279/ https://www.ncbi.nlm.nih.gov/pubmed/37812388 http://dx.doi.org/10.1093/bioinformatics/btad615 |
_version_ | 1785133178350993408 |
---|---|
author | Ru, Xiaoqing Zou, Quan Lin, Chen |
author_facet | Ru, Xiaoqing Zou, Quan Lin, Chen |
author_sort | Ru, Xiaoqing |
collection | PubMed |
description | MOTIVATION: Numerous high-accuracy drug–target affinity (DTA) prediction models, whose performance is heavily reliant on the drug and target feature information, are developed at the expense of complexity and interpretability. Feature extraction and optimization constitute a critical step that significantly influences the enhancement of model performance, robustness, and interpretability. Many existing studies aim to comprehensively characterize drugs and targets by extracting features from multiple perspectives; however, this approach has drawbacks: (i) an abundance of redundant or noisy features; and (ii) the feature sets often suffer from high dimensionality. RESULTS: In this study, to obtain a model with high accuracy and strong interpretability, we utilize various traditional and cutting-edge feature selection and dimensionality reduction techniques to process self-associated features and adjacent associated features. These optimized features are then fed into learning to rank to achieve efficient DTA prediction. Extensive experimental results on two commonly used datasets indicate that, among various feature optimization methods, the regression tree-based feature selection method is most beneficial for constructing models with good performance and strong robustness. Then, by utilizing Shapley Additive Explanations values and the incremental feature selection approach, we obtain that the high-quality feature subset consists of the top 150D features and the top 20D features have a breakthrough impact on the DTA prediction. In conclusion, our study thoroughly validates the importance of feature optimization in DTA prediction and serves as inspiration for constructing high-performance and high-interpretable models. AVAILABILITY AND IMPLEMENTATION: https://github.com/RUXIAOQING964914140/FS_DTA. |
format | Online Article Text |
id | pubmed-10636279 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-106362792023-11-11 Optimization of drug–target affinity prediction methods through feature processing schemes Ru, Xiaoqing Zou, Quan Lin, Chen Bioinformatics Original Paper MOTIVATION: Numerous high-accuracy drug–target affinity (DTA) prediction models, whose performance is heavily reliant on the drug and target feature information, are developed at the expense of complexity and interpretability. Feature extraction and optimization constitute a critical step that significantly influences the enhancement of model performance, robustness, and interpretability. Many existing studies aim to comprehensively characterize drugs and targets by extracting features from multiple perspectives; however, this approach has drawbacks: (i) an abundance of redundant or noisy features; and (ii) the feature sets often suffer from high dimensionality. RESULTS: In this study, to obtain a model with high accuracy and strong interpretability, we utilize various traditional and cutting-edge feature selection and dimensionality reduction techniques to process self-associated features and adjacent associated features. These optimized features are then fed into learning to rank to achieve efficient DTA prediction. Extensive experimental results on two commonly used datasets indicate that, among various feature optimization methods, the regression tree-based feature selection method is most beneficial for constructing models with good performance and strong robustness. Then, by utilizing Shapley Additive Explanations values and the incremental feature selection approach, we obtain that the high-quality feature subset consists of the top 150D features and the top 20D features have a breakthrough impact on the DTA prediction. In conclusion, our study thoroughly validates the importance of feature optimization in DTA prediction and serves as inspiration for constructing high-performance and high-interpretable models. AVAILABILITY AND IMPLEMENTATION: https://github.com/RUXIAOQING964914140/FS_DTA. Oxford University Press 2023-10-09 /pmc/articles/PMC10636279/ /pubmed/37812388 http://dx.doi.org/10.1093/bioinformatics/btad615 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Ru, Xiaoqing Zou, Quan Lin, Chen Optimization of drug–target affinity prediction methods through feature processing schemes |
title | Optimization of drug–target affinity prediction methods through feature processing schemes |
title_full | Optimization of drug–target affinity prediction methods through feature processing schemes |
title_fullStr | Optimization of drug–target affinity prediction methods through feature processing schemes |
title_full_unstemmed | Optimization of drug–target affinity prediction methods through feature processing schemes |
title_short | Optimization of drug–target affinity prediction methods through feature processing schemes |
title_sort | optimization of drug–target affinity prediction methods through feature processing schemes |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10636279/ https://www.ncbi.nlm.nih.gov/pubmed/37812388 http://dx.doi.org/10.1093/bioinformatics/btad615 |
work_keys_str_mv | AT ruxiaoqing optimizationofdrugtargetaffinitypredictionmethodsthroughfeatureprocessingschemes AT zouquan optimizationofdrugtargetaffinitypredictionmethodsthroughfeatureprocessingschemes AT linchen optimizationofdrugtargetaffinitypredictionmethodsthroughfeatureprocessingschemes |