Cargando…

Optimization of drug–target affinity prediction methods through feature processing schemes

MOTIVATION: Numerous high-accuracy drug–target affinity (DTA) prediction models, whose performance is heavily reliant on the drug and target feature information, are developed at the expense of complexity and interpretability. Feature extraction and optimization constitute a critical step that signi...

Descripción completa

Detalles Bibliográficos
Autores principales: Ru, Xiaoqing, Zou, Quan, Lin, Chen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10636279/
https://www.ncbi.nlm.nih.gov/pubmed/37812388
http://dx.doi.org/10.1093/bioinformatics/btad615
_version_ 1785133178350993408
author Ru, Xiaoqing
Zou, Quan
Lin, Chen
author_facet Ru, Xiaoqing
Zou, Quan
Lin, Chen
author_sort Ru, Xiaoqing
collection PubMed
description MOTIVATION: Numerous high-accuracy drug–target affinity (DTA) prediction models, whose performance is heavily reliant on the drug and target feature information, are developed at the expense of complexity and interpretability. Feature extraction and optimization constitute a critical step that significantly influences the enhancement of model performance, robustness, and interpretability. Many existing studies aim to comprehensively characterize drugs and targets by extracting features from multiple perspectives; however, this approach has drawbacks: (i) an abundance of redundant or noisy features; and (ii) the feature sets often suffer from high dimensionality. RESULTS: In this study, to obtain a model with high accuracy and strong interpretability, we utilize various traditional and cutting-edge feature selection and dimensionality reduction techniques to process self-associated features and adjacent associated features. These optimized features are then fed into learning to rank to achieve efficient DTA prediction. Extensive experimental results on two commonly used datasets indicate that, among various feature optimization methods, the regression tree-based feature selection method is most beneficial for constructing models with good performance and strong robustness. Then, by utilizing Shapley Additive Explanations values and the incremental feature selection approach, we obtain that the high-quality feature subset consists of the top 150D features and the top 20D features have a breakthrough impact on the DTA prediction. In conclusion, our study thoroughly validates the importance of feature optimization in DTA prediction and serves as inspiration for constructing high-performance and high-interpretable models. AVAILABILITY AND IMPLEMENTATION: https://github.com/RUXIAOQING964914140/FS_DTA.
format Online
Article
Text
id pubmed-10636279
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106362792023-11-11 Optimization of drug–target affinity prediction methods through feature processing schemes Ru, Xiaoqing Zou, Quan Lin, Chen Bioinformatics Original Paper MOTIVATION: Numerous high-accuracy drug–target affinity (DTA) prediction models, whose performance is heavily reliant on the drug and target feature information, are developed at the expense of complexity and interpretability. Feature extraction and optimization constitute a critical step that significantly influences the enhancement of model performance, robustness, and interpretability. Many existing studies aim to comprehensively characterize drugs and targets by extracting features from multiple perspectives; however, this approach has drawbacks: (i) an abundance of redundant or noisy features; and (ii) the feature sets often suffer from high dimensionality. RESULTS: In this study, to obtain a model with high accuracy and strong interpretability, we utilize various traditional and cutting-edge feature selection and dimensionality reduction techniques to process self-associated features and adjacent associated features. These optimized features are then fed into learning to rank to achieve efficient DTA prediction. Extensive experimental results on two commonly used datasets indicate that, among various feature optimization methods, the regression tree-based feature selection method is most beneficial for constructing models with good performance and strong robustness. Then, by utilizing Shapley Additive Explanations values and the incremental feature selection approach, we obtain that the high-quality feature subset consists of the top 150D features and the top 20D features have a breakthrough impact on the DTA prediction. In conclusion, our study thoroughly validates the importance of feature optimization in DTA prediction and serves as inspiration for constructing high-performance and high-interpretable models. AVAILABILITY AND IMPLEMENTATION: https://github.com/RUXIAOQING964914140/FS_DTA. Oxford University Press 2023-10-09 /pmc/articles/PMC10636279/ /pubmed/37812388 http://dx.doi.org/10.1093/bioinformatics/btad615 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Ru, Xiaoqing
Zou, Quan
Lin, Chen
Optimization of drug–target affinity prediction methods through feature processing schemes
title Optimization of drug–target affinity prediction methods through feature processing schemes
title_full Optimization of drug–target affinity prediction methods through feature processing schemes
title_fullStr Optimization of drug–target affinity prediction methods through feature processing schemes
title_full_unstemmed Optimization of drug–target affinity prediction methods through feature processing schemes
title_short Optimization of drug–target affinity prediction methods through feature processing schemes
title_sort optimization of drug–target affinity prediction methods through feature processing schemes
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10636279/
https://www.ncbi.nlm.nih.gov/pubmed/37812388
http://dx.doi.org/10.1093/bioinformatics/btad615
work_keys_str_mv AT ruxiaoqing optimizationofdrugtargetaffinitypredictionmethodsthroughfeatureprocessingschemes
AT zouquan optimizationofdrugtargetaffinitypredictionmethodsthroughfeatureprocessingschemes
AT linchen optimizationofdrugtargetaffinitypredictionmethodsthroughfeatureprocessingschemes