Cargando…

Optimization of drug–target affinity prediction methods through feature processing schemes

MOTIVATION: Numerous high-accuracy drug–target affinity (DTA) prediction models, whose performance is heavily reliant on the drug and target feature information, are developed at the expense of complexity and interpretability. Feature extraction and optimization constitute a critical step that signi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ru, Xiaoqing, Zou, Quan, Lin, Chen
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2023
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10636279/ https://www.ncbi.nlm.nih.gov/pubmed/37812388 http://dx.doi.org/10.1093/bioinformatics/btad615

_version_	1785133178350993408
author	Ru, Xiaoqing Zou, Quan Lin, Chen
author_facet	Ru, Xiaoqing Zou, Quan Lin, Chen
author_sort	Ru, Xiaoqing
collection	PubMed
description	MOTIVATION: Numerous high-accuracy drug–target affinity (DTA) prediction models, whose performance is heavily reliant on the drug and target feature information, are developed at the expense of complexity and interpretability. Feature extraction and optimization constitute a critical step that significantly influences the enhancement of model performance, robustness, and interpretability. Many existing studies aim to comprehensively characterize drugs and targets by extracting features from multiple perspectives; however, this approach has drawbacks: (i) an abundance of redundant or noisy features; and (ii) the feature sets often suffer from high dimensionality. RESULTS: In this study, to obtain a model with high accuracy and strong interpretability, we utilize various traditional and cutting-edge feature selection and dimensionality reduction techniques to process self-associated features and adjacent associated features. These optimized features are then fed into learning to rank to achieve efficient DTA prediction. Extensive experimental results on two commonly used datasets indicate that, among various feature optimization methods, the regression tree-based feature selection method is most beneficial for constructing models with good performance and strong robustness. Then, by utilizing Shapley Additive Explanations values and the incremental feature selection approach, we obtain that the high-quality feature subset consists of the top 150D features and the top 20D features have a breakthrough impact on the DTA prediction. In conclusion, our study thoroughly validates the importance of feature optimization in DTA prediction and serves as inspiration for constructing high-performance and high-interpretable models. AVAILABILITY AND IMPLEMENTATION: https://github.com/RUXIAOQING964914140/FS_DTA.
format	Online Article Text
id	pubmed-10636279
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-106362792023-11-11 Optimization of drug–target affinity prediction methods through feature processing schemes Ru, Xiaoqing Zou, Quan Lin, Chen Bioinformatics Original Paper MOTIVATION: Numerous high-accuracy drug–target affinity (DTA) prediction models, whose performance is heavily reliant on the drug and target feature information, are developed at the expense of complexity and interpretability. Feature extraction and optimization constitute a critical step that significantly influences the enhancement of model performance, robustness, and interpretability. Many existing studies aim to comprehensively characterize drugs and targets by extracting features from multiple perspectives; however, this approach has drawbacks: (i) an abundance of redundant or noisy features; and (ii) the feature sets often suffer from high dimensionality. RESULTS: In this study, to obtain a model with high accuracy and strong interpretability, we utilize various traditional and cutting-edge feature selection and dimensionality reduction techniques to process self-associated features and adjacent associated features. These optimized features are then fed into learning to rank to achieve efficient DTA prediction. Extensive experimental results on two commonly used datasets indicate that, among various feature optimization methods, the regression tree-based feature selection method is most beneficial for constructing models with good performance and strong robustness. Then, by utilizing Shapley Additive Explanations values and the incremental feature selection approach, we obtain that the high-quality feature subset consists of the top 150D features and the top 20D features have a breakthrough impact on the DTA prediction. In conclusion, our study thoroughly validates the importance of feature optimization in DTA prediction and serves as inspiration for constructing high-performance and high-interpretable models. AVAILABILITY AND IMPLEMENTATION: https://github.com/RUXIAOQING964914140/FS_DTA. Oxford University Press 2023-10-09 /pmc/articles/PMC10636279/ /pubmed/37812388 http://dx.doi.org/10.1093/bioinformatics/btad615 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Paper Ru, Xiaoqing Zou, Quan Lin, Chen Optimization of drug–target affinity prediction methods through feature processing schemes
title	Optimization of drug–target affinity prediction methods through feature processing schemes
title_full	Optimization of drug–target affinity prediction methods through feature processing schemes
title_fullStr	Optimization of drug–target affinity prediction methods through feature processing schemes
title_full_unstemmed	Optimization of drug–target affinity prediction methods through feature processing schemes
title_short	Optimization of drug–target affinity prediction methods through feature processing schemes
title_sort	optimization of drug–target affinity prediction methods through feature processing schemes
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10636279/ https://www.ncbi.nlm.nih.gov/pubmed/37812388 http://dx.doi.org/10.1093/bioinformatics/btad615
work_keys_str_mv	AT ruxiaoqing optimizationofdrugtargetaffinitypredictionmethodsthroughfeatureprocessingschemes AT zouquan optimizationofdrugtargetaffinitypredictionmethodsthroughfeatureprocessingschemes AT linchen optimizationofdrugtargetaffinitypredictionmethodsthroughfeatureprocessingschemes

Optimization of drug–target affinity prediction methods through feature processing schemes

Ejemplares similares