Cargando…

BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach

Drug-target binding affinity prediction plays a key role in the early stage of drug discovery. Numerous experimental and data-driven approaches have been developed for predicting drug-target binding affinity. However, experimental methods highly rely on the limited structural-related information fro...

Descripción completa

Detalles Bibliográficos
Autores principales: Kalemati, Mahmood, Zamani Emani, Mojtaba, Koohi, Somayyeh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10096306/
https://www.ncbi.nlm.nih.gov/pubmed/37000857
http://dx.doi.org/10.1371/journal.pcbi.1011036
_version_ 1785024303998173184
author Kalemati, Mahmood
Zamani Emani, Mojtaba
Koohi, Somayyeh
author_facet Kalemati, Mahmood
Zamani Emani, Mojtaba
Koohi, Somayyeh
author_sort Kalemati, Mahmood
collection PubMed
description Drug-target binding affinity prediction plays a key role in the early stage of drug discovery. Numerous experimental and data-driven approaches have been developed for predicting drug-target binding affinity. However, experimental methods highly rely on the limited structural-related information from drug-target pairs, domain knowledge, and time-consuming assays. On the other hand, learning-based methods have shown an acceptable prediction performance. However, most of them utilize several simple and complex types of proteins and drug compounds data, ranging from the protein sequences to the topology of a graph representation of drug compounds, employing multiple deep neural networks for encoding and feature extraction, and so, leads to the computational overheads. In this study, we propose a unified measure for protein sequence encoding, named BiComp, which provides compression-based and evolutionary-related features from the protein sequences. Specifically, we employ Normalized Compression Distance and Smith-Waterman measures for capturing complementary information from the algorithmic information theory and biological domains, respectively. We utilize the proposed measure to encode the input proteins feeding a new deep neural network-based method for drug-target binding affinity prediction, named BiComp-DTA. BiComp-DTA is evaluated utilizing four benchmark datasets for drug-target binding affinity prediction. Compared to the state-of-the-art methods, which employ complex models for protein encoding and feature extraction, BiComp-DTA provides superior efficiency in terms of accuracy, runtime, and the number of trainable parameters. The latter achievement facilitates execution of BiComp-DTA on a normal desktop computer in a fast fashion. As a comparative study, we evaluate BiComp’s efficiency against its components for drug-target binding affinity prediction. The results have shown superior accuracy of BiComp due to the orthogonality and complementary nature of Smith-Waterman and Normalized Compression Distance measures for protein sequences. Such a protein sequence encoding provides efficient representation with no need for multiple sources of information, deep domain knowledge, and complex neural networks.
format Online
Article
Text
id pubmed-10096306
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-100963062023-04-13 BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach Kalemati, Mahmood Zamani Emani, Mojtaba Koohi, Somayyeh PLoS Comput Biol Research Article Drug-target binding affinity prediction plays a key role in the early stage of drug discovery. Numerous experimental and data-driven approaches have been developed for predicting drug-target binding affinity. However, experimental methods highly rely on the limited structural-related information from drug-target pairs, domain knowledge, and time-consuming assays. On the other hand, learning-based methods have shown an acceptable prediction performance. However, most of them utilize several simple and complex types of proteins and drug compounds data, ranging from the protein sequences to the topology of a graph representation of drug compounds, employing multiple deep neural networks for encoding and feature extraction, and so, leads to the computational overheads. In this study, we propose a unified measure for protein sequence encoding, named BiComp, which provides compression-based and evolutionary-related features from the protein sequences. Specifically, we employ Normalized Compression Distance and Smith-Waterman measures for capturing complementary information from the algorithmic information theory and biological domains, respectively. We utilize the proposed measure to encode the input proteins feeding a new deep neural network-based method for drug-target binding affinity prediction, named BiComp-DTA. BiComp-DTA is evaluated utilizing four benchmark datasets for drug-target binding affinity prediction. Compared to the state-of-the-art methods, which employ complex models for protein encoding and feature extraction, BiComp-DTA provides superior efficiency in terms of accuracy, runtime, and the number of trainable parameters. The latter achievement facilitates execution of BiComp-DTA on a normal desktop computer in a fast fashion. As a comparative study, we evaluate BiComp’s efficiency against its components for drug-target binding affinity prediction. The results have shown superior accuracy of BiComp due to the orthogonality and complementary nature of Smith-Waterman and Normalized Compression Distance measures for protein sequences. Such a protein sequence encoding provides efficient representation with no need for multiple sources of information, deep domain knowledge, and complex neural networks. Public Library of Science 2023-03-31 /pmc/articles/PMC10096306/ /pubmed/37000857 http://dx.doi.org/10.1371/journal.pcbi.1011036 Text en © 2023 Kalemati et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kalemati, Mahmood
Zamani Emani, Mojtaba
Koohi, Somayyeh
BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach
title BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach
title_full BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach
title_fullStr BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach
title_full_unstemmed BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach
title_short BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach
title_sort bicomp-dta: drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10096306/
https://www.ncbi.nlm.nih.gov/pubmed/37000857
http://dx.doi.org/10.1371/journal.pcbi.1011036
work_keys_str_mv AT kalematimahmood bicompdtadrugtargetbindingaffinitypredictionthroughcomplementarybiologicalrelatedandcompressionbasedfeaturizationapproach
AT zamaniemanimojtaba bicompdtadrugtargetbindingaffinitypredictionthroughcomplementarybiologicalrelatedandcompressionbasedfeaturizationapproach
AT koohisomayyeh bicompdtadrugtargetbindingaffinitypredictionthroughcomplementarybiologicalrelatedandcompressionbasedfeaturizationapproach