Cargando…
BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach
Drug-target binding affinity prediction plays a key role in the early stage of drug discovery. Numerous experimental and data-driven approaches have been developed for predicting drug-target binding affinity. However, experimental methods highly rely on the limited structural-related information fro...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10096306/ https://www.ncbi.nlm.nih.gov/pubmed/37000857 http://dx.doi.org/10.1371/journal.pcbi.1011036 |
_version_ | 1785024303998173184 |
---|---|
author | Kalemati, Mahmood Zamani Emani, Mojtaba Koohi, Somayyeh |
author_facet | Kalemati, Mahmood Zamani Emani, Mojtaba Koohi, Somayyeh |
author_sort | Kalemati, Mahmood |
collection | PubMed |
description | Drug-target binding affinity prediction plays a key role in the early stage of drug discovery. Numerous experimental and data-driven approaches have been developed for predicting drug-target binding affinity. However, experimental methods highly rely on the limited structural-related information from drug-target pairs, domain knowledge, and time-consuming assays. On the other hand, learning-based methods have shown an acceptable prediction performance. However, most of them utilize several simple and complex types of proteins and drug compounds data, ranging from the protein sequences to the topology of a graph representation of drug compounds, employing multiple deep neural networks for encoding and feature extraction, and so, leads to the computational overheads. In this study, we propose a unified measure for protein sequence encoding, named BiComp, which provides compression-based and evolutionary-related features from the protein sequences. Specifically, we employ Normalized Compression Distance and Smith-Waterman measures for capturing complementary information from the algorithmic information theory and biological domains, respectively. We utilize the proposed measure to encode the input proteins feeding a new deep neural network-based method for drug-target binding affinity prediction, named BiComp-DTA. BiComp-DTA is evaluated utilizing four benchmark datasets for drug-target binding affinity prediction. Compared to the state-of-the-art methods, which employ complex models for protein encoding and feature extraction, BiComp-DTA provides superior efficiency in terms of accuracy, runtime, and the number of trainable parameters. The latter achievement facilitates execution of BiComp-DTA on a normal desktop computer in a fast fashion. As a comparative study, we evaluate BiComp’s efficiency against its components for drug-target binding affinity prediction. The results have shown superior accuracy of BiComp due to the orthogonality and complementary nature of Smith-Waterman and Normalized Compression Distance measures for protein sequences. Such a protein sequence encoding provides efficient representation with no need for multiple sources of information, deep domain knowledge, and complex neural networks. |
format | Online Article Text |
id | pubmed-10096306 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-100963062023-04-13 BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach Kalemati, Mahmood Zamani Emani, Mojtaba Koohi, Somayyeh PLoS Comput Biol Research Article Drug-target binding affinity prediction plays a key role in the early stage of drug discovery. Numerous experimental and data-driven approaches have been developed for predicting drug-target binding affinity. However, experimental methods highly rely on the limited structural-related information from drug-target pairs, domain knowledge, and time-consuming assays. On the other hand, learning-based methods have shown an acceptable prediction performance. However, most of them utilize several simple and complex types of proteins and drug compounds data, ranging from the protein sequences to the topology of a graph representation of drug compounds, employing multiple deep neural networks for encoding and feature extraction, and so, leads to the computational overheads. In this study, we propose a unified measure for protein sequence encoding, named BiComp, which provides compression-based and evolutionary-related features from the protein sequences. Specifically, we employ Normalized Compression Distance and Smith-Waterman measures for capturing complementary information from the algorithmic information theory and biological domains, respectively. We utilize the proposed measure to encode the input proteins feeding a new deep neural network-based method for drug-target binding affinity prediction, named BiComp-DTA. BiComp-DTA is evaluated utilizing four benchmark datasets for drug-target binding affinity prediction. Compared to the state-of-the-art methods, which employ complex models for protein encoding and feature extraction, BiComp-DTA provides superior efficiency in terms of accuracy, runtime, and the number of trainable parameters. The latter achievement facilitates execution of BiComp-DTA on a normal desktop computer in a fast fashion. As a comparative study, we evaluate BiComp’s efficiency against its components for drug-target binding affinity prediction. The results have shown superior accuracy of BiComp due to the orthogonality and complementary nature of Smith-Waterman and Normalized Compression Distance measures for protein sequences. Such a protein sequence encoding provides efficient representation with no need for multiple sources of information, deep domain knowledge, and complex neural networks. Public Library of Science 2023-03-31 /pmc/articles/PMC10096306/ /pubmed/37000857 http://dx.doi.org/10.1371/journal.pcbi.1011036 Text en © 2023 Kalemati et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Kalemati, Mahmood Zamani Emani, Mojtaba Koohi, Somayyeh BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach |
title | BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach |
title_full | BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach |
title_fullStr | BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach |
title_full_unstemmed | BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach |
title_short | BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach |
title_sort | bicomp-dta: drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10096306/ https://www.ncbi.nlm.nih.gov/pubmed/37000857 http://dx.doi.org/10.1371/journal.pcbi.1011036 |
work_keys_str_mv | AT kalematimahmood bicompdtadrugtargetbindingaffinitypredictionthroughcomplementarybiologicalrelatedandcompressionbasedfeaturizationapproach AT zamaniemanimojtaba bicompdtadrugtargetbindingaffinitypredictionthroughcomplementarybiologicalrelatedandcompressionbasedfeaturizationapproach AT koohisomayyeh bicompdtadrugtargetbindingaffinitypredictionthroughcomplementarybiologicalrelatedandcompressionbasedfeaturizationapproach |