Transfer learning for drug–target interaction prediction

MOTIVATION: Utilizing AI-driven approaches for drug–target interaction (DTI) prediction requires large volumes of training data, which are not available for the majority of target proteins. In this study, we investigate the use of deep transfer learning for the prediction of interactions between drug...

Bibliographic Details
Main authors:	Dalkıran, Alperen; Atakan, Ahmet; Rifaioğlu, Ahmet S; Martin, Maria J; Atalay, Rengül Çetin; Acar, Aybar C; Doğan, Tunca; Atalay, Volkan
Format:	Online Article Text
Language:	English
Published:	Oxford University Press 2023
Subjects:	Biomedical Informatics
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311347/ https://www.ncbi.nlm.nih.gov/pubmed/37387156 http://dx.doi.org/10.1093/bioinformatics/btad234

author	Dalkıran, Alperen; Atakan, Ahmet; Rifaioğlu, Ahmet S; Martin, Maria J; Atalay, Rengül Çetin; Acar, Aybar C; Doğan, Tunca; Atalay, Volkan
collection	PubMed
description	MOTIVATION: Utilizing AI-driven approaches for drug–target interaction (DTI) prediction requires large volumes of training data, which are not available for the majority of target proteins. In this study, we investigate the use of deep transfer learning for the prediction of interactions between drug candidate compounds and understudied target proteins with scarce training data. The idea is to first train a deep neural network classifier on a large, generalized source training dataset and then to reuse this pre-trained network as the initial configuration for re-training/fine-tuning on a small, specialized target training dataset. To explore this idea, we selected six protein families that have critical importance in biomedicine: kinases, G-protein-coupled receptors (GPCRs), ion channels, nuclear receptors, proteases, and transporters. In two independent experiments, the protein families of transporters and nuclear receptors were individually set as the target datasets, while the remaining five families were used as the source datasets. Several target family training datasets of different sizes were formed in a controlled manner to assess the benefit provided by the transfer learning approach.
RESULTS: Here, we present a systematic evaluation of our approach by pre-training a feed-forward neural network with source training datasets and applying different modes of transfer learning from the pre-trained source network to a target dataset. The performance of deep transfer learning is evaluated and compared with that of training the same deep neural network from scratch. We found that when the training dataset contains fewer than 100 compounds, transfer learning outperforms the conventional strategy of training the system from scratch, suggesting that transfer learning is advantageous for predicting binders to understudied targets.
AVAILABILITY AND IMPLEMENTATION: The source code and datasets are available at https://github.com/cansyl/TransferLearning4DTI. Our web-based service containing the ready-to-use pre-trained models is accessible at https://tl4dti.kansil.org.
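The pre-train-then-fine-tune procedure described in the abstract can be illustrated with a minimal NumPy sketch. Everything in it is an assumption made for illustration: the data are synthetic random features rather than compound descriptors, the layer sizes and learning rate are arbitrary, and the two transfer modes shown (fine-tuning all layers, and freezing the hidden layers while re-training only the output layer) are common choices that this record does not confirm as the modes used in the study. The authors' actual implementation is the one in the repository linked above.

```python
import numpy as np

def init_mlp(sizes, rng):
    """Create (weights, bias) pairs for a feed-forward network."""
    return [(rng.normal(0.0, np.sqrt(2.0 / m), size=(m, n)), np.zeros(n))
            for m, n in zip(sizes[:-1], sizes[1:])]

def forward(params, X):
    """Return the activations of every layer; the last is a sigmoid probability."""
    acts = [X]
    for i, (W, b) in enumerate(params):
        z = acts[-1] @ W + b
        if i < len(params) - 1:
            acts.append(np.maximum(z, 0.0))                          # ReLU hidden layer
        else:
            acts.append(1.0 / (1.0 + np.exp(-np.clip(z, -30, 30))))  # sigmoid output
    return acts

def train(params, X, y, epochs, lr, frozen=0):
    """Full-batch gradient descent on binary cross-entropy.
    The first `frozen` layers keep their incoming (pre-trained) values."""
    params = [(W.copy(), b.copy()) for W, b in params]   # never mutate the input network
    y = y.reshape(-1, 1)
    for _ in range(epochs):
        acts = forward(params, X)
        delta = (acts[-1] - y) / len(X)        # gradient w.r.t. the output pre-activation
        for i in reversed(range(len(params))):
            W, b = params[i]
            grad_W, grad_b = acts[i].T @ delta, delta.sum(axis=0)
            if i > 0:
                delta = (delta @ W.T) * (acts[i] > 0)    # backpropagate through ReLU
            if i >= frozen:
                params[i] = (W - lr * grad_W, b - lr * grad_b)
    return params

def accuracy(params, X, y):
    return float(((forward(params, X)[-1].ravel() > 0.5) == (y > 0.5)).mean())

# Synthetic stand-ins (hypothetical): a large "source" task and a small, related "target" task.
rng = np.random.default_rng(0)
n_features = 32
w_true = rng.normal(size=n_features)

def make_data(n, shift):
    X = rng.normal(size=(n, n_features))
    return X, (X @ (w_true + shift) > 0).astype(float)

shift = 0.3 * rng.normal(size=n_features)
X_src, y_src = make_data(2000, 0.0)      # large source training set
X_tgt, y_tgt = make_data(50, shift)      # fewer than 100 target training samples
X_test, y_test = make_data(500, shift)   # held-out target samples

sizes = [n_features, 16, 8, 1]
source_net = train(init_mlp(sizes, rng), X_src, y_src, epochs=200, lr=0.1)  # pre-training
fine_tuned = train(source_net, X_tgt, y_tgt, epochs=50, lr=0.1)             # re-train all layers
head_only = train(source_net, X_tgt, y_tgt, epochs=50, lr=0.1, frozen=2)    # re-train output layer only
scratch = train(init_mlp(sizes, rng), X_tgt, y_tgt, epochs=50, lr=0.1)      # baseline without transfer

for name, net in [("fine-tuned", fine_tuned), ("head only", head_only), ("scratch", scratch)]:
    print(name, accuracy(net, X_test, y_test))
```

The `frozen` argument is the only difference between the two transfer modes in this sketch; the scratch baseline uses the same architecture and training budget so that any gap on the held-out target samples is attributable to the initial weights. Results on this toy data say nothing about the performance reported in the paper.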
format	Online Article Text
id	pubmed-10311347
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-10311347 2023-07-01 Bioinformatics. Oxford University Press 2023-06-30. © The Author(s) 2023. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title	Transfer learning for drug–target interaction prediction
topic	Biomedical Informatics
