Cargando…

Efficient machine learning model for predicting drug-target interactions with case study for Covid-19

BACKGROUND: Discover possible Drug Target Interactions (DTIs) is a decisive step in the detection of the effects of drugs as well as drug repositioning. There is a strong incentive to develop effective computational methods that can effectively predict potential DTIs, as traditional DTI laboratory e...

Descripción completa

Detalles Bibliográficos
Autores principales: El-Behery, Heba, Attia, Abdel-Fattah, El-Fishawy, Nawal, Torkey, Hanaa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Ltd. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8256690/
https://www.ncbi.nlm.nih.gov/pubmed/34271420
http://dx.doi.org/10.1016/j.compbiolchem.2021.107536
_version_ 1783718148320526336
author El-Behery, Heba
Attia, Abdel-Fattah
El-Fishawy, Nawal
Torkey, Hanaa
author_facet El-Behery, Heba
Attia, Abdel-Fattah
El-Fishawy, Nawal
Torkey, Hanaa
author_sort El-Behery, Heba
collection PubMed
description BACKGROUND: Discover possible Drug Target Interactions (DTIs) is a decisive step in the detection of the effects of drugs as well as drug repositioning. There is a strong incentive to develop effective computational methods that can effectively predict potential DTIs, as traditional DTI laboratory experiments are expensive, time-consuming, and labor-intensive. Some technologies have been developed for this purpose, however large numbers of interactions have not yet been detected, the accuracy of their prediction still low, and protein sequences and structured data are rarely used together in the prediction process. METHODS: This paper presents DTIs prediction model that takes advantage of the special capacity of the structured form of proteins and drugs. Our model obtains features from protein amino-acid sequences using physical and chemical properties, and from drugs smiles (Simplified Molecular Input Line Entry System) strings using encoding techniques. Comparing the proposed model with different existing methods under K-fold cross validation, empirical results show that our model based on ensemble learning algorithms for DTI prediction provide more accurate results from both structures and features data. RESULTS: The proposed model is applied on two datasets:Benchmark (feature only) datasets and DrugBank (Structure data) datasets. Experimental results obtained by Light-Boost and ExtraTree using structures and feature data results in 98 % accuracy and 0.97 f-score comparing to 94 % and 0.92 achieved by the existing methods. Moreover, our model can successfully predict more yet undiscovered interactions, and hence can be used as a practical tool to drug repositioning. A case study of applying our prediction model on the proteins that are known to be affected by Corona viruses in order to predict the possible interactions among these proteins and existing drugs is performed. Also, our model is applied on Covid-19 related drugs announced on DrugBank. The results show that some drugs like DB00691 and DB05203 are predicted with 100 % accuracy to interact with ACE2 protein. This protein is a self-membrane protein that enables Covid-19 infection. Hence, our model can be used as an effective tool in drug reposition to predict possible drug treatments for Covid-19.
format Online
Article
Text
id pubmed-8256690
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-82566902021-07-06 Efficient machine learning model for predicting drug-target interactions with case study for Covid-19 El-Behery, Heba Attia, Abdel-Fattah El-Fishawy, Nawal Torkey, Hanaa Comput Biol Chem Article BACKGROUND: Discover possible Drug Target Interactions (DTIs) is a decisive step in the detection of the effects of drugs as well as drug repositioning. There is a strong incentive to develop effective computational methods that can effectively predict potential DTIs, as traditional DTI laboratory experiments are expensive, time-consuming, and labor-intensive. Some technologies have been developed for this purpose, however large numbers of interactions have not yet been detected, the accuracy of their prediction still low, and protein sequences and structured data are rarely used together in the prediction process. METHODS: This paper presents DTIs prediction model that takes advantage of the special capacity of the structured form of proteins and drugs. Our model obtains features from protein amino-acid sequences using physical and chemical properties, and from drugs smiles (Simplified Molecular Input Line Entry System) strings using encoding techniques. Comparing the proposed model with different existing methods under K-fold cross validation, empirical results show that our model based on ensemble learning algorithms for DTI prediction provide more accurate results from both structures and features data. RESULTS: The proposed model is applied on two datasets:Benchmark (feature only) datasets and DrugBank (Structure data) datasets. Experimental results obtained by Light-Boost and ExtraTree using structures and feature data results in 98 % accuracy and 0.97 f-score comparing to 94 % and 0.92 achieved by the existing methods. Moreover, our model can successfully predict more yet undiscovered interactions, and hence can be used as a practical tool to drug repositioning. A case study of applying our prediction model on the proteins that are known to be affected by Corona viruses in order to predict the possible interactions among these proteins and existing drugs is performed. Also, our model is applied on Covid-19 related drugs announced on DrugBank. The results show that some drugs like DB00691 and DB05203 are predicted with 100 % accuracy to interact with ACE2 protein. This protein is a self-membrane protein that enables Covid-19 infection. Hence, our model can be used as an effective tool in drug reposition to predict possible drug treatments for Covid-19. Elsevier Ltd. 2021-08 2021-07-05 /pmc/articles/PMC8256690/ /pubmed/34271420 http://dx.doi.org/10.1016/j.compbiolchem.2021.107536 Text en © 2021 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
El-Behery, Heba
Attia, Abdel-Fattah
El-Fishawy, Nawal
Torkey, Hanaa
Efficient machine learning model for predicting drug-target interactions with case study for Covid-19
title Efficient machine learning model for predicting drug-target interactions with case study for Covid-19
title_full Efficient machine learning model for predicting drug-target interactions with case study for Covid-19
title_fullStr Efficient machine learning model for predicting drug-target interactions with case study for Covid-19
title_full_unstemmed Efficient machine learning model for predicting drug-target interactions with case study for Covid-19
title_short Efficient machine learning model for predicting drug-target interactions with case study for Covid-19
title_sort efficient machine learning model for predicting drug-target interactions with case study for covid-19
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8256690/
https://www.ncbi.nlm.nih.gov/pubmed/34271420
http://dx.doi.org/10.1016/j.compbiolchem.2021.107536
work_keys_str_mv AT elbeheryheba efficientmachinelearningmodelforpredictingdrugtargetinteractionswithcasestudyforcovid19
AT attiaabdelfattah efficientmachinelearningmodelforpredictingdrugtargetinteractionswithcasestudyforcovid19
AT elfishawynawal efficientmachinelearningmodelforpredictingdrugtargetinteractionswithcasestudyforcovid19
AT torkeyhanaa efficientmachinelearningmodelforpredictingdrugtargetinteractionswithcasestudyforcovid19