Cargando…

A machine learning approach towards the prediction of protein–ligand binding affinity based on fundamental molecular properties

There is an exigency of transformation of the enormous amount of biological data available in various forms into some significant knowledge. We have tried to implement Machine Learning (ML) algorithm models on the protein–ligand binding affinity data already available to predict the binding affinity...

Descripción completa

Detalles Bibliográficos
Autores principales: Kundu, Indra, Paul, Goutam, Banerjee, Raja
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9079328/
https://www.ncbi.nlm.nih.gov/pubmed/35539386
http://dx.doi.org/10.1039/c8ra00003d
_version_ 1784702540135268352
author Kundu, Indra
Paul, Goutam
Banerjee, Raja
author_facet Kundu, Indra
Paul, Goutam
Banerjee, Raja
author_sort Kundu, Indra
collection PubMed
description There is an exigency of transformation of the enormous amount of biological data available in various forms into some significant knowledge. We have tried to implement Machine Learning (ML) algorithm models on the protein–ligand binding affinity data already available to predict the binding affinity of the unknown. ML methods are appreciably faster and cheaper as compared to traditional experimental methods or computational scoring approaches. The prerequisites of this prediction are sufficient and unbiased features of training data and a prediction model which can fit the data well. In our study, we have applied Random forest and Gaussian process regression algorithms from the Weka package on protein–ligand binding affinity, which encompasses protein and ligand binding information from PdbBind database. The models are trained on the basis of selective fundamental information of both proteins and ligand, which can be effortlessly fetched from online databases or can be calculated with the availability of structure. The assessment of the models was made on the basis of correlation coefficient (R(2)) and root mean square error (RMSE). The Random forest model gave R(2) and RMSE of 0.76 and 1.31 respectively. We have also used our features and prediction models on the dataset used by others and found that our model with our features outperformed the existing ones.
format Online
Article
Text
id pubmed-9079328
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-90793282022-05-09 A machine learning approach towards the prediction of protein–ligand binding affinity based on fundamental molecular properties Kundu, Indra Paul, Goutam Banerjee, Raja RSC Adv Chemistry There is an exigency of transformation of the enormous amount of biological data available in various forms into some significant knowledge. We have tried to implement Machine Learning (ML) algorithm models on the protein–ligand binding affinity data already available to predict the binding affinity of the unknown. ML methods are appreciably faster and cheaper as compared to traditional experimental methods or computational scoring approaches. The prerequisites of this prediction are sufficient and unbiased features of training data and a prediction model which can fit the data well. In our study, we have applied Random forest and Gaussian process regression algorithms from the Weka package on protein–ligand binding affinity, which encompasses protein and ligand binding information from PdbBind database. The models are trained on the basis of selective fundamental information of both proteins and ligand, which can be effortlessly fetched from online databases or can be calculated with the availability of structure. The assessment of the models was made on the basis of correlation coefficient (R(2)) and root mean square error (RMSE). The Random forest model gave R(2) and RMSE of 0.76 and 1.31 respectively. We have also used our features and prediction models on the dataset used by others and found that our model with our features outperformed the existing ones. The Royal Society of Chemistry 2018-03-28 /pmc/articles/PMC9079328/ /pubmed/35539386 http://dx.doi.org/10.1039/c8ra00003d Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by/3.0/
spellingShingle Chemistry
Kundu, Indra
Paul, Goutam
Banerjee, Raja
A machine learning approach towards the prediction of protein–ligand binding affinity based on fundamental molecular properties
title A machine learning approach towards the prediction of protein–ligand binding affinity based on fundamental molecular properties
title_full A machine learning approach towards the prediction of protein–ligand binding affinity based on fundamental molecular properties
title_fullStr A machine learning approach towards the prediction of protein–ligand binding affinity based on fundamental molecular properties
title_full_unstemmed A machine learning approach towards the prediction of protein–ligand binding affinity based on fundamental molecular properties
title_short A machine learning approach towards the prediction of protein–ligand binding affinity based on fundamental molecular properties
title_sort machine learning approach towards the prediction of protein–ligand binding affinity based on fundamental molecular properties
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9079328/
https://www.ncbi.nlm.nih.gov/pubmed/35539386
http://dx.doi.org/10.1039/c8ra00003d
work_keys_str_mv AT kunduindra amachinelearningapproachtowardsthepredictionofproteinligandbindingaffinitybasedonfundamentalmolecularproperties
AT paulgoutam amachinelearningapproachtowardsthepredictionofproteinligandbindingaffinitybasedonfundamentalmolecularproperties
AT banerjeeraja amachinelearningapproachtowardsthepredictionofproteinligandbindingaffinitybasedonfundamentalmolecularproperties
AT kunduindra machinelearningapproachtowardsthepredictionofproteinligandbindingaffinitybasedonfundamentalmolecularproperties
AT paulgoutam machinelearningapproachtowardsthepredictionofproteinligandbindingaffinitybasedonfundamentalmolecularproperties
AT banerjeeraja machinelearningapproachtowardsthepredictionofproteinligandbindingaffinitybasedonfundamentalmolecularproperties