Cargando…

Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study

BACKGROUND: There is a need for automated methods to learn general features of the interactions of a ligand class with its diverse set of protein receptors. An appropriate machine learning approach is Inductive Logic Programming (ILP), which automatically generates comprehensible rules in addition t...

Descripción completa

Detalles Bibliográficos
Autores principales: A Santos, Jose C, Nassif, Houssam, Page, David, Muggleton, Stephen H, E Sternberg, Michael J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3458898/
https://www.ncbi.nlm.nih.gov/pubmed/22783946
http://dx.doi.org/10.1186/1471-2105-13-162
_version_ 1782244727792336896
author A Santos, Jose C
Nassif, Houssam
Page, David
Muggleton, Stephen H
E Sternberg, Michael J
author_facet A Santos, Jose C
Nassif, Houssam
Page, David
Muggleton, Stephen H
E Sternberg, Michael J
author_sort A Santos, Jose C
collection PubMed
description BACKGROUND: There is a need for automated methods to learn general features of the interactions of a ligand class with its diverse set of protein receptors. An appropriate machine learning approach is Inductive Logic Programming (ILP), which automatically generates comprehensible rules in addition to prediction. The development of ILP systems which can learn rules of the complexity required for studies on protein structure remains a challenge. In this work we use a new ILP system, ProGolem, and demonstrate its performance on learning features of hexose-protein interactions. RESULTS: The rules induced by ProGolem detect interactions mediated by aromatics and by planar-polar residues, in addition to less common features such as the aromatic sandwich. The rules also reveal a previously unreported dependency for residues cys and leu. They also specify interactions involving aromatic and hydrogen bonding residues. This paper shows that Inductive Logic Programming implemented in ProGolem can derive rules giving structural features of protein/ligand interactions. Several of these rules are consistent with descriptions in the literature. CONCLUSIONS: In addition to confirming literature results, ProGolem’s model has a 10-fold cross-validated predictive accuracy that is superior, at the 95% confidence level, to another ILP system previously used to study protein/hexose interactions and is comparable with state-of-the-art statistical learners.
format Online
Article
Text
id pubmed-3458898
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34588982012-09-27 Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study A Santos, Jose C Nassif, Houssam Page, David Muggleton, Stephen H E Sternberg, Michael J BMC Bioinformatics Research Article BACKGROUND: There is a need for automated methods to learn general features of the interactions of a ligand class with its diverse set of protein receptors. An appropriate machine learning approach is Inductive Logic Programming (ILP), which automatically generates comprehensible rules in addition to prediction. The development of ILP systems which can learn rules of the complexity required for studies on protein structure remains a challenge. In this work we use a new ILP system, ProGolem, and demonstrate its performance on learning features of hexose-protein interactions. RESULTS: The rules induced by ProGolem detect interactions mediated by aromatics and by planar-polar residues, in addition to less common features such as the aromatic sandwich. The rules also reveal a previously unreported dependency for residues cys and leu. They also specify interactions involving aromatic and hydrogen bonding residues. This paper shows that Inductive Logic Programming implemented in ProGolem can derive rules giving structural features of protein/ligand interactions. Several of these rules are consistent with descriptions in the literature. CONCLUSIONS: In addition to confirming literature results, ProGolem’s model has a 10-fold cross-validated predictive accuracy that is superior, at the 95% confidence level, to another ILP system previously used to study protein/hexose interactions and is comparable with state-of-the-art statistical learners. BioMed Central 2012-07-11 /pmc/articles/PMC3458898/ /pubmed/22783946 http://dx.doi.org/10.1186/1471-2105-13-162 Text en Copyright ©2012 Santos et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
A Santos, Jose C
Nassif, Houssam
Page, David
Muggleton, Stephen H
E Sternberg, Michael J
Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study
title Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study
title_full Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study
title_fullStr Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study
title_full_unstemmed Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study
title_short Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study
title_sort automated identification of protein-ligand interaction features using inductive logic programming: a hexose binding case study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3458898/
https://www.ncbi.nlm.nih.gov/pubmed/22783946
http://dx.doi.org/10.1186/1471-2105-13-162
work_keys_str_mv AT asantosjosec automatedidentificationofproteinligandinteractionfeaturesusinginductivelogicprogrammingahexosebindingcasestudy
AT nassifhoussam automatedidentificationofproteinligandinteractionfeaturesusinginductivelogicprogrammingahexosebindingcasestudy
AT pagedavid automatedidentificationofproteinligandinteractionfeaturesusinginductivelogicprogrammingahexosebindingcasestudy
AT muggletonstephenh automatedidentificationofproteinligandinteractionfeaturesusinginductivelogicprogrammingahexosebindingcasestudy
AT esternbergmichaelj automatedidentificationofproteinligandinteractionfeaturesusinginductivelogicprogrammingahexosebindingcasestudy