Cargando…
Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study
BACKGROUND: There is a need for automated methods to learn general features of the interactions of a ligand class with its diverse set of protein receptors. An appropriate machine learning approach is Inductive Logic Programming (ILP), which automatically generates comprehensible rules in addition t...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3458898/ https://www.ncbi.nlm.nih.gov/pubmed/22783946 http://dx.doi.org/10.1186/1471-2105-13-162 |
_version_ | 1782244727792336896 |
---|---|
author | A Santos, Jose C Nassif, Houssam Page, David Muggleton, Stephen H E Sternberg, Michael J |
author_facet | A Santos, Jose C Nassif, Houssam Page, David Muggleton, Stephen H E Sternberg, Michael J |
author_sort | A Santos, Jose C |
collection | PubMed |
description | BACKGROUND: There is a need for automated methods to learn general features of the interactions of a ligand class with its diverse set of protein receptors. An appropriate machine learning approach is Inductive Logic Programming (ILP), which automatically generates comprehensible rules in addition to prediction. The development of ILP systems which can learn rules of the complexity required for studies on protein structure remains a challenge. In this work we use a new ILP system, ProGolem, and demonstrate its performance on learning features of hexose-protein interactions. RESULTS: The rules induced by ProGolem detect interactions mediated by aromatics and by planar-polar residues, in addition to less common features such as the aromatic sandwich. The rules also reveal a previously unreported dependency for residues cys and leu. They also specify interactions involving aromatic and hydrogen bonding residues. This paper shows that Inductive Logic Programming implemented in ProGolem can derive rules giving structural features of protein/ligand interactions. Several of these rules are consistent with descriptions in the literature. CONCLUSIONS: In addition to confirming literature results, ProGolem’s model has a 10-fold cross-validated predictive accuracy that is superior, at the 95% confidence level, to another ILP system previously used to study protein/hexose interactions and is comparable with state-of-the-art statistical learners. |
format | Online Article Text |
id | pubmed-3458898 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-34588982012-09-27 Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study A Santos, Jose C Nassif, Houssam Page, David Muggleton, Stephen H E Sternberg, Michael J BMC Bioinformatics Research Article BACKGROUND: There is a need for automated methods to learn general features of the interactions of a ligand class with its diverse set of protein receptors. An appropriate machine learning approach is Inductive Logic Programming (ILP), which automatically generates comprehensible rules in addition to prediction. The development of ILP systems which can learn rules of the complexity required for studies on protein structure remains a challenge. In this work we use a new ILP system, ProGolem, and demonstrate its performance on learning features of hexose-protein interactions. RESULTS: The rules induced by ProGolem detect interactions mediated by aromatics and by planar-polar residues, in addition to less common features such as the aromatic sandwich. The rules also reveal a previously unreported dependency for residues cys and leu. They also specify interactions involving aromatic and hydrogen bonding residues. This paper shows that Inductive Logic Programming implemented in ProGolem can derive rules giving structural features of protein/ligand interactions. Several of these rules are consistent with descriptions in the literature. CONCLUSIONS: In addition to confirming literature results, ProGolem’s model has a 10-fold cross-validated predictive accuracy that is superior, at the 95% confidence level, to another ILP system previously used to study protein/hexose interactions and is comparable with state-of-the-art statistical learners. BioMed Central 2012-07-11 /pmc/articles/PMC3458898/ /pubmed/22783946 http://dx.doi.org/10.1186/1471-2105-13-162 Text en Copyright ©2012 Santos et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article A Santos, Jose C Nassif, Houssam Page, David Muggleton, Stephen H E Sternberg, Michael J Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study |
title | Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study |
title_full | Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study |
title_fullStr | Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study |
title_full_unstemmed | Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study |
title_short | Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study |
title_sort | automated identification of protein-ligand interaction features using inductive logic programming: a hexose binding case study |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3458898/ https://www.ncbi.nlm.nih.gov/pubmed/22783946 http://dx.doi.org/10.1186/1471-2105-13-162 |
work_keys_str_mv | AT asantosjosec automatedidentificationofproteinligandinteractionfeaturesusinginductivelogicprogrammingahexosebindingcasestudy AT nassifhoussam automatedidentificationofproteinligandinteractionfeaturesusinginductivelogicprogrammingahexosebindingcasestudy AT pagedavid automatedidentificationofproteinligandinteractionfeaturesusinginductivelogicprogrammingahexosebindingcasestudy AT muggletonstephenh automatedidentificationofproteinligandinteractionfeaturesusinginductivelogicprogrammingahexosebindingcasestudy AT esternbergmichaelj automatedidentificationofproteinligandinteractionfeaturesusinginductivelogicprogrammingahexosebindingcasestudy |