Cargando…

Automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data

BACKGROUND: This paper addresses the prediction of the free energy of binding of a drug candidate with enzyme InhA associated with Mycobacterium tuberculosis. This problem is found within rational drug design, where interactions between drug candidates and target proteins are verified through molecu...

Descripción completa

Detalles Bibliográficos
Autores principales: Barros, Rodrigo C, Winck, Ana T, Machado, Karina S, Basgalupp, Márcio P, de Carvalho, André CPLF, Ruiz, Duncan D, de Souza, Osmar Norberto
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3534569/
https://www.ncbi.nlm.nih.gov/pubmed/23171000
http://dx.doi.org/10.1186/1471-2105-13-310
_version_ 1782475357078683648
author Barros, Rodrigo C
Winck, Ana T
Machado, Karina S
Basgalupp, Márcio P
de Carvalho, André CPLF
Ruiz, Duncan D
de Souza, Osmar Norberto
author_facet Barros, Rodrigo C
Winck, Ana T
Machado, Karina S
Basgalupp, Márcio P
de Carvalho, André CPLF
Ruiz, Duncan D
de Souza, Osmar Norberto
author_sort Barros, Rodrigo C
collection PubMed
description BACKGROUND: This paper addresses the prediction of the free energy of binding of a drug candidate with enzyme InhA associated with Mycobacterium tuberculosis. This problem is found within rational drug design, where interactions between drug candidates and target proteins are verified through molecular docking simulations. In this application, it is important not only to correctly predict the free energy of binding, but also to provide a comprehensible model that could be validated by a domain specialist. Decision-tree induction algorithms have been successfully used in drug-design related applications, specially considering that decision trees are simple to understand, interpret, and validate. There are several decision-tree induction algorithms available for general-use, but each one has a bias that makes it more suitable for a particular data distribution. In this article, we propose and investigate the automatic design of decision-tree induction algorithms tailored to particular drug-enzyme binding data sets. We investigate the performance of our new method for evaluating binding conformations of different drug candidates to InhA, and we analyze our findings with respect to decision tree accuracy, comprehensibility, and biological relevance. RESULTS: The empirical analysis indicates that our method is capable of automatically generating decision-tree induction algorithms that significantly outperform the traditional C4.5 algorithm with respect to both accuracy and comprehensibility. In addition, we provide the biological interpretation of the rules generated by our approach, reinforcing the importance of comprehensible predictive models in this particular bioinformatics application. CONCLUSIONS: We conclude that automatically designing a decision-tree algorithm tailored to molecular docking data is a promising alternative for the prediction of the free energy from the binding of a drug candidate with a flexible-receptor.
format Online
Article
Text
id pubmed-3534569
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35345692013-01-03 Automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data Barros, Rodrigo C Winck, Ana T Machado, Karina S Basgalupp, Márcio P de Carvalho, André CPLF Ruiz, Duncan D de Souza, Osmar Norberto BMC Bioinformatics Research Article BACKGROUND: This paper addresses the prediction of the free energy of binding of a drug candidate with enzyme InhA associated with Mycobacterium tuberculosis. This problem is found within rational drug design, where interactions between drug candidates and target proteins are verified through molecular docking simulations. In this application, it is important not only to correctly predict the free energy of binding, but also to provide a comprehensible model that could be validated by a domain specialist. Decision-tree induction algorithms have been successfully used in drug-design related applications, specially considering that decision trees are simple to understand, interpret, and validate. There are several decision-tree induction algorithms available for general-use, but each one has a bias that makes it more suitable for a particular data distribution. In this article, we propose and investigate the automatic design of decision-tree induction algorithms tailored to particular drug-enzyme binding data sets. We investigate the performance of our new method for evaluating binding conformations of different drug candidates to InhA, and we analyze our findings with respect to decision tree accuracy, comprehensibility, and biological relevance. RESULTS: The empirical analysis indicates that our method is capable of automatically generating decision-tree induction algorithms that significantly outperform the traditional C4.5 algorithm with respect to both accuracy and comprehensibility. In addition, we provide the biological interpretation of the rules generated by our approach, reinforcing the importance of comprehensible predictive models in this particular bioinformatics application. CONCLUSIONS: We conclude that automatically designing a decision-tree algorithm tailored to molecular docking data is a promising alternative for the prediction of the free energy from the binding of a drug candidate with a flexible-receptor. BioMed Central 2012-11-21 /pmc/articles/PMC3534569/ /pubmed/23171000 http://dx.doi.org/10.1186/1471-2105-13-310 Text en Copyright ©2012 Barros et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Barros, Rodrigo C
Winck, Ana T
Machado, Karina S
Basgalupp, Márcio P
de Carvalho, André CPLF
Ruiz, Duncan D
de Souza, Osmar Norberto
Automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data
title Automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data
title_full Automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data
title_fullStr Automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data
title_full_unstemmed Automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data
title_short Automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data
title_sort automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3534569/
https://www.ncbi.nlm.nih.gov/pubmed/23171000
http://dx.doi.org/10.1186/1471-2105-13-310
work_keys_str_mv AT barrosrodrigoc automaticdesignofdecisiontreeinductionalgorithmstailoredtoflexiblereceptordockingdata
AT winckanat automaticdesignofdecisiontreeinductionalgorithmstailoredtoflexiblereceptordockingdata
AT machadokarinas automaticdesignofdecisiontreeinductionalgorithmstailoredtoflexiblereceptordockingdata
AT basgaluppmarciop automaticdesignofdecisiontreeinductionalgorithmstailoredtoflexiblereceptordockingdata
AT decarvalhoandrecplf automaticdesignofdecisiontreeinductionalgorithmstailoredtoflexiblereceptordockingdata
AT ruizduncand automaticdesignofdecisiontreeinductionalgorithmstailoredtoflexiblereceptordockingdata
AT desouzaosmarnorberto automaticdesignofdecisiontreeinductionalgorithmstailoredtoflexiblereceptordockingdata