Cargando…

QSAR based predictive modeling for anti-malarial molecules

Malaria is a predominant infectious disease, with a global footprint, but especially severe in developing countries in the African subcontinent. In recent years, drug-resistant malaria has become an alarming factor, and hence the requirement of new and improved drugs is more crucial than ever before...

Descripción completa

Detalles Bibliográficos
Autores principales: Bharti, Deepak R., Lynn, Andrew M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Biomedical Informatics 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5498782/
https://www.ncbi.nlm.nih.gov/pubmed/28690382
http://dx.doi.org/10.6026/97320630013154
_version_ 1783248360230092800
author Bharti, Deepak R.
Lynn, Andrew M.
author_facet Bharti, Deepak R.
Lynn, Andrew M.
author_sort Bharti, Deepak R.
collection PubMed
description Malaria is a predominant infectious disease, with a global footprint, but especially severe in developing countries in the African subcontinent. In recent years, drug-resistant malaria has become an alarming factor, and hence the requirement of new and improved drugs is more crucial than ever before. One of the promising locations for antimalarial drug target is the apicoplast, as this organelle does not occur in humans. The apicoplast is associated with many unique and essential pathways in many Apicomplexan pathogens, including Plasmodium. The use of machine learning methods is now commonly available through open source programs. In the present work, we describe a standard protocol to develop molecular descriptor based predictive models (QSAR models), which can be further utilized for the screening of large chemical libraries. This protocol is used to build models using training data sourced from apicoplast specific bioassays. Multiple model building methods are used including Generalized Linear Models (GLM), Random Forest (RF), C5.0 implementation of a decision tree, Support Vector Machines (SVM), K-Nearest Neighbour and Naive Bayes. Methods to evaluate the accuracy of the model building method are included in the protocol. For the given dataset, the C5.0, SVM and RF perform better than other methods, with comparable accuracy over the test data.
format Online
Article
Text
id pubmed-5498782
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Biomedical Informatics
record_format MEDLINE/PubMed
spelling pubmed-54987822017-07-07 QSAR based predictive modeling for anti-malarial molecules Bharti, Deepak R. Lynn, Andrew M. Bioinformation Hypothesis Malaria is a predominant infectious disease, with a global footprint, but especially severe in developing countries in the African subcontinent. In recent years, drug-resistant malaria has become an alarming factor, and hence the requirement of new and improved drugs is more crucial than ever before. One of the promising locations for antimalarial drug target is the apicoplast, as this organelle does not occur in humans. The apicoplast is associated with many unique and essential pathways in many Apicomplexan pathogens, including Plasmodium. The use of machine learning methods is now commonly available through open source programs. In the present work, we describe a standard protocol to develop molecular descriptor based predictive models (QSAR models), which can be further utilized for the screening of large chemical libraries. This protocol is used to build models using training data sourced from apicoplast specific bioassays. Multiple model building methods are used including Generalized Linear Models (GLM), Random Forest (RF), C5.0 implementation of a decision tree, Support Vector Machines (SVM), K-Nearest Neighbour and Naive Bayes. Methods to evaluate the accuracy of the model building method are included in the protocol. For the given dataset, the C5.0, SVM and RF perform better than other methods, with comparable accuracy over the test data. Biomedical Informatics 2017-05-31 /pmc/articles/PMC5498782/ /pubmed/28690382 http://dx.doi.org/10.6026/97320630013154 Text en © 2017 Biomedical Informatics http://creativecommons.org/licenses/by/3.0/ This is an Open Access article which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. This is distributed under the terms of the Creative Commons Attribution License.
spellingShingle Hypothesis
Bharti, Deepak R.
Lynn, Andrew M.
QSAR based predictive modeling for anti-malarial molecules
title QSAR based predictive modeling for anti-malarial molecules
title_full QSAR based predictive modeling for anti-malarial molecules
title_fullStr QSAR based predictive modeling for anti-malarial molecules
title_full_unstemmed QSAR based predictive modeling for anti-malarial molecules
title_short QSAR based predictive modeling for anti-malarial molecules
title_sort qsar based predictive modeling for anti-malarial molecules
topic Hypothesis
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5498782/
https://www.ncbi.nlm.nih.gov/pubmed/28690382
http://dx.doi.org/10.6026/97320630013154
work_keys_str_mv AT bhartideepakr qsarbasedpredictivemodelingforantimalarialmolecules
AT lynnandrewm qsarbasedpredictivemodelingforantimalarialmolecules