Cargando…
Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets
BACKGROUND: Tuberculosis is a contagious disease caused by Mycobacterium tuberculosis (Mtb), affecting more than two billion people around the globe and is one of the major causes of morbidity and mortality in the developing world. Recent reports suggest that Mtb has been developing resistance to th...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3228709/ https://www.ncbi.nlm.nih.gov/pubmed/22099929 http://dx.doi.org/10.1186/1756-0500-4-504 |
_version_ | 1782217853792944128 |
---|---|
author | Periwal, Vinita Rajappan, Jinuraj K Jaleel, Abdul UC Scaria, Vinod |
author_facet | Periwal, Vinita Rajappan, Jinuraj K Jaleel, Abdul UC Scaria, Vinod |
author_sort | Periwal, Vinita |
collection | PubMed |
description | BACKGROUND: Tuberculosis is a contagious disease caused by Mycobacterium tuberculosis (Mtb), affecting more than two billion people around the globe and is one of the major causes of morbidity and mortality in the developing world. Recent reports suggest that Mtb has been developing resistance to the widely used anti-tubercular drugs resulting in the emergence and spread of multi drug-resistant (MDR) and extensively drug-resistant (XDR) strains throughout the world. In view of this global epidemic, there is an urgent need to facilitate fast and efficient lead identification methodologies. Target based screening of large compound libraries has been widely used as a fast and efficient approach for lead identification, but is restricted by the knowledge about the target structure. Whole organism screens on the other hand are target-agnostic and have been now widely employed as an alternative for lead identification but they are limited by the time and cost involved in running the screens for large compound libraries. This could be possibly be circumvented by using computational approaches to prioritize molecules for screening programmes. RESULTS: We utilized physicochemical properties of compounds to train four supervised classifiers (Naïve Bayes, Random Forest, J48 and SMO) on three publicly available bioassay screens of Mtb inhibitors and validated the robustness of the predictive models using various statistical measures. CONCLUSIONS: This study is a comprehensive analysis of high-throughput bioassay data for anti-tubercular activity and the application of machine learning approaches to create target-agnostic predictive models for anti-tubercular agents. |
format | Online Article Text |
id | pubmed-3228709 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32287092011-12-02 Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets Periwal, Vinita Rajappan, Jinuraj K Jaleel, Abdul UC Scaria, Vinod BMC Res Notes Research Article BACKGROUND: Tuberculosis is a contagious disease caused by Mycobacterium tuberculosis (Mtb), affecting more than two billion people around the globe and is one of the major causes of morbidity and mortality in the developing world. Recent reports suggest that Mtb has been developing resistance to the widely used anti-tubercular drugs resulting in the emergence and spread of multi drug-resistant (MDR) and extensively drug-resistant (XDR) strains throughout the world. In view of this global epidemic, there is an urgent need to facilitate fast and efficient lead identification methodologies. Target based screening of large compound libraries has been widely used as a fast and efficient approach for lead identification, but is restricted by the knowledge about the target structure. Whole organism screens on the other hand are target-agnostic and have been now widely employed as an alternative for lead identification but they are limited by the time and cost involved in running the screens for large compound libraries. This could be possibly be circumvented by using computational approaches to prioritize molecules for screening programmes. RESULTS: We utilized physicochemical properties of compounds to train four supervised classifiers (Naïve Bayes, Random Forest, J48 and SMO) on three publicly available bioassay screens of Mtb inhibitors and validated the robustness of the predictive models using various statistical measures. CONCLUSIONS: This study is a comprehensive analysis of high-throughput bioassay data for anti-tubercular activity and the application of machine learning approaches to create target-agnostic predictive models for anti-tubercular agents. BioMed Central 2011-11-18 /pmc/articles/PMC3228709/ /pubmed/22099929 http://dx.doi.org/10.1186/1756-0500-4-504 Text en Copyright ©2011 Jaleel et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Periwal, Vinita Rajappan, Jinuraj K Jaleel, Abdul UC Scaria, Vinod Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets |
title | Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets |
title_full | Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets |
title_fullStr | Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets |
title_full_unstemmed | Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets |
title_short | Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets |
title_sort | predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3228709/ https://www.ncbi.nlm.nih.gov/pubmed/22099929 http://dx.doi.org/10.1186/1756-0500-4-504 |
work_keys_str_mv | AT periwalvinita predictivemodelsforantitubercularmoleculesusingmachinelearningonhighthroughputbiologicalscreeningdatasets AT rajappanjinurajk predictivemodelsforantitubercularmoleculesusingmachinelearningonhighthroughputbiologicalscreeningdatasets AT jaleelabduluc predictivemodelsforantitubercularmoleculesusingmachinelearningonhighthroughputbiologicalscreeningdatasets AT scariavinod predictivemodelsforantitubercularmoleculesusingmachinelearningonhighthroughputbiologicalscreeningdatasets |