Cargando…
GPURFSCREEN: a GPU based virtual screening tool using random forest classifier
BACKGROUND: In-silico methods are an integral part of modern drug discovery paradigm. Virtual screening, an in-silico method, is used to refine data models and reduce the chemical space on which wet lab experiments need to be performed. Virtual screening of a ligand data model requires large scale c...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4772510/ https://www.ncbi.nlm.nih.gov/pubmed/26933453 http://dx.doi.org/10.1186/s13321-016-0124-8 |
_version_ | 1782418583154851840 |
---|---|
author | Jayaraj, P. B. Ajay, Mathias K. Nufail, M. Gopakumar, G. Jaleel, U. C. A. |
author_facet | Jayaraj, P. B. Ajay, Mathias K. Nufail, M. Gopakumar, G. Jaleel, U. C. A. |
author_sort | Jayaraj, P. B. |
collection | PubMed |
description | BACKGROUND: In-silico methods are an integral part of modern drug discovery paradigm. Virtual screening, an in-silico method, is used to refine data models and reduce the chemical space on which wet lab experiments need to be performed. Virtual screening of a ligand data model requires large scale computations, making it a highly time consuming task. This process can be speeded up by implementing parallelized algorithms on a Graphical Processing Unit (GPU). RESULTS: Random Forest is a robust classification algorithm that can be employed in the virtual screening. A ligand based virtual screening tool (GPURFSCREEN) that uses random forests on GPU systems has been proposed and evaluated in this paper. This tool produces optimized results at a lower execution time for large bioassay data sets. The quality of results produced by our tool on GPU is same as that on a regular serial environment. CONCLUSION: Considering the magnitude of data to be screened, the parallelized virtual screening has a significantly lower running time at high throughput. The proposed parallel tool outperforms its serial counterpart by successfully screening billions of molecules in training and prediction phases. |
format | Online Article Text |
id | pubmed-4772510 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-47725102016-03-02 GPURFSCREEN: a GPU based virtual screening tool using random forest classifier Jayaraj, P. B. Ajay, Mathias K. Nufail, M. Gopakumar, G. Jaleel, U. C. A. J Cheminform Software BACKGROUND: In-silico methods are an integral part of modern drug discovery paradigm. Virtual screening, an in-silico method, is used to refine data models and reduce the chemical space on which wet lab experiments need to be performed. Virtual screening of a ligand data model requires large scale computations, making it a highly time consuming task. This process can be speeded up by implementing parallelized algorithms on a Graphical Processing Unit (GPU). RESULTS: Random Forest is a robust classification algorithm that can be employed in the virtual screening. A ligand based virtual screening tool (GPURFSCREEN) that uses random forests on GPU systems has been proposed and evaluated in this paper. This tool produces optimized results at a lower execution time for large bioassay data sets. The quality of results produced by our tool on GPU is same as that on a regular serial environment. CONCLUSION: Considering the magnitude of data to be screened, the parallelized virtual screening has a significantly lower running time at high throughput. The proposed parallel tool outperforms its serial counterpart by successfully screening billions of molecules in training and prediction phases. Springer International Publishing 2016-03-01 /pmc/articles/PMC4772510/ /pubmed/26933453 http://dx.doi.org/10.1186/s13321-016-0124-8 Text en © Jayaraj et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Jayaraj, P. B. Ajay, Mathias K. Nufail, M. Gopakumar, G. Jaleel, U. C. A. GPURFSCREEN: a GPU based virtual screening tool using random forest classifier |
title | GPURFSCREEN: a GPU based virtual screening tool using random forest classifier |
title_full | GPURFSCREEN: a GPU based virtual screening tool using random forest classifier |
title_fullStr | GPURFSCREEN: a GPU based virtual screening tool using random forest classifier |
title_full_unstemmed | GPURFSCREEN: a GPU based virtual screening tool using random forest classifier |
title_short | GPURFSCREEN: a GPU based virtual screening tool using random forest classifier |
title_sort | gpurfscreen: a gpu based virtual screening tool using random forest classifier |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4772510/ https://www.ncbi.nlm.nih.gov/pubmed/26933453 http://dx.doi.org/10.1186/s13321-016-0124-8 |
work_keys_str_mv | AT jayarajpb gpurfscreenagpubasedvirtualscreeningtoolusingrandomforestclassifier AT ajaymathiask gpurfscreenagpubasedvirtualscreeningtoolusingrandomforestclassifier AT nufailm gpurfscreenagpubasedvirtualscreeningtoolusingrandomforestclassifier AT gopakumarg gpurfscreenagpubasedvirtualscreeningtoolusingrandomforestclassifier AT jaleeluca gpurfscreenagpubasedvirtualscreeningtoolusingrandomforestclassifier |