Cargando…

GPURFSCREEN: a GPU based virtual screening tool using random forest classifier

BACKGROUND: In-silico methods are an integral part of modern drug discovery paradigm. Virtual screening, an in-silico method, is used to refine data models and reduce the chemical space on which wet lab experiments need to be performed. Virtual screening of a ligand data model requires large scale c...

Descripción completa

Detalles Bibliográficos
Autores principales: Jayaraj, P. B., Ajay, Mathias K., Nufail, M., Gopakumar, G., Jaleel, U. C. A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4772510/
https://www.ncbi.nlm.nih.gov/pubmed/26933453
http://dx.doi.org/10.1186/s13321-016-0124-8
_version_ 1782418583154851840
author Jayaraj, P. B.
Ajay, Mathias K.
Nufail, M.
Gopakumar, G.
Jaleel, U. C. A.
author_facet Jayaraj, P. B.
Ajay, Mathias K.
Nufail, M.
Gopakumar, G.
Jaleel, U. C. A.
author_sort Jayaraj, P. B.
collection PubMed
description BACKGROUND: In-silico methods are an integral part of modern drug discovery paradigm. Virtual screening, an in-silico method, is used to refine data models and reduce the chemical space on which wet lab experiments need to be performed. Virtual screening of a ligand data model requires large scale computations, making it a highly time consuming task. This process can be speeded up by implementing parallelized algorithms on a Graphical Processing Unit (GPU). RESULTS: Random Forest is a robust classification algorithm that can be employed in the virtual screening. A ligand based virtual screening tool (GPURFSCREEN) that uses random forests on GPU systems has been proposed and evaluated in this paper. This tool produces optimized results at a lower execution time for large bioassay data sets. The quality of results produced by our tool on GPU is same as that on a regular serial environment. CONCLUSION: Considering the magnitude of data to be screened, the parallelized virtual screening has a significantly lower running time at high throughput. The proposed parallel tool outperforms its serial counterpart by successfully screening billions of molecules in training and prediction phases.
format Online
Article
Text
id pubmed-4772510
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-47725102016-03-02 GPURFSCREEN: a GPU based virtual screening tool using random forest classifier Jayaraj, P. B. Ajay, Mathias K. Nufail, M. Gopakumar, G. Jaleel, U. C. A. J Cheminform Software BACKGROUND: In-silico methods are an integral part of modern drug discovery paradigm. Virtual screening, an in-silico method, is used to refine data models and reduce the chemical space on which wet lab experiments need to be performed. Virtual screening of a ligand data model requires large scale computations, making it a highly time consuming task. This process can be speeded up by implementing parallelized algorithms on a Graphical Processing Unit (GPU). RESULTS: Random Forest is a robust classification algorithm that can be employed in the virtual screening. A ligand based virtual screening tool (GPURFSCREEN) that uses random forests on GPU systems has been proposed and evaluated in this paper. This tool produces optimized results at a lower execution time for large bioassay data sets. The quality of results produced by our tool on GPU is same as that on a regular serial environment. CONCLUSION: Considering the magnitude of data to be screened, the parallelized virtual screening has a significantly lower running time at high throughput. The proposed parallel tool outperforms its serial counterpart by successfully screening billions of molecules in training and prediction phases. Springer International Publishing 2016-03-01 /pmc/articles/PMC4772510/ /pubmed/26933453 http://dx.doi.org/10.1186/s13321-016-0124-8 Text en © Jayaraj et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Jayaraj, P. B.
Ajay, Mathias K.
Nufail, M.
Gopakumar, G.
Jaleel, U. C. A.
GPURFSCREEN: a GPU based virtual screening tool using random forest classifier
title GPURFSCREEN: a GPU based virtual screening tool using random forest classifier
title_full GPURFSCREEN: a GPU based virtual screening tool using random forest classifier
title_fullStr GPURFSCREEN: a GPU based virtual screening tool using random forest classifier
title_full_unstemmed GPURFSCREEN: a GPU based virtual screening tool using random forest classifier
title_short GPURFSCREEN: a GPU based virtual screening tool using random forest classifier
title_sort gpurfscreen: a gpu based virtual screening tool using random forest classifier
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4772510/
https://www.ncbi.nlm.nih.gov/pubmed/26933453
http://dx.doi.org/10.1186/s13321-016-0124-8
work_keys_str_mv AT jayarajpb gpurfscreenagpubasedvirtualscreeningtoolusingrandomforestclassifier
AT ajaymathiask gpurfscreenagpubasedvirtualscreeningtoolusingrandomforestclassifier
AT nufailm gpurfscreenagpubasedvirtualscreeningtoolusingrandomforestclassifier
AT gopakumarg gpurfscreenagpubasedvirtualscreeningtoolusingrandomforestclassifier
AT jaleeluca gpurfscreenagpubasedvirtualscreeningtoolusingrandomforestclassifier