Cargando…
Performance Studies on Distributed Virtual Screening
Virtual high-throughput screening (vHTS) is an invaluable method in modern drug discovery. It permits screening large datasets or databases of chemical structures for those structures binding possibly to a drug target. Virtual screening is typically performed by docking code, which often runs sequen...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4083208/ https://www.ncbi.nlm.nih.gov/pubmed/25032219 http://dx.doi.org/10.1155/2014/624024 |
_version_ | 1782324342209642496 |
---|---|
author | Krüger, Jens Grunzke, Richard Herres-Pawlis, Sonja Hoffmann, Alexander de la Garza, Luis Kohlbacher, Oliver Nagel, Wolfgang E. Gesing, Sandra |
author_facet | Krüger, Jens Grunzke, Richard Herres-Pawlis, Sonja Hoffmann, Alexander de la Garza, Luis Kohlbacher, Oliver Nagel, Wolfgang E. Gesing, Sandra |
author_sort | Krüger, Jens |
collection | PubMed |
description | Virtual high-throughput screening (vHTS) is an invaluable method in modern drug discovery. It permits screening large datasets or databases of chemical structures for those structures binding possibly to a drug target. Virtual screening is typically performed by docking code, which often runs sequentially. Processing of huge vHTS datasets can be parallelized by chunking the data because individual docking runs are independent of each other. The goal of this work is to find an optimal splitting maximizing the speedup while considering overhead and available cores on Distributed Computing Infrastructures (DCIs). We have conducted thorough performance studies accounting not only for the runtime of the docking itself, but also for structure preparation. Performance studies were conducted via the workflow-enabled science gateway MoSGrid (Molecular Simulation Grid). As input we used benchmark datasets for protein kinases. Our performance studies show that docking workflows can be made to scale almost linearly up to 500 concurrent processes distributed even over large DCIs, thus accelerating vHTS campaigns significantly. |
format | Online Article Text |
id | pubmed-4083208 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-40832082014-07-16 Performance Studies on Distributed Virtual Screening Krüger, Jens Grunzke, Richard Herres-Pawlis, Sonja Hoffmann, Alexander de la Garza, Luis Kohlbacher, Oliver Nagel, Wolfgang E. Gesing, Sandra Biomed Res Int Research Article Virtual high-throughput screening (vHTS) is an invaluable method in modern drug discovery. It permits screening large datasets or databases of chemical structures for those structures binding possibly to a drug target. Virtual screening is typically performed by docking code, which often runs sequentially. Processing of huge vHTS datasets can be parallelized by chunking the data because individual docking runs are independent of each other. The goal of this work is to find an optimal splitting maximizing the speedup while considering overhead and available cores on Distributed Computing Infrastructures (DCIs). We have conducted thorough performance studies accounting not only for the runtime of the docking itself, but also for structure preparation. Performance studies were conducted via the workflow-enabled science gateway MoSGrid (Molecular Simulation Grid). As input we used benchmark datasets for protein kinases. Our performance studies show that docking workflows can be made to scale almost linearly up to 500 concurrent processes distributed even over large DCIs, thus accelerating vHTS campaigns significantly. Hindawi Publishing Corporation 2014 2014-06-17 /pmc/articles/PMC4083208/ /pubmed/25032219 http://dx.doi.org/10.1155/2014/624024 Text en Copyright © 2014 Jens Krüger et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Krüger, Jens Grunzke, Richard Herres-Pawlis, Sonja Hoffmann, Alexander de la Garza, Luis Kohlbacher, Oliver Nagel, Wolfgang E. Gesing, Sandra Performance Studies on Distributed Virtual Screening |
title | Performance Studies on Distributed Virtual Screening |
title_full | Performance Studies on Distributed Virtual Screening |
title_fullStr | Performance Studies on Distributed Virtual Screening |
title_full_unstemmed | Performance Studies on Distributed Virtual Screening |
title_short | Performance Studies on Distributed Virtual Screening |
title_sort | performance studies on distributed virtual screening |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4083208/ https://www.ncbi.nlm.nih.gov/pubmed/25032219 http://dx.doi.org/10.1155/2014/624024 |
work_keys_str_mv | AT krugerjens performancestudiesondistributedvirtualscreening AT grunzkerichard performancestudiesondistributedvirtualscreening AT herrespawlissonja performancestudiesondistributedvirtualscreening AT hoffmannalexander performancestudiesondistributedvirtualscreening AT delagarzaluis performancestudiesondistributedvirtualscreening AT kohlbacheroliver performancestudiesondistributedvirtualscreening AT nagelwolfgange performancestudiesondistributedvirtualscreening AT gesingsandra performancestudiesondistributedvirtualscreening |