Cargando…

Bayesian Multi-Plate High-Throughput Screening of Compounds

High-throughput screening of compounds (chemicals) is an essential part of drug discovery, involving thousands to millions of compounds, with the purpose of identifying candidate hits. Most statistical tools, including the industry standard B-score method, work on individual compound plates and do n...

Descripción completa

Detalles Bibliográficos
Autores principales: Shterev, Ivo D., Dunson, David B., Chan, Cliburn, Sempowski, Gregory D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6015058/
https://www.ncbi.nlm.nih.gov/pubmed/29934615
http://dx.doi.org/10.1038/s41598-018-27531-w
_version_ 1783334318191411200
author Shterev, Ivo D.
Dunson, David B.
Chan, Cliburn
Sempowski, Gregory D.
author_facet Shterev, Ivo D.
Dunson, David B.
Chan, Cliburn
Sempowski, Gregory D.
author_sort Shterev, Ivo D.
collection PubMed
description High-throughput screening of compounds (chemicals) is an essential part of drug discovery, involving thousands to millions of compounds, with the purpose of identifying candidate hits. Most statistical tools, including the industry standard B-score method, work on individual compound plates and do not exploit cross-plate correlation or statistical strength among plates. We present a new statistical framework for high-throughput screening of compounds based on Bayesian nonparametric modeling. The proposed approach is able to identify candidate hits from multiple plates simultaneously, sharing statistical strength among plates and providing more robust estimates of compound activity. It can flexibly accommodate arbitrary distributions of compound activities and is applicable to any plate geometry. The algorithm provides a principled statistical approach for hit identification and false discovery rate control. Experiments demonstrate significant improvements in hit identification sensitivity and specificity over the B-score and R-score methods, which are highly sensitive to threshold choice. These improvements are maintained at low hit rates. The framework is implemented as an efficient R extension package BHTSpack and is suitable for large scale data sets.
format Online
Article
Text
id pubmed-6015058
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-60150582018-07-06 Bayesian Multi-Plate High-Throughput Screening of Compounds Shterev, Ivo D. Dunson, David B. Chan, Cliburn Sempowski, Gregory D. Sci Rep Article High-throughput screening of compounds (chemicals) is an essential part of drug discovery, involving thousands to millions of compounds, with the purpose of identifying candidate hits. Most statistical tools, including the industry standard B-score method, work on individual compound plates and do not exploit cross-plate correlation or statistical strength among plates. We present a new statistical framework for high-throughput screening of compounds based on Bayesian nonparametric modeling. The proposed approach is able to identify candidate hits from multiple plates simultaneously, sharing statistical strength among plates and providing more robust estimates of compound activity. It can flexibly accommodate arbitrary distributions of compound activities and is applicable to any plate geometry. The algorithm provides a principled statistical approach for hit identification and false discovery rate control. Experiments demonstrate significant improvements in hit identification sensitivity and specificity over the B-score and R-score methods, which are highly sensitive to threshold choice. These improvements are maintained at low hit rates. The framework is implemented as an efficient R extension package BHTSpack and is suitable for large scale data sets. Nature Publishing Group UK 2018-06-22 /pmc/articles/PMC6015058/ /pubmed/29934615 http://dx.doi.org/10.1038/s41598-018-27531-w Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Shterev, Ivo D.
Dunson, David B.
Chan, Cliburn
Sempowski, Gregory D.
Bayesian Multi-Plate High-Throughput Screening of Compounds
title Bayesian Multi-Plate High-Throughput Screening of Compounds
title_full Bayesian Multi-Plate High-Throughput Screening of Compounds
title_fullStr Bayesian Multi-Plate High-Throughput Screening of Compounds
title_full_unstemmed Bayesian Multi-Plate High-Throughput Screening of Compounds
title_short Bayesian Multi-Plate High-Throughput Screening of Compounds
title_sort bayesian multi-plate high-throughput screening of compounds
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6015058/
https://www.ncbi.nlm.nih.gov/pubmed/29934615
http://dx.doi.org/10.1038/s41598-018-27531-w
work_keys_str_mv AT shterevivod bayesianmultiplatehighthroughputscreeningofcompounds
AT dunsondavidb bayesianmultiplatehighthroughputscreeningofcompounds
AT chancliburn bayesianmultiplatehighthroughputscreeningofcompounds
AT sempowskigregoryd bayesianmultiplatehighthroughputscreeningofcompounds