Cargando…
Bayesian Multi-Plate High-Throughput Screening of Compounds
High-throughput screening of compounds (chemicals) is an essential part of drug discovery, involving thousands to millions of compounds, with the purpose of identifying candidate hits. Most statistical tools, including the industry standard B-score method, work on individual compound plates and do n...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6015058/ https://www.ncbi.nlm.nih.gov/pubmed/29934615 http://dx.doi.org/10.1038/s41598-018-27531-w |
_version_ | 1783334318191411200 |
---|---|
author | Shterev, Ivo D. Dunson, David B. Chan, Cliburn Sempowski, Gregory D. |
author_facet | Shterev, Ivo D. Dunson, David B. Chan, Cliburn Sempowski, Gregory D. |
author_sort | Shterev, Ivo D. |
collection | PubMed |
description | High-throughput screening of compounds (chemicals) is an essential part of drug discovery, involving thousands to millions of compounds, with the purpose of identifying candidate hits. Most statistical tools, including the industry standard B-score method, work on individual compound plates and do not exploit cross-plate correlation or statistical strength among plates. We present a new statistical framework for high-throughput screening of compounds based on Bayesian nonparametric modeling. The proposed approach is able to identify candidate hits from multiple plates simultaneously, sharing statistical strength among plates and providing more robust estimates of compound activity. It can flexibly accommodate arbitrary distributions of compound activities and is applicable to any plate geometry. The algorithm provides a principled statistical approach for hit identification and false discovery rate control. Experiments demonstrate significant improvements in hit identification sensitivity and specificity over the B-score and R-score methods, which are highly sensitive to threshold choice. These improvements are maintained at low hit rates. The framework is implemented as an efficient R extension package BHTSpack and is suitable for large scale data sets. |
format | Online Article Text |
id | pubmed-6015058 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-60150582018-07-06 Bayesian Multi-Plate High-Throughput Screening of Compounds Shterev, Ivo D. Dunson, David B. Chan, Cliburn Sempowski, Gregory D. Sci Rep Article High-throughput screening of compounds (chemicals) is an essential part of drug discovery, involving thousands to millions of compounds, with the purpose of identifying candidate hits. Most statistical tools, including the industry standard B-score method, work on individual compound plates and do not exploit cross-plate correlation or statistical strength among plates. We present a new statistical framework for high-throughput screening of compounds based on Bayesian nonparametric modeling. The proposed approach is able to identify candidate hits from multiple plates simultaneously, sharing statistical strength among plates and providing more robust estimates of compound activity. It can flexibly accommodate arbitrary distributions of compound activities and is applicable to any plate geometry. The algorithm provides a principled statistical approach for hit identification and false discovery rate control. Experiments demonstrate significant improvements in hit identification sensitivity and specificity over the B-score and R-score methods, which are highly sensitive to threshold choice. These improvements are maintained at low hit rates. The framework is implemented as an efficient R extension package BHTSpack and is suitable for large scale data sets. Nature Publishing Group UK 2018-06-22 /pmc/articles/PMC6015058/ /pubmed/29934615 http://dx.doi.org/10.1038/s41598-018-27531-w Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Shterev, Ivo D. Dunson, David B. Chan, Cliburn Sempowski, Gregory D. Bayesian Multi-Plate High-Throughput Screening of Compounds |
title | Bayesian Multi-Plate High-Throughput Screening of Compounds |
title_full | Bayesian Multi-Plate High-Throughput Screening of Compounds |
title_fullStr | Bayesian Multi-Plate High-Throughput Screening of Compounds |
title_full_unstemmed | Bayesian Multi-Plate High-Throughput Screening of Compounds |
title_short | Bayesian Multi-Plate High-Throughput Screening of Compounds |
title_sort | bayesian multi-plate high-throughput screening of compounds |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6015058/ https://www.ncbi.nlm.nih.gov/pubmed/29934615 http://dx.doi.org/10.1038/s41598-018-27531-w |
work_keys_str_mv | AT shterevivod bayesianmultiplatehighthroughputscreeningofcompounds AT dunsondavidb bayesianmultiplatehighthroughputscreeningofcompounds AT chancliburn bayesianmultiplatehighthroughputscreeningofcompounds AT sempowskigregoryd bayesianmultiplatehighthroughputscreeningofcompounds |