Cargando…
Can human experts predict solubility better than computers?
In this study, we design and carry out a survey, asking human experts to predict the aqueous solubility of druglike organic compounds. We investigate whether these experts, drawn largely from the pharmaceutical industry and academia, can match or exceed the predictive power of algorithms. Alongside...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5729181/ https://www.ncbi.nlm.nih.gov/pubmed/29238891 http://dx.doi.org/10.1186/s13321-017-0250-y |
_version_ | 1783286140771500032 |
---|---|
author | Boobier, Samuel Osbourn, Anne Mitchell, John B. O. |
author_facet | Boobier, Samuel Osbourn, Anne Mitchell, John B. O. |
author_sort | Boobier, Samuel |
collection | PubMed |
description | In this study, we design and carry out a survey, asking human experts to predict the aqueous solubility of druglike organic compounds. We investigate whether these experts, drawn largely from the pharmaceutical industry and academia, can match or exceed the predictive power of algorithms. Alongside this, we implement 10 typical machine learning algorithms on the same dataset. The best algorithm, a variety of neural network known as a multi-layer perceptron, gave an RMSE of 0.985 log S units and an R(2) of 0.706. We would not have predicted the relative success of this particular algorithm in advance. We found that the best individual human predictor generated an almost identical prediction quality with an RMSE of 0.942 log S units and an R(2) of 0.723. The collection of algorithms contained a higher proportion of reasonably good predictors, nine out of ten compared with around half of the humans. We found that, for either humans or algorithms, combining individual predictions into a consensus predictor by taking their median generated excellent predictivity. While our consensus human predictor achieved very slightly better headline figures on various statistical measures, the difference between it and the consensus machine learning predictor was both small and statistically insignificant. We conclude that human experts can predict the aqueous solubility of druglike molecules essentially equally well as machine learning algorithms. We find that, for either humans or algorithms, combining individual predictions into a consensus predictor by taking their median is a powerful way of benefitting from the wisdom of crowds. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13321-017-0250-y) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5729181 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-57291812017-12-14 Can human experts predict solubility better than computers? Boobier, Samuel Osbourn, Anne Mitchell, John B. O. J Cheminform Research Article In this study, we design and carry out a survey, asking human experts to predict the aqueous solubility of druglike organic compounds. We investigate whether these experts, drawn largely from the pharmaceutical industry and academia, can match or exceed the predictive power of algorithms. Alongside this, we implement 10 typical machine learning algorithms on the same dataset. The best algorithm, a variety of neural network known as a multi-layer perceptron, gave an RMSE of 0.985 log S units and an R(2) of 0.706. We would not have predicted the relative success of this particular algorithm in advance. We found that the best individual human predictor generated an almost identical prediction quality with an RMSE of 0.942 log S units and an R(2) of 0.723. The collection of algorithms contained a higher proportion of reasonably good predictors, nine out of ten compared with around half of the humans. We found that, for either humans or algorithms, combining individual predictions into a consensus predictor by taking their median generated excellent predictivity. While our consensus human predictor achieved very slightly better headline figures on various statistical measures, the difference between it and the consensus machine learning predictor was both small and statistically insignificant. We conclude that human experts can predict the aqueous solubility of druglike molecules essentially equally well as machine learning algorithms. We find that, for either humans or algorithms, combining individual predictions into a consensus predictor by taking their median is a powerful way of benefitting from the wisdom of crowds. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13321-017-0250-y) contains supplementary material, which is available to authorized users. Springer International Publishing 2017-12-13 /pmc/articles/PMC5729181/ /pubmed/29238891 http://dx.doi.org/10.1186/s13321-017-0250-y Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Boobier, Samuel Osbourn, Anne Mitchell, John B. O. Can human experts predict solubility better than computers? |
title | Can human experts predict solubility better than computers? |
title_full | Can human experts predict solubility better than computers? |
title_fullStr | Can human experts predict solubility better than computers? |
title_full_unstemmed | Can human experts predict solubility better than computers? |
title_short | Can human experts predict solubility better than computers? |
title_sort | can human experts predict solubility better than computers? |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5729181/ https://www.ncbi.nlm.nih.gov/pubmed/29238891 http://dx.doi.org/10.1186/s13321-017-0250-y |
work_keys_str_mv | AT boobiersamuel canhumanexpertspredictsolubilitybetterthancomputers AT osbournanne canhumanexpertspredictsolubilitybetterthancomputers AT mitchelljohnbo canhumanexpertspredictsolubilitybetterthancomputers |