Cargando…

Three machine learning models for the 2019 Solubility Challenge

We describe three machine learning models submitted to the 2019 Solubility Challenge. All are founded on tree-like classifiers, with one model being based on Random Forest and another on the related Extra Trees algorithm. The third model is a consensus predictor combining the former two with a Baggi...

Descripción completa

Detalles Bibliográficos
Autor principal: Mitchell, John B. O.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: International Association of Physical Chemists 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8915607/
https://www.ncbi.nlm.nih.gov/pubmed/35300305
http://dx.doi.org/10.5599/admet.835
_version_ 1784668070178979840
author Mitchell, John B. O.
author_facet Mitchell, John B. O.
author_sort Mitchell, John B. O.
collection PubMed
description We describe three machine learning models submitted to the 2019 Solubility Challenge. All are founded on tree-like classifiers, with one model being based on Random Forest and another on the related Extra Trees algorithm. The third model is a consensus predictor combining the former two with a Bagging classifier. We call this consensus classifier Vox Machinarum, and here discuss how it benefits from the Wisdom of Crowds. On the first 2019 Solubility Challenge test set of 100 low-variance intrinsic aqueous solubilities, Extra Trees is our best classifier. One the other, a high-variance set of 32 molecules, we find that Vox Machinarum and Random Forest both perform a little better than Extra Trees, and almost equally to one another. We also compare the gold standard solubilities from the 2019 Solubility Challenge with a set of literature-based solubilities for most of the same compounds.
format Online
Article
Text
id pubmed-8915607
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher International Association of Physical Chemists
record_format MEDLINE/PubMed
spelling pubmed-89156072022-03-16 Three machine learning models for the 2019 Solubility Challenge Mitchell, John B. O. ADMET DMPK Original Scientific Papers We describe three machine learning models submitted to the 2019 Solubility Challenge. All are founded on tree-like classifiers, with one model being based on Random Forest and another on the related Extra Trees algorithm. The third model is a consensus predictor combining the former two with a Bagging classifier. We call this consensus classifier Vox Machinarum, and here discuss how it benefits from the Wisdom of Crowds. On the first 2019 Solubility Challenge test set of 100 low-variance intrinsic aqueous solubilities, Extra Trees is our best classifier. One the other, a high-variance set of 32 molecules, we find that Vox Machinarum and Random Forest both perform a little better than Extra Trees, and almost equally to one another. We also compare the gold standard solubilities from the 2019 Solubility Challenge with a set of literature-based solubilities for most of the same compounds. International Association of Physical Chemists 2020-06-15 /pmc/articles/PMC8915607/ /pubmed/35300305 http://dx.doi.org/10.5599/admet.835 Text en Copyright © 2020 by the authors. https://creativecommons.org/licenses/by/4.0/This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ).
spellingShingle Original Scientific Papers
Mitchell, John B. O.
Three machine learning models for the 2019 Solubility Challenge
title Three machine learning models for the 2019 Solubility Challenge
title_full Three machine learning models for the 2019 Solubility Challenge
title_fullStr Three machine learning models for the 2019 Solubility Challenge
title_full_unstemmed Three machine learning models for the 2019 Solubility Challenge
title_short Three machine learning models for the 2019 Solubility Challenge
title_sort three machine learning models for the 2019 solubility challenge
topic Original Scientific Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8915607/
https://www.ncbi.nlm.nih.gov/pubmed/35300305
http://dx.doi.org/10.5599/admet.835
work_keys_str_mv AT mitchelljohnbo threemachinelearningmodelsforthe2019solubilitychallenge