Cargando…

Predicting volume of distribution with decision tree-based regression methods using predicted tissue:plasma partition coefficients

BACKGROUND: Volume of distribution is an important pharmacokinetic property that indicates the extent of a drug’s distribution in the body tissues. This paper addresses the problem of how to estimate the apparent volume of distribution at steady state (Vss) of chemical compounds in the human body us...

Descripción completa

Detalles Bibliográficos
Autores principales: Freitas, Alex A, Limbu, Kriti, Ghafourian, Taravat
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4356883/
https://www.ncbi.nlm.nih.gov/pubmed/25767566
http://dx.doi.org/10.1186/s13321-015-0054-x
_version_ 1782361058553364480
author Freitas, Alex A
Limbu, Kriti
Ghafourian, Taravat
author_facet Freitas, Alex A
Limbu, Kriti
Ghafourian, Taravat
author_sort Freitas, Alex A
collection PubMed
description BACKGROUND: Volume of distribution is an important pharmacokinetic property that indicates the extent of a drug’s distribution in the body tissues. This paper addresses the problem of how to estimate the apparent volume of distribution at steady state (Vss) of chemical compounds in the human body using decision tree-based regression methods from the area of data mining (or machine learning). Hence, the pros and cons of several different types of decision tree-based regression methods have been discussed. The regression methods predict Vss using, as predictive features, both the compounds’ molecular descriptors and the compounds’ tissue:plasma partition coefficients (K(t:p)) – often used in physiologically-based pharmacokinetics. Therefore, this work has assessed whether the data mining-based prediction of Vss can be made more accurate by using as input not only the compounds’ molecular descriptors but also (a subset of) their predicted K(t:p) values. RESULTS: Comparison of the models that used only molecular descriptors, in particular, the Bagging decision tree (mean fold error of 2.33), with those employing predicted K(t:p) values in addition to the molecular descriptors, such as the Bagging decision tree using adipose K(t:p) (mean fold error of 2.29), indicated that the use of predicted K(t:p) values as descriptors may be beneficial for accurate prediction of Vss using decision trees if prior feature selection is applied. CONCLUSIONS: Decision tree based models presented in this work have an accuracy that is reasonable and similar to the accuracy of reported Vss inter-species extrapolations in the literature. The estimation of Vss for new compounds in drug discovery will benefit from methods that are able to integrate large and varied sources of data and flexible non-linear data mining methods such as decision trees, which can produce interpretable models. [Figure: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-015-0054-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4356883
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-43568832015-03-13 Predicting volume of distribution with decision tree-based regression methods using predicted tissue:plasma partition coefficients Freitas, Alex A Limbu, Kriti Ghafourian, Taravat J Cheminform Research Article BACKGROUND: Volume of distribution is an important pharmacokinetic property that indicates the extent of a drug’s distribution in the body tissues. This paper addresses the problem of how to estimate the apparent volume of distribution at steady state (Vss) of chemical compounds in the human body using decision tree-based regression methods from the area of data mining (or machine learning). Hence, the pros and cons of several different types of decision tree-based regression methods have been discussed. The regression methods predict Vss using, as predictive features, both the compounds’ molecular descriptors and the compounds’ tissue:plasma partition coefficients (K(t:p)) – often used in physiologically-based pharmacokinetics. Therefore, this work has assessed whether the data mining-based prediction of Vss can be made more accurate by using as input not only the compounds’ molecular descriptors but also (a subset of) their predicted K(t:p) values. RESULTS: Comparison of the models that used only molecular descriptors, in particular, the Bagging decision tree (mean fold error of 2.33), with those employing predicted K(t:p) values in addition to the molecular descriptors, such as the Bagging decision tree using adipose K(t:p) (mean fold error of 2.29), indicated that the use of predicted K(t:p) values as descriptors may be beneficial for accurate prediction of Vss using decision trees if prior feature selection is applied. CONCLUSIONS: Decision tree based models presented in this work have an accuracy that is reasonable and similar to the accuracy of reported Vss inter-species extrapolations in the literature. The estimation of Vss for new compounds in drug discovery will benefit from methods that are able to integrate large and varied sources of data and flexible non-linear data mining methods such as decision trees, which can produce interpretable models. [Figure: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-015-0054-x) contains supplementary material, which is available to authorized users. Springer International Publishing 2015-02-26 /pmc/articles/PMC4356883/ /pubmed/25767566 http://dx.doi.org/10.1186/s13321-015-0054-x Text en © Freitas et al.; licensee Springer. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Freitas, Alex A
Limbu, Kriti
Ghafourian, Taravat
Predicting volume of distribution with decision tree-based regression methods using predicted tissue:plasma partition coefficients
title Predicting volume of distribution with decision tree-based regression methods using predicted tissue:plasma partition coefficients
title_full Predicting volume of distribution with decision tree-based regression methods using predicted tissue:plasma partition coefficients
title_fullStr Predicting volume of distribution with decision tree-based regression methods using predicted tissue:plasma partition coefficients
title_full_unstemmed Predicting volume of distribution with decision tree-based regression methods using predicted tissue:plasma partition coefficients
title_short Predicting volume of distribution with decision tree-based regression methods using predicted tissue:plasma partition coefficients
title_sort predicting volume of distribution with decision tree-based regression methods using predicted tissue:plasma partition coefficients
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4356883/
https://www.ncbi.nlm.nih.gov/pubmed/25767566
http://dx.doi.org/10.1186/s13321-015-0054-x
work_keys_str_mv AT freitasalexa predictingvolumeofdistributionwithdecisiontreebasedregressionmethodsusingpredictedtissueplasmapartitioncoefficients
AT limbukriti predictingvolumeofdistributionwithdecisiontreebasedregressionmethodsusingpredictedtissueplasmapartitioncoefficients
AT ghafouriantaravat predictingvolumeofdistributionwithdecisiontreebasedregressionmethodsusingpredictedtissueplasmapartitioncoefficients