Cargando…

Multiple linear regression models for predicting the n‑octanol/water partition coefficients in the SAMPL7 blind challenge

A multiple linear regression model called MLR-3 is used for predicting the experimental n-octanol/water partition coefficient (log P(N)) of 22 N-sulfonamides proposed by the organizers of the SAMPL7 blind challenge. The MLR-3 method was trained with 82 molecules including drug-like sulfonamides and...

Descripción completa

Detalles Bibliográficos
Autores principales: Lopez, Kenneth, Pinheiro, Silvana, Zamora, William J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8273033/
https://www.ncbi.nlm.nih.gov/pubmed/34251523
http://dx.doi.org/10.1007/s10822-021-00409-2
_version_ 1783721301729345536
author Lopez, Kenneth
Pinheiro, Silvana
Zamora, William J.
author_facet Lopez, Kenneth
Pinheiro, Silvana
Zamora, William J.
author_sort Lopez, Kenneth
collection PubMed
description A multiple linear regression model called MLR-3 is used for predicting the experimental n-octanol/water partition coefficient (log P(N)) of 22 N-sulfonamides proposed by the organizers of the SAMPL7 blind challenge. The MLR-3 method was trained with 82 molecules including drug-like sulfonamides and small organic molecules, which resembled the main functional groups present in the challenge dataset. Our model, submitted as “TFE-MLR”, presented a root-mean-square error of 0.58 and mean absolute error of 0.41 in log P units, accomplishing the highest accuracy, among empirical methods and also in all submissions based on the ranked ones. Overall, the results support the appropriateness of multiple linear regression approach MLR-3 for computing the n-octanol/water partition coefficient in sulfonamide-bearing compounds. In this context, the outstanding performance of empirical methodologies, where 75% of the ranked submissions achieved root-mean-square errors < 1 log P units, support the suitability of these strategies for obtaining accurate and fast predictions of physicochemical properties as partition coefficients of bioorganic compounds. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s10822-021-00409-2.
format Online
Article
Text
id pubmed-8273033
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-82730332021-07-12 Multiple linear regression models for predicting the n‑octanol/water partition coefficients in the SAMPL7 blind challenge Lopez, Kenneth Pinheiro, Silvana Zamora, William J. J Comput Aided Mol Des Article A multiple linear regression model called MLR-3 is used for predicting the experimental n-octanol/water partition coefficient (log P(N)) of 22 N-sulfonamides proposed by the organizers of the SAMPL7 blind challenge. The MLR-3 method was trained with 82 molecules including drug-like sulfonamides and small organic molecules, which resembled the main functional groups present in the challenge dataset. Our model, submitted as “TFE-MLR”, presented a root-mean-square error of 0.58 and mean absolute error of 0.41 in log P units, accomplishing the highest accuracy, among empirical methods and also in all submissions based on the ranked ones. Overall, the results support the appropriateness of multiple linear regression approach MLR-3 for computing the n-octanol/water partition coefficient in sulfonamide-bearing compounds. In this context, the outstanding performance of empirical methodologies, where 75% of the ranked submissions achieved root-mean-square errors < 1 log P units, support the suitability of these strategies for obtaining accurate and fast predictions of physicochemical properties as partition coefficients of bioorganic compounds. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s10822-021-00409-2. Springer International Publishing 2021-07-12 2021 /pmc/articles/PMC8273033/ /pubmed/34251523 http://dx.doi.org/10.1007/s10822-021-00409-2 Text en © The Author(s), under exclusive licence to Springer Nature Switzerland AG 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Lopez, Kenneth
Pinheiro, Silvana
Zamora, William J.
Multiple linear regression models for predicting the n‑octanol/water partition coefficients in the SAMPL7 blind challenge
title Multiple linear regression models for predicting the n‑octanol/water partition coefficients in the SAMPL7 blind challenge
title_full Multiple linear regression models for predicting the n‑octanol/water partition coefficients in the SAMPL7 blind challenge
title_fullStr Multiple linear regression models for predicting the n‑octanol/water partition coefficients in the SAMPL7 blind challenge
title_full_unstemmed Multiple linear regression models for predicting the n‑octanol/water partition coefficients in the SAMPL7 blind challenge
title_short Multiple linear regression models for predicting the n‑octanol/water partition coefficients in the SAMPL7 blind challenge
title_sort multiple linear regression models for predicting the n‑octanol/water partition coefficients in the sampl7 blind challenge
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8273033/
https://www.ncbi.nlm.nih.gov/pubmed/34251523
http://dx.doi.org/10.1007/s10822-021-00409-2
work_keys_str_mv AT lopezkenneth multiplelinearregressionmodelsforpredictingthenoctanolwaterpartitioncoefficientsinthesampl7blindchallenge
AT pinheirosilvana multiplelinearregressionmodelsforpredictingthenoctanolwaterpartitioncoefficientsinthesampl7blindchallenge
AT zamorawilliamj multiplelinearregressionmodelsforpredictingthenoctanolwaterpartitioncoefficientsinthesampl7blindchallenge