Cargando…

QSPR Models for Predicting Log P(liver) Values for Volatile Organic Compounds Combining Statistical Methods and Domain Knowledge

Volatile organic compounds (VOCs) are contained in a variety of chemicals that can be found in household products and may have undesirable effects on health. Thereby, it is important to model blood-to-liver partition coefficients (log P(liver)) for VOCs in a fast and inexpensive way. In this paper,...

Descripción completa

Detalles Bibliográficos
Autores principales: Palomba, Damián, Martínez, María J., Ponzoni, Ignacio, Díaz, Mónica F., Vazquez, Gustavo E., Soto, Axel J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6268846/
https://www.ncbi.nlm.nih.gov/pubmed/23247367
http://dx.doi.org/10.3390/molecules171214937
_version_ 1783376380080160768
author Palomba, Damián
Martínez, María J.
Ponzoni, Ignacio
Díaz, Mónica F.
Vazquez, Gustavo E.
Soto, Axel J.
author_facet Palomba, Damián
Martínez, María J.
Ponzoni, Ignacio
Díaz, Mónica F.
Vazquez, Gustavo E.
Soto, Axel J.
author_sort Palomba, Damián
collection PubMed
description Volatile organic compounds (VOCs) are contained in a variety of chemicals that can be found in household products and may have undesirable effects on health. Thereby, it is important to model blood-to-liver partition coefficients (log P(liver)) for VOCs in a fast and inexpensive way. In this paper, we present two new quantitative structure-property relationship (QSPR) models for the prediction of log P(liver), where we also propose a hybrid approach for the selection of the descriptors. This hybrid methodology combines a machine learning method with a manual selection based on expert knowledge. This allows obtaining a set of descriptors that is interpretable in physicochemical terms. Our regression models were trained using decision trees and neural networks and validated using an external test set. Results show high prediction accuracy compared to previous log P(liver) models, and the descriptor selection approach provides a means to get a small set of descriptors that is in agreement with theoretical understanding of the target property.
format Online
Article
Text
id pubmed-6268846
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-62688462018-12-14 QSPR Models for Predicting Log P(liver) Values for Volatile Organic Compounds Combining Statistical Methods and Domain Knowledge Palomba, Damián Martínez, María J. Ponzoni, Ignacio Díaz, Mónica F. Vazquez, Gustavo E. Soto, Axel J. Molecules Article Volatile organic compounds (VOCs) are contained in a variety of chemicals that can be found in household products and may have undesirable effects on health. Thereby, it is important to model blood-to-liver partition coefficients (log P(liver)) for VOCs in a fast and inexpensive way. In this paper, we present two new quantitative structure-property relationship (QSPR) models for the prediction of log P(liver), where we also propose a hybrid approach for the selection of the descriptors. This hybrid methodology combines a machine learning method with a manual selection based on expert knowledge. This allows obtaining a set of descriptors that is interpretable in physicochemical terms. Our regression models were trained using decision trees and neural networks and validated using an external test set. Results show high prediction accuracy compared to previous log P(liver) models, and the descriptor selection approach provides a means to get a small set of descriptors that is in agreement with theoretical understanding of the target property. MDPI 2012-12-17 /pmc/articles/PMC6268846/ /pubmed/23247367 http://dx.doi.org/10.3390/molecules171214937 Text en © 2012 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Article
Palomba, Damián
Martínez, María J.
Ponzoni, Ignacio
Díaz, Mónica F.
Vazquez, Gustavo E.
Soto, Axel J.
QSPR Models for Predicting Log P(liver) Values for Volatile Organic Compounds Combining Statistical Methods and Domain Knowledge
title QSPR Models for Predicting Log P(liver) Values for Volatile Organic Compounds Combining Statistical Methods and Domain Knowledge
title_full QSPR Models for Predicting Log P(liver) Values for Volatile Organic Compounds Combining Statistical Methods and Domain Knowledge
title_fullStr QSPR Models for Predicting Log P(liver) Values for Volatile Organic Compounds Combining Statistical Methods and Domain Knowledge
title_full_unstemmed QSPR Models for Predicting Log P(liver) Values for Volatile Organic Compounds Combining Statistical Methods and Domain Knowledge
title_short QSPR Models for Predicting Log P(liver) Values for Volatile Organic Compounds Combining Statistical Methods and Domain Knowledge
title_sort qspr models for predicting log p(liver) values for volatile organic compounds combining statistical methods and domain knowledge
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6268846/
https://www.ncbi.nlm.nih.gov/pubmed/23247367
http://dx.doi.org/10.3390/molecules171214937
work_keys_str_mv AT palombadamian qsprmodelsforpredictinglogplivervaluesforvolatileorganiccompoundscombiningstatisticalmethodsanddomainknowledge
AT martinezmariaj qsprmodelsforpredictinglogplivervaluesforvolatileorganiccompoundscombiningstatisticalmethodsanddomainknowledge
AT ponzoniignacio qsprmodelsforpredictinglogplivervaluesforvolatileorganiccompoundscombiningstatisticalmethodsanddomainknowledge
AT diazmonicaf qsprmodelsforpredictinglogplivervaluesforvolatileorganiccompoundscombiningstatisticalmethodsanddomainknowledge
AT vazquezgustavoe qsprmodelsforpredictinglogplivervaluesforvolatileorganiccompoundscombiningstatisticalmethodsanddomainknowledge
AT sotoaxelj qsprmodelsforpredictinglogplivervaluesforvolatileorganiccompoundscombiningstatisticalmethodsanddomainknowledge