Cargando…
QSPR Models for Predicting Log P(liver) Values for Volatile Organic Compounds Combining Statistical Methods and Domain Knowledge
Volatile organic compounds (VOCs) are contained in a variety of chemicals that can be found in household products and may have undesirable effects on health. Thereby, it is important to model blood-to-liver partition coefficients (log P(liver)) for VOCs in a fast and inexpensive way. In this paper,...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6268846/ https://www.ncbi.nlm.nih.gov/pubmed/23247367 http://dx.doi.org/10.3390/molecules171214937 |
_version_ | 1783376380080160768 |
---|---|
author | Palomba, Damián Martínez, María J. Ponzoni, Ignacio Díaz, Mónica F. Vazquez, Gustavo E. Soto, Axel J. |
author_facet | Palomba, Damián Martínez, María J. Ponzoni, Ignacio Díaz, Mónica F. Vazquez, Gustavo E. Soto, Axel J. |
author_sort | Palomba, Damián |
collection | PubMed |
description | Volatile organic compounds (VOCs) are contained in a variety of chemicals that can be found in household products and may have undesirable effects on health. Thereby, it is important to model blood-to-liver partition coefficients (log P(liver)) for VOCs in a fast and inexpensive way. In this paper, we present two new quantitative structure-property relationship (QSPR) models for the prediction of log P(liver), where we also propose a hybrid approach for the selection of the descriptors. This hybrid methodology combines a machine learning method with a manual selection based on expert knowledge. This allows obtaining a set of descriptors that is interpretable in physicochemical terms. Our regression models were trained using decision trees and neural networks and validated using an external test set. Results show high prediction accuracy compared to previous log P(liver) models, and the descriptor selection approach provides a means to get a small set of descriptors that is in agreement with theoretical understanding of the target property. |
format | Online Article Text |
id | pubmed-6268846 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-62688462018-12-14 QSPR Models for Predicting Log P(liver) Values for Volatile Organic Compounds Combining Statistical Methods and Domain Knowledge Palomba, Damián Martínez, María J. Ponzoni, Ignacio Díaz, Mónica F. Vazquez, Gustavo E. Soto, Axel J. Molecules Article Volatile organic compounds (VOCs) are contained in a variety of chemicals that can be found in household products and may have undesirable effects on health. Thereby, it is important to model blood-to-liver partition coefficients (log P(liver)) for VOCs in a fast and inexpensive way. In this paper, we present two new quantitative structure-property relationship (QSPR) models for the prediction of log P(liver), where we also propose a hybrid approach for the selection of the descriptors. This hybrid methodology combines a machine learning method with a manual selection based on expert knowledge. This allows obtaining a set of descriptors that is interpretable in physicochemical terms. Our regression models were trained using decision trees and neural networks and validated using an external test set. Results show high prediction accuracy compared to previous log P(liver) models, and the descriptor selection approach provides a means to get a small set of descriptors that is in agreement with theoretical understanding of the target property. MDPI 2012-12-17 /pmc/articles/PMC6268846/ /pubmed/23247367 http://dx.doi.org/10.3390/molecules171214937 Text en © 2012 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/). |
spellingShingle | Article Palomba, Damián Martínez, María J. Ponzoni, Ignacio Díaz, Mónica F. Vazquez, Gustavo E. Soto, Axel J. QSPR Models for Predicting Log P(liver) Values for Volatile Organic Compounds Combining Statistical Methods and Domain Knowledge |
title | QSPR Models for Predicting Log P(liver) Values for Volatile Organic Compounds Combining Statistical Methods and Domain Knowledge |
title_full | QSPR Models for Predicting Log P(liver) Values for Volatile Organic Compounds Combining Statistical Methods and Domain Knowledge |
title_fullStr | QSPR Models for Predicting Log P(liver) Values for Volatile Organic Compounds Combining Statistical Methods and Domain Knowledge |
title_full_unstemmed | QSPR Models for Predicting Log P(liver) Values for Volatile Organic Compounds Combining Statistical Methods and Domain Knowledge |
title_short | QSPR Models for Predicting Log P(liver) Values for Volatile Organic Compounds Combining Statistical Methods and Domain Knowledge |
title_sort | qspr models for predicting log p(liver) values for volatile organic compounds combining statistical methods and domain knowledge |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6268846/ https://www.ncbi.nlm.nih.gov/pubmed/23247367 http://dx.doi.org/10.3390/molecules171214937 |
work_keys_str_mv | AT palombadamian qsprmodelsforpredictinglogplivervaluesforvolatileorganiccompoundscombiningstatisticalmethodsanddomainknowledge AT martinezmariaj qsprmodelsforpredictinglogplivervaluesforvolatileorganiccompoundscombiningstatisticalmethodsanddomainknowledge AT ponzoniignacio qsprmodelsforpredictinglogplivervaluesforvolatileorganiccompoundscombiningstatisticalmethodsanddomainknowledge AT diazmonicaf qsprmodelsforpredictinglogplivervaluesforvolatileorganiccompoundscombiningstatisticalmethodsanddomainknowledge AT vazquezgustavoe qsprmodelsforpredictinglogplivervaluesforvolatileorganiccompoundscombiningstatisticalmethodsanddomainknowledge AT sotoaxelj qsprmodelsforpredictinglogplivervaluesforvolatileorganiccompoundscombiningstatisticalmethodsanddomainknowledge |