Cargando…

Enhanced QSAR Model Performance by Integrating Structural and Gene Expression Information

Despite decades of intensive research and a number of demonstrable successes, quantitative structure-activity relationship (QSAR) models still fail to yield predictions with reasonable accuracy in some circumstances, especially when the QSAR paradox occurs. In this study, to avoid the QSAR paradox,...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Qian, Wu, Leihong, Liu, Wei, Xing, Li, Fan, Xiaohui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6270197/
https://www.ncbi.nlm.nih.gov/pubmed/24008242
http://dx.doi.org/10.3390/molecules180910789
_version_ 1783376642423390208
author Chen, Qian
Wu, Leihong
Liu, Wei
Xing, Li
Fan, Xiaohui
author_facet Chen, Qian
Wu, Leihong
Liu, Wei
Xing, Li
Fan, Xiaohui
author_sort Chen, Qian
collection PubMed
description Despite decades of intensive research and a number of demonstrable successes, quantitative structure-activity relationship (QSAR) models still fail to yield predictions with reasonable accuracy in some circumstances, especially when the QSAR paradox occurs. In this study, to avoid the QSAR paradox, we proposed a novel integrated approach to improve the model performance through using both structural and biological information from compounds. As a proof-of-concept, the integrated models were built on a toxicological dataset to predict non-genotoxic carcinogenicity of compounds, using not only the conventional molecular descriptors but also expression profiles of significant genes selected from microarray data. For test set data, our results demonstrated that the prediction accuracy of QSAR model was dramatically increased from 0.57 to 0.67 with incorporation of expression data of just one selected signature gene. Our successful integration of biological information into classic QSAR model provided a new insight and methodology for building predictive models especially when QSAR paradox occurred.
format Online
Article
Text
id pubmed-6270197
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-62701972018-12-18 Enhanced QSAR Model Performance by Integrating Structural and Gene Expression Information Chen, Qian Wu, Leihong Liu, Wei Xing, Li Fan, Xiaohui Molecules Article Despite decades of intensive research and a number of demonstrable successes, quantitative structure-activity relationship (QSAR) models still fail to yield predictions with reasonable accuracy in some circumstances, especially when the QSAR paradox occurs. In this study, to avoid the QSAR paradox, we proposed a novel integrated approach to improve the model performance through using both structural and biological information from compounds. As a proof-of-concept, the integrated models were built on a toxicological dataset to predict non-genotoxic carcinogenicity of compounds, using not only the conventional molecular descriptors but also expression profiles of significant genes selected from microarray data. For test set data, our results demonstrated that the prediction accuracy of QSAR model was dramatically increased from 0.57 to 0.67 with incorporation of expression data of just one selected signature gene. Our successful integration of biological information into classic QSAR model provided a new insight and methodology for building predictive models especially when QSAR paradox occurred. MDPI 2013-09-04 /pmc/articles/PMC6270197/ /pubmed/24008242 http://dx.doi.org/10.3390/molecules180910789 Text en © 2013 by the authors; licensee MDPI, Basel, Switzerland. http://creativecommons.org/licenses/by/3.0/ This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Article
Chen, Qian
Wu, Leihong
Liu, Wei
Xing, Li
Fan, Xiaohui
Enhanced QSAR Model Performance by Integrating Structural and Gene Expression Information
title Enhanced QSAR Model Performance by Integrating Structural and Gene Expression Information
title_full Enhanced QSAR Model Performance by Integrating Structural and Gene Expression Information
title_fullStr Enhanced QSAR Model Performance by Integrating Structural and Gene Expression Information
title_full_unstemmed Enhanced QSAR Model Performance by Integrating Structural and Gene Expression Information
title_short Enhanced QSAR Model Performance by Integrating Structural and Gene Expression Information
title_sort enhanced qsar model performance by integrating structural and gene expression information
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6270197/
https://www.ncbi.nlm.nih.gov/pubmed/24008242
http://dx.doi.org/10.3390/molecules180910789
work_keys_str_mv AT chenqian enhancedqsarmodelperformancebyintegratingstructuralandgeneexpressioninformation
AT wuleihong enhancedqsarmodelperformancebyintegratingstructuralandgeneexpressioninformation
AT liuwei enhancedqsarmodelperformancebyintegratingstructuralandgeneexpressioninformation
AT xingli enhancedqsarmodelperformancebyintegratingstructuralandgeneexpressioninformation
AT fanxiaohui enhancedqsarmodelperformancebyintegratingstructuralandgeneexpressioninformation