Cargando…
Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure
The quantitative structure-activity relationship (QSAR) model searches for a reliable relationship between the chemical structure and biological activities in the field of drug design and discovery. (1) Background: In the study of QSAR, the chemical structures of compounds are encoded by a substanti...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5795980/ https://www.ncbi.nlm.nih.gov/pubmed/29271922 http://dx.doi.org/10.3390/ijms19010030 |
_version_ | 1783297404800335872 |
---|---|
author | Xia, Liang-Yong Wang, Yu-Wei Meng, De-Yu Yao, Xiao-Jun Chai, Hua Liang, Yong |
author_facet | Xia, Liang-Yong Wang, Yu-Wei Meng, De-Yu Yao, Xiao-Jun Chai, Hua Liang, Yong |
author_sort | Xia, Liang-Yong |
collection | PubMed |
description | The quantitative structure-activity relationship (QSAR) model searches for a reliable relationship between the chemical structure and biological activities in the field of drug design and discovery. (1) Background: In the study of QSAR, the chemical structures of compounds are encoded by a substantial number of descriptors. Some redundant, noisy and irrelevant descriptors result in a side-effect for the QSAR model. Meanwhile, too many descriptors can result in overfitting or low correlation between chemical structure and biological bioactivity. (2) Methods: We use novel log-sum regularization to select quite a few descriptors that are relevant to biological activities. In addition, a coordinate descent algorithm, which uses novel univariate log-sum thresholding for updating the estimated coefficients, has been developed for the QSAR model. (3) Results: Experimental results on artificial and four QSAR datasets demonstrate that our proposed log-sum method has good performance among state-of-the-art methods. (4) Conclusions: Our proposed multiple linear regression with log-sum penalty is an effective technique for both descriptor selection and prediction of biological activity. |
format | Online Article Text |
id | pubmed-5795980 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-57959802018-02-09 Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure Xia, Liang-Yong Wang, Yu-Wei Meng, De-Yu Yao, Xiao-Jun Chai, Hua Liang, Yong Int J Mol Sci Article The quantitative structure-activity relationship (QSAR) model searches for a reliable relationship between the chemical structure and biological activities in the field of drug design and discovery. (1) Background: In the study of QSAR, the chemical structures of compounds are encoded by a substantial number of descriptors. Some redundant, noisy and irrelevant descriptors result in a side-effect for the QSAR model. Meanwhile, too many descriptors can result in overfitting or low correlation between chemical structure and biological bioactivity. (2) Methods: We use novel log-sum regularization to select quite a few descriptors that are relevant to biological activities. In addition, a coordinate descent algorithm, which uses novel univariate log-sum thresholding for updating the estimated coefficients, has been developed for the QSAR model. (3) Results: Experimental results on artificial and four QSAR datasets demonstrate that our proposed log-sum method has good performance among state-of-the-art methods. (4) Conclusions: Our proposed multiple linear regression with log-sum penalty is an effective technique for both descriptor selection and prediction of biological activity. MDPI 2017-12-22 /pmc/articles/PMC5795980/ /pubmed/29271922 http://dx.doi.org/10.3390/ijms19010030 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Xia, Liang-Yong Wang, Yu-Wei Meng, De-Yu Yao, Xiao-Jun Chai, Hua Liang, Yong Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure |
title | Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure |
title_full | Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure |
title_fullStr | Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure |
title_full_unstemmed | Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure |
title_short | Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure |
title_sort | descriptor selection via log-sum regularization for the biological activities of chemical structure |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5795980/ https://www.ncbi.nlm.nih.gov/pubmed/29271922 http://dx.doi.org/10.3390/ijms19010030 |
work_keys_str_mv | AT xialiangyong descriptorselectionvialogsumregularizationforthebiologicalactivitiesofchemicalstructure AT wangyuwei descriptorselectionvialogsumregularizationforthebiologicalactivitiesofchemicalstructure AT mengdeyu descriptorselectionvialogsumregularizationforthebiologicalactivitiesofchemicalstructure AT yaoxiaojun descriptorselectionvialogsumregularizationforthebiologicalactivitiesofchemicalstructure AT chaihua descriptorselectionvialogsumregularizationforthebiologicalactivitiesofchemicalstructure AT liangyong descriptorselectionvialogsumregularizationforthebiologicalactivitiesofchemicalstructure |