Cargando…

Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure

The quantitative structure-activity relationship (QSAR) model searches for a reliable relationship between the chemical structure and biological activities in the field of drug design and discovery. (1) Background: In the study of QSAR, the chemical structures of compounds are encoded by a substanti...

Descripción completa

Detalles Bibliográficos
Autores principales: Xia, Liang-Yong, Wang, Yu-Wei, Meng, De-Yu, Yao, Xiao-Jun, Chai, Hua, Liang, Yong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5795980/
https://www.ncbi.nlm.nih.gov/pubmed/29271922
http://dx.doi.org/10.3390/ijms19010030
_version_ 1783297404800335872
author Xia, Liang-Yong
Wang, Yu-Wei
Meng, De-Yu
Yao, Xiao-Jun
Chai, Hua
Liang, Yong
author_facet Xia, Liang-Yong
Wang, Yu-Wei
Meng, De-Yu
Yao, Xiao-Jun
Chai, Hua
Liang, Yong
author_sort Xia, Liang-Yong
collection PubMed
description The quantitative structure-activity relationship (QSAR) model searches for a reliable relationship between the chemical structure and biological activities in the field of drug design and discovery. (1) Background: In the study of QSAR, the chemical structures of compounds are encoded by a substantial number of descriptors. Some redundant, noisy and irrelevant descriptors result in a side-effect for the QSAR model. Meanwhile, too many descriptors can result in overfitting or low correlation between chemical structure and biological bioactivity. (2) Methods: We use novel log-sum regularization to select quite a few descriptors that are relevant to biological activities. In addition, a coordinate descent algorithm, which uses novel univariate log-sum thresholding for updating the estimated coefficients, has been developed for the QSAR model. (3) Results: Experimental results on artificial and four QSAR datasets demonstrate that our proposed log-sum method has good performance among state-of-the-art methods. (4) Conclusions: Our proposed multiple linear regression with log-sum penalty is an effective technique for both descriptor selection and prediction of biological activity.
format Online
Article
Text
id pubmed-5795980
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-57959802018-02-09 Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure Xia, Liang-Yong Wang, Yu-Wei Meng, De-Yu Yao, Xiao-Jun Chai, Hua Liang, Yong Int J Mol Sci Article The quantitative structure-activity relationship (QSAR) model searches for a reliable relationship between the chemical structure and biological activities in the field of drug design and discovery. (1) Background: In the study of QSAR, the chemical structures of compounds are encoded by a substantial number of descriptors. Some redundant, noisy and irrelevant descriptors result in a side-effect for the QSAR model. Meanwhile, too many descriptors can result in overfitting or low correlation between chemical structure and biological bioactivity. (2) Methods: We use novel log-sum regularization to select quite a few descriptors that are relevant to biological activities. In addition, a coordinate descent algorithm, which uses novel univariate log-sum thresholding for updating the estimated coefficients, has been developed for the QSAR model. (3) Results: Experimental results on artificial and four QSAR datasets demonstrate that our proposed log-sum method has good performance among state-of-the-art methods. (4) Conclusions: Our proposed multiple linear regression with log-sum penalty is an effective technique for both descriptor selection and prediction of biological activity. MDPI 2017-12-22 /pmc/articles/PMC5795980/ /pubmed/29271922 http://dx.doi.org/10.3390/ijms19010030 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Xia, Liang-Yong
Wang, Yu-Wei
Meng, De-Yu
Yao, Xiao-Jun
Chai, Hua
Liang, Yong
Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure
title Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure
title_full Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure
title_fullStr Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure
title_full_unstemmed Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure
title_short Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure
title_sort descriptor selection via log-sum regularization for the biological activities of chemical structure
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5795980/
https://www.ncbi.nlm.nih.gov/pubmed/29271922
http://dx.doi.org/10.3390/ijms19010030
work_keys_str_mv AT xialiangyong descriptorselectionvialogsumregularizationforthebiologicalactivitiesofchemicalstructure
AT wangyuwei descriptorselectionvialogsumregularizationforthebiologicalactivitiesofchemicalstructure
AT mengdeyu descriptorselectionvialogsumregularizationforthebiologicalactivitiesofchemicalstructure
AT yaoxiaojun descriptorselectionvialogsumregularizationforthebiologicalactivitiesofchemicalstructure
AT chaihua descriptorselectionvialogsumregularizationforthebiologicalactivitiesofchemicalstructure
AT liangyong descriptorselectionvialogsumregularizationforthebiologicalactivitiesofchemicalstructure