Cargando…

Asymmetric bagging and feature selection for activities prediction of drug molecules

BACKGROUND: Activities of drug molecules can be predicted by QSAR (quantitative structure activity relationship) models, which overcomes the disadvantages of high cost and long cycle by employing the traditional experimental method. With the fact that the number of drug molecules with positive activ...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Guo-Zheng, Meng, Hao-Hua, Lu, Wen-Cong, Yang, Jack Y, Yang, Mary Qu
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2423448/
https://www.ncbi.nlm.nih.gov/pubmed/18541060
http://dx.doi.org/10.1186/1471-2105-9-S6-S7
_version_ 1782156102039764992
author Li, Guo-Zheng
Meng, Hao-Hua
Lu, Wen-Cong
Yang, Jack Y
Yang, Mary Qu
author_facet Li, Guo-Zheng
Meng, Hao-Hua
Lu, Wen-Cong
Yang, Jack Y
Yang, Mary Qu
author_sort Li, Guo-Zheng
collection PubMed
description BACKGROUND: Activities of drug molecules can be predicted by QSAR (quantitative structure activity relationship) models, which overcomes the disadvantages of high cost and long cycle by employing the traditional experimental method. With the fact that the number of drug molecules with positive activity is rather fewer than that of negatives, it is important to predict molecular activities considering such an unbalanced situation. RESULTS: Here, asymmetric bagging and feature selection are introduced into the problem and asymmetric bagging of support vector machines (asBagging) is proposed on predicting drug activities to treat the unbalanced problem. At the same time, the features extracted from the structures of drug molecules affect prediction accuracy of QSAR models. Therefore, a novel algorithm named PRIFEAB is proposed, which applies an embedded feature selection method to remove redundant and irrelevant features for asBagging. Numerical experimental results on a data set of molecular activities show that asBagging improve the AUC and sensitivity values of molecular activities and PRIFEAB with feature selection further helps to improve the prediction ability. CONCLUSION: Asymmetric bagging can help to improve prediction accuracy of activities of drug molecules, which can be furthermore improved by performing feature selection to select relevant features from the drug molecules data sets.
format Text
id pubmed-2423448
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-24234482008-06-11 Asymmetric bagging and feature selection for activities prediction of drug molecules Li, Guo-Zheng Meng, Hao-Hua Lu, Wen-Cong Yang, Jack Y Yang, Mary Qu BMC Bioinformatics Research BACKGROUND: Activities of drug molecules can be predicted by QSAR (quantitative structure activity relationship) models, which overcomes the disadvantages of high cost and long cycle by employing the traditional experimental method. With the fact that the number of drug molecules with positive activity is rather fewer than that of negatives, it is important to predict molecular activities considering such an unbalanced situation. RESULTS: Here, asymmetric bagging and feature selection are introduced into the problem and asymmetric bagging of support vector machines (asBagging) is proposed on predicting drug activities to treat the unbalanced problem. At the same time, the features extracted from the structures of drug molecules affect prediction accuracy of QSAR models. Therefore, a novel algorithm named PRIFEAB is proposed, which applies an embedded feature selection method to remove redundant and irrelevant features for asBagging. Numerical experimental results on a data set of molecular activities show that asBagging improve the AUC and sensitivity values of molecular activities and PRIFEAB with feature selection further helps to improve the prediction ability. CONCLUSION: Asymmetric bagging can help to improve prediction accuracy of activities of drug molecules, which can be furthermore improved by performing feature selection to select relevant features from the drug molecules data sets. BioMed Central 2008-05-28 /pmc/articles/PMC2423448/ /pubmed/18541060 http://dx.doi.org/10.1186/1471-2105-9-S6-S7 Text en Copyright © 2008 Li et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Li, Guo-Zheng
Meng, Hao-Hua
Lu, Wen-Cong
Yang, Jack Y
Yang, Mary Qu
Asymmetric bagging and feature selection for activities prediction of drug molecules
title Asymmetric bagging and feature selection for activities prediction of drug molecules
title_full Asymmetric bagging and feature selection for activities prediction of drug molecules
title_fullStr Asymmetric bagging and feature selection for activities prediction of drug molecules
title_full_unstemmed Asymmetric bagging and feature selection for activities prediction of drug molecules
title_short Asymmetric bagging and feature selection for activities prediction of drug molecules
title_sort asymmetric bagging and feature selection for activities prediction of drug molecules
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2423448/
https://www.ncbi.nlm.nih.gov/pubmed/18541060
http://dx.doi.org/10.1186/1471-2105-9-S6-S7
work_keys_str_mv AT liguozheng asymmetricbaggingandfeatureselectionforactivitiespredictionofdrugmolecules
AT menghaohua asymmetricbaggingandfeatureselectionforactivitiespredictionofdrugmolecules
AT luwencong asymmetricbaggingandfeatureselectionforactivitiespredictionofdrugmolecules
AT yangjacky asymmetricbaggingandfeatureselectionforactivitiespredictionofdrugmolecules
AT yangmaryqu asymmetricbaggingandfeatureselectionforactivitiespredictionofdrugmolecules