Cargando…
A Classification Study of Respiratory Syncytial Virus (RSV) Inhibitors by Variable Selection with Random Forest
Experimental pEC(50)s for 216 selective respiratory syncytial virus (RSV) inhibitors are used to develop classification models as a potential screening tool for a large library of target compounds. Variable selection algorithm coupled with random forests (VS-RF) is used to extract the physicochemica...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Molecular Diversity Preservation International (MDPI)
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3083704/ https://www.ncbi.nlm.nih.gov/pubmed/21541057 http://dx.doi.org/10.3390/ijms12021259 |
_version_ | 1782202443976671232 |
---|---|
author | Hao, Ming Li, Yan Wang, Yonghua Zhang, Shuwei |
author_facet | Hao, Ming Li, Yan Wang, Yonghua Zhang, Shuwei |
author_sort | Hao, Ming |
collection | PubMed |
description | Experimental pEC(50)s for 216 selective respiratory syncytial virus (RSV) inhibitors are used to develop classification models as a potential screening tool for a large library of target compounds. Variable selection algorithm coupled with random forests (VS-RF) is used to extract the physicochemical features most relevant to the RSV inhibition. Based on the selected small set of descriptors, four other widely used approaches, i.e., support vector machine (SVM), Gaussian process (GP), linear discriminant analysis (LDA) and k nearest neighbors (kNN) routines are also employed and compared with the VS-RF method in terms of several of rigorous evaluation criteria. The obtained results indicate that the VS-RF model is a powerful tool for classification of RSV inhibitors, producing the highest overall accuracy of 94.34% for the external prediction set, which significantly outperforms the other four methods with the average accuracy of 80.66%. The proposed model with excellent prediction capacity from internal to external quality should be important for screening and optimization of potential RSV inhibitors prior to chemical synthesis in drug development. |
format | Text |
id | pubmed-3083704 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Molecular Diversity Preservation International (MDPI) |
record_format | MEDLINE/PubMed |
spelling | pubmed-30837042011-05-03 A Classification Study of Respiratory Syncytial Virus (RSV) Inhibitors by Variable Selection with Random Forest Hao, Ming Li, Yan Wang, Yonghua Zhang, Shuwei Int J Mol Sci Article Experimental pEC(50)s for 216 selective respiratory syncytial virus (RSV) inhibitors are used to develop classification models as a potential screening tool for a large library of target compounds. Variable selection algorithm coupled with random forests (VS-RF) is used to extract the physicochemical features most relevant to the RSV inhibition. Based on the selected small set of descriptors, four other widely used approaches, i.e., support vector machine (SVM), Gaussian process (GP), linear discriminant analysis (LDA) and k nearest neighbors (kNN) routines are also employed and compared with the VS-RF method in terms of several of rigorous evaluation criteria. The obtained results indicate that the VS-RF model is a powerful tool for classification of RSV inhibitors, producing the highest overall accuracy of 94.34% for the external prediction set, which significantly outperforms the other four methods with the average accuracy of 80.66%. The proposed model with excellent prediction capacity from internal to external quality should be important for screening and optimization of potential RSV inhibitors prior to chemical synthesis in drug development. Molecular Diversity Preservation International (MDPI) 2011-02-21 /pmc/articles/PMC3083704/ /pubmed/21541057 http://dx.doi.org/10.3390/ijms12021259 Text en © 2011 by the authors; licensee MDPI, Basel, Switzerland. http://creativecommons.org/licenses/by/3.0 This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/). |
spellingShingle | Article Hao, Ming Li, Yan Wang, Yonghua Zhang, Shuwei A Classification Study of Respiratory Syncytial Virus (RSV) Inhibitors by Variable Selection with Random Forest |
title | A Classification Study of Respiratory Syncytial Virus (RSV) Inhibitors by Variable Selection with Random Forest |
title_full | A Classification Study of Respiratory Syncytial Virus (RSV) Inhibitors by Variable Selection with Random Forest |
title_fullStr | A Classification Study of Respiratory Syncytial Virus (RSV) Inhibitors by Variable Selection with Random Forest |
title_full_unstemmed | A Classification Study of Respiratory Syncytial Virus (RSV) Inhibitors by Variable Selection with Random Forest |
title_short | A Classification Study of Respiratory Syncytial Virus (RSV) Inhibitors by Variable Selection with Random Forest |
title_sort | classification study of respiratory syncytial virus (rsv) inhibitors by variable selection with random forest |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3083704/ https://www.ncbi.nlm.nih.gov/pubmed/21541057 http://dx.doi.org/10.3390/ijms12021259 |
work_keys_str_mv | AT haoming aclassificationstudyofrespiratorysyncytialvirusrsvinhibitorsbyvariableselectionwithrandomforest AT liyan aclassificationstudyofrespiratorysyncytialvirusrsvinhibitorsbyvariableselectionwithrandomforest AT wangyonghua aclassificationstudyofrespiratorysyncytialvirusrsvinhibitorsbyvariableselectionwithrandomforest AT zhangshuwei aclassificationstudyofrespiratorysyncytialvirusrsvinhibitorsbyvariableselectionwithrandomforest AT haoming classificationstudyofrespiratorysyncytialvirusrsvinhibitorsbyvariableselectionwithrandomforest AT liyan classificationstudyofrespiratorysyncytialvirusrsvinhibitorsbyvariableselectionwithrandomforest AT wangyonghua classificationstudyofrespiratorysyncytialvirusrsvinhibitorsbyvariableselectionwithrandomforest AT zhangshuwei classificationstudyofrespiratorysyncytialvirusrsvinhibitorsbyvariableselectionwithrandomforest |