Cargando…

Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery

Quantitative structure–activity relationship modeling using machine learning techniques constitutes a complex computational problem, where the identification of the most informative molecular descriptors for predicting a specific target property plays a critical role. Two main general approaches can...

Descripción completa

Detalles Bibliográficos
Autores principales: Ponzoni, Ignacio, Sebastián-Pérez, Víctor, Requena-Triguero, Carlos, Roca, Carlos, Martínez, María J., Cravero, Fiorella, Díaz, Mónica F., Páez, Juan A., Arrayás, Ramón Gómez, Adrio, Javier, Campillo, Nuria E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5445096/
https://www.ncbi.nlm.nih.gov/pubmed/28546583
http://dx.doi.org/10.1038/s41598-017-02114-3
_version_ 1783238810532839424
author Ponzoni, Ignacio
Sebastián-Pérez, Víctor
Requena-Triguero, Carlos
Roca, Carlos
Martínez, María J.
Cravero, Fiorella
Díaz, Mónica F.
Páez, Juan A.
Arrayás, Ramón Gómez
Adrio, Javier
Campillo, Nuria E.
author_facet Ponzoni, Ignacio
Sebastián-Pérez, Víctor
Requena-Triguero, Carlos
Roca, Carlos
Martínez, María J.
Cravero, Fiorella
Díaz, Mónica F.
Páez, Juan A.
Arrayás, Ramón Gómez
Adrio, Javier
Campillo, Nuria E.
author_sort Ponzoni, Ignacio
collection PubMed
description Quantitative structure–activity relationship modeling using machine learning techniques constitutes a complex computational problem, where the identification of the most informative molecular descriptors for predicting a specific target property plays a critical role. Two main general approaches can be used for this modeling procedure: feature selection and feature learning. In this paper, a performance comparative study of two state-of-art methods related to these two approaches is carried out. In particular, regression and classification models for three different issues are inferred using both methods under different experimental scenarios: two drug-like properties, such as blood-brain-barrier and human intestinal absorption, and enantiomeric excess, as a measurement of purity used for chiral substances. Beyond the contrastive analysis of feature selection and feature learning methods as competitive approaches, the hybridization of these strategies is also evaluated based on previous results obtained in material sciences. From the experimental results, it can be concluded that there is not a clear winner between both approaches because the performance depends on the characteristics of the compound databases used for modeling. Nevertheless, in several cases, it was observed that the accuracy of the models can be improved by combining both approaches when the molecular descriptor sets provided by feature selection and feature learning contain complementary information.
format Online
Article
Text
id pubmed-5445096
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-54450962017-05-30 Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery Ponzoni, Ignacio Sebastián-Pérez, Víctor Requena-Triguero, Carlos Roca, Carlos Martínez, María J. Cravero, Fiorella Díaz, Mónica F. Páez, Juan A. Arrayás, Ramón Gómez Adrio, Javier Campillo, Nuria E. Sci Rep Article Quantitative structure–activity relationship modeling using machine learning techniques constitutes a complex computational problem, where the identification of the most informative molecular descriptors for predicting a specific target property plays a critical role. Two main general approaches can be used for this modeling procedure: feature selection and feature learning. In this paper, a performance comparative study of two state-of-art methods related to these two approaches is carried out. In particular, regression and classification models for three different issues are inferred using both methods under different experimental scenarios: two drug-like properties, such as blood-brain-barrier and human intestinal absorption, and enantiomeric excess, as a measurement of purity used for chiral substances. Beyond the contrastive analysis of feature selection and feature learning methods as competitive approaches, the hybridization of these strategies is also evaluated based on previous results obtained in material sciences. From the experimental results, it can be concluded that there is not a clear winner between both approaches because the performance depends on the characteristics of the compound databases used for modeling. Nevertheless, in several cases, it was observed that the accuracy of the models can be improved by combining both approaches when the molecular descriptor sets provided by feature selection and feature learning contain complementary information. Nature Publishing Group UK 2017-05-25 /pmc/articles/PMC5445096/ /pubmed/28546583 http://dx.doi.org/10.1038/s41598-017-02114-3 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Ponzoni, Ignacio
Sebastián-Pérez, Víctor
Requena-Triguero, Carlos
Roca, Carlos
Martínez, María J.
Cravero, Fiorella
Díaz, Mónica F.
Páez, Juan A.
Arrayás, Ramón Gómez
Adrio, Javier
Campillo, Nuria E.
Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery
title Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery
title_full Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery
title_fullStr Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery
title_full_unstemmed Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery
title_short Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery
title_sort hybridizing feature selection and feature learning approaches in qsar modeling for drug discovery
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5445096/
https://www.ncbi.nlm.nih.gov/pubmed/28546583
http://dx.doi.org/10.1038/s41598-017-02114-3
work_keys_str_mv AT ponzoniignacio hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery
AT sebastianperezvictor hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery
AT requenatriguerocarlos hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery
AT rocacarlos hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery
AT martinezmariaj hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery
AT craverofiorella hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery
AT diazmonicaf hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery
AT paezjuana hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery
AT arrayasramongomez hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery
AT adriojavier hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery
AT campillonuriae hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery