Cargando…

Feature selection strategies for drug sensitivity prediction

Drug sensitivity prediction constitutes one of the main challenges in personalized medicine. Critically, the sensitivity of cancer cells to treatment depends on an unknown subset of a large number of biological features. Here, we compare standard, data-driven feature selection approaches to feature...

Descripción completa

Detalles Bibliográficos
Autores principales: Koras, Krzysztof, Juraeva, Dilafruz, Kreis, Julian, Mazur, Johanna, Staub, Eike, Szczurek, Ewa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7287073/
https://www.ncbi.nlm.nih.gov/pubmed/32523056
http://dx.doi.org/10.1038/s41598-020-65927-9
_version_ 1783544990260002816
author Koras, Krzysztof
Juraeva, Dilafruz
Kreis, Julian
Mazur, Johanna
Staub, Eike
Szczurek, Ewa
author_facet Koras, Krzysztof
Juraeva, Dilafruz
Kreis, Julian
Mazur, Johanna
Staub, Eike
Szczurek, Ewa
author_sort Koras, Krzysztof
collection PubMed
description Drug sensitivity prediction constitutes one of the main challenges in personalized medicine. Critically, the sensitivity of cancer cells to treatment depends on an unknown subset of a large number of biological features. Here, we compare standard, data-driven feature selection approaches to feature selection driven by prior knowledge of drug targets, target pathways, and gene expression signatures. We asses these methodologies on Genomics of Drug Sensitivity in Cancer (GDSC) dataset, evaluating 2484 unique models. For 23 drugs, better predictive performance is achieved when the features are selected according to prior knowledge of drug targets and pathways. The best correlation of observed and predicted response using the test set is achieved for Linifanib (r = 0.75). Extending the drug-dependent features with gene expression signatures yields the most predictive models for 60 drugs, with the best performing example of Dabrafenib. For many compounds, even a very small subset of drug-related features is highly predictive of drug sensitivity. Small feature sets selected using prior knowledge are more predictive for drugs targeting specific genes and pathways, while models with wider feature sets perform better for drugs affecting general cellular mechanisms. Appropriate feature selection strategies facilitate the development of interpretable models that are indicative for therapy design.
format Online
Article
Text
id pubmed-7287073
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-72870732020-06-15 Feature selection strategies for drug sensitivity prediction Koras, Krzysztof Juraeva, Dilafruz Kreis, Julian Mazur, Johanna Staub, Eike Szczurek, Ewa Sci Rep Article Drug sensitivity prediction constitutes one of the main challenges in personalized medicine. Critically, the sensitivity of cancer cells to treatment depends on an unknown subset of a large number of biological features. Here, we compare standard, data-driven feature selection approaches to feature selection driven by prior knowledge of drug targets, target pathways, and gene expression signatures. We asses these methodologies on Genomics of Drug Sensitivity in Cancer (GDSC) dataset, evaluating 2484 unique models. For 23 drugs, better predictive performance is achieved when the features are selected according to prior knowledge of drug targets and pathways. The best correlation of observed and predicted response using the test set is achieved for Linifanib (r = 0.75). Extending the drug-dependent features with gene expression signatures yields the most predictive models for 60 drugs, with the best performing example of Dabrafenib. For many compounds, even a very small subset of drug-related features is highly predictive of drug sensitivity. Small feature sets selected using prior knowledge are more predictive for drugs targeting specific genes and pathways, while models with wider feature sets perform better for drugs affecting general cellular mechanisms. Appropriate feature selection strategies facilitate the development of interpretable models that are indicative for therapy design. Nature Publishing Group UK 2020-06-10 /pmc/articles/PMC7287073/ /pubmed/32523056 http://dx.doi.org/10.1038/s41598-020-65927-9 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Koras, Krzysztof
Juraeva, Dilafruz
Kreis, Julian
Mazur, Johanna
Staub, Eike
Szczurek, Ewa
Feature selection strategies for drug sensitivity prediction
title Feature selection strategies for drug sensitivity prediction
title_full Feature selection strategies for drug sensitivity prediction
title_fullStr Feature selection strategies for drug sensitivity prediction
title_full_unstemmed Feature selection strategies for drug sensitivity prediction
title_short Feature selection strategies for drug sensitivity prediction
title_sort feature selection strategies for drug sensitivity prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7287073/
https://www.ncbi.nlm.nih.gov/pubmed/32523056
http://dx.doi.org/10.1038/s41598-020-65927-9
work_keys_str_mv AT koraskrzysztof featureselectionstrategiesfordrugsensitivityprediction
AT juraevadilafruz featureselectionstrategiesfordrugsensitivityprediction
AT kreisjulian featureselectionstrategiesfordrugsensitivityprediction
AT mazurjohanna featureselectionstrategiesfordrugsensitivityprediction
AT staubeike featureselectionstrategiesfordrugsensitivityprediction
AT szczurekewa featureselectionstrategiesfordrugsensitivityprediction