Cargando…

BOSO: A novel feature selection algorithm for linear regression with high-dimensional data

With the frenetic growth of high-dimensional datasets in different biomedical domains, there is an urgent need to develop predictive methods able to deal with this complexity. Feature selection is a relevant strategy in machine learning to address this challenge. We introduce a novel feature selecti...

Descripción completa

Detalles Bibliográficos
Autores principales: Valcárcel, Luis V., San José-Enériz, Edurne, Cendoya, Xabier, Rubio, Ángel, Agirre, Xabier, Prósper, Felipe, Planes, Francisco J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9187084/
https://www.ncbi.nlm.nih.gov/pubmed/35639775
http://dx.doi.org/10.1371/journal.pcbi.1010180
_version_ 1784725091066576896
author Valcárcel, Luis V.
San José-Enériz, Edurne
Cendoya, Xabier
Rubio, Ángel
Agirre, Xabier
Prósper, Felipe
Planes, Francisco J.
author_facet Valcárcel, Luis V.
San José-Enériz, Edurne
Cendoya, Xabier
Rubio, Ángel
Agirre, Xabier
Prósper, Felipe
Planes, Francisco J.
author_sort Valcárcel, Luis V.
collection PubMed
description With the frenetic growth of high-dimensional datasets in different biomedical domains, there is an urgent need to develop predictive methods able to deal with this complexity. Feature selection is a relevant strategy in machine learning to address this challenge. We introduce a novel feature selection algorithm for linear regression called BOSO (Bilevel Optimization Selector Operator). We conducted a benchmark of BOSO with key algorithms in the literature, finding a superior accuracy for feature selection in high-dimensional datasets. Proof-of-concept of BOSO for predicting drug sensitivity in cancer is presented. A detailed analysis is carried out for methotrexate, a well-studied drug targeting cancer metabolism.
format Online
Article
Text
id pubmed-9187084
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-91870842022-06-11 BOSO: A novel feature selection algorithm for linear regression with high-dimensional data Valcárcel, Luis V. San José-Enériz, Edurne Cendoya, Xabier Rubio, Ángel Agirre, Xabier Prósper, Felipe Planes, Francisco J. PLoS Comput Biol Research Article With the frenetic growth of high-dimensional datasets in different biomedical domains, there is an urgent need to develop predictive methods able to deal with this complexity. Feature selection is a relevant strategy in machine learning to address this challenge. We introduce a novel feature selection algorithm for linear regression called BOSO (Bilevel Optimization Selector Operator). We conducted a benchmark of BOSO with key algorithms in the literature, finding a superior accuracy for feature selection in high-dimensional datasets. Proof-of-concept of BOSO for predicting drug sensitivity in cancer is presented. A detailed analysis is carried out for methotrexate, a well-studied drug targeting cancer metabolism. Public Library of Science 2022-05-31 /pmc/articles/PMC9187084/ /pubmed/35639775 http://dx.doi.org/10.1371/journal.pcbi.1010180 Text en © 2022 Valcárcel et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Valcárcel, Luis V.
San José-Enériz, Edurne
Cendoya, Xabier
Rubio, Ángel
Agirre, Xabier
Prósper, Felipe
Planes, Francisco J.
BOSO: A novel feature selection algorithm for linear regression with high-dimensional data
title BOSO: A novel feature selection algorithm for linear regression with high-dimensional data
title_full BOSO: A novel feature selection algorithm for linear regression with high-dimensional data
title_fullStr BOSO: A novel feature selection algorithm for linear regression with high-dimensional data
title_full_unstemmed BOSO: A novel feature selection algorithm for linear regression with high-dimensional data
title_short BOSO: A novel feature selection algorithm for linear regression with high-dimensional data
title_sort boso: a novel feature selection algorithm for linear regression with high-dimensional data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9187084/
https://www.ncbi.nlm.nih.gov/pubmed/35639775
http://dx.doi.org/10.1371/journal.pcbi.1010180
work_keys_str_mv AT valcarcelluisv bosoanovelfeatureselectionalgorithmforlinearregressionwithhighdimensionaldata
AT sanjoseenerizedurne bosoanovelfeatureselectionalgorithmforlinearregressionwithhighdimensionaldata
AT cendoyaxabier bosoanovelfeatureselectionalgorithmforlinearregressionwithhighdimensionaldata
AT rubioangel bosoanovelfeatureselectionalgorithmforlinearregressionwithhighdimensionaldata
AT agirrexabier bosoanovelfeatureselectionalgorithmforlinearregressionwithhighdimensionaldata
AT prosperfelipe bosoanovelfeatureselectionalgorithmforlinearregressionwithhighdimensionaldata
AT planesfranciscoj bosoanovelfeatureselectionalgorithmforlinearregressionwithhighdimensionaldata