Cargando…
dRFEtools: dynamic recursive feature elimination for omics
MOTIVATION: Advances in technology have generated larger omics datasets with potential applications for machine learning. In many datasets, however, cost and limited sample availability result in an excessively higher number of features as compared to observations. Moreover, biological processes are...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10471895/ https://www.ncbi.nlm.nih.gov/pubmed/37632789 http://dx.doi.org/10.1093/bioinformatics/btad513 |
_version_ | 1785099952749281280 |
---|---|
author | Benjamin, Kynon J M Katipalli, Tarun Paquola, Apuã C M |
author_facet | Benjamin, Kynon J M Katipalli, Tarun Paquola, Apuã C M |
author_sort | Benjamin, Kynon J M |
collection | PubMed |
description | MOTIVATION: Advances in technology have generated larger omics datasets with potential applications for machine learning. In many datasets, however, cost and limited sample availability result in an excessively higher number of features as compared to observations. Moreover, biological processes are associated with networks of core and peripheral genes, while traditional feature selection approaches capture only core genes. RESULTS: To overcome these limitations, we present dRFEtools that implements dynamic recursive feature elimination (RFE), reducing computational time with high accuracy compared to standard RFE, expanding dynamic RFE to regression algorithms, and outputting the subsets of features that hold predictive power with and without peripheral features. dRFEtools integrates with scikit-learn (the popular Python machine learning platform) and thus provides new opportunities for dynamic RFE in large-scale omics data while enhancing its interpretability. AVAILABILITY AND IMPLEMENTATION: dRFEtools is freely available on PyPI at https://pypi.org/project/drfetools/ or on GitHub https://github.com/LieberInstitute/dRFEtools, implemented in Python 3, and supported on Linux, Windows, and Mac OS. |
format | Online Article Text |
id | pubmed-10471895 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-104718952023-09-02 dRFEtools: dynamic recursive feature elimination for omics Benjamin, Kynon J M Katipalli, Tarun Paquola, Apuã C M Bioinformatics Applications Note MOTIVATION: Advances in technology have generated larger omics datasets with potential applications for machine learning. In many datasets, however, cost and limited sample availability result in an excessively higher number of features as compared to observations. Moreover, biological processes are associated with networks of core and peripheral genes, while traditional feature selection approaches capture only core genes. RESULTS: To overcome these limitations, we present dRFEtools that implements dynamic recursive feature elimination (RFE), reducing computational time with high accuracy compared to standard RFE, expanding dynamic RFE to regression algorithms, and outputting the subsets of features that hold predictive power with and without peripheral features. dRFEtools integrates with scikit-learn (the popular Python machine learning platform) and thus provides new opportunities for dynamic RFE in large-scale omics data while enhancing its interpretability. AVAILABILITY AND IMPLEMENTATION: dRFEtools is freely available on PyPI at https://pypi.org/project/drfetools/ or on GitHub https://github.com/LieberInstitute/dRFEtools, implemented in Python 3, and supported on Linux, Windows, and Mac OS. Oxford University Press 2023-08-26 /pmc/articles/PMC10471895/ /pubmed/37632789 http://dx.doi.org/10.1093/bioinformatics/btad513 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Note Benjamin, Kynon J M Katipalli, Tarun Paquola, Apuã C M dRFEtools: dynamic recursive feature elimination for omics |
title | dRFEtools: dynamic recursive feature elimination for omics |
title_full | dRFEtools: dynamic recursive feature elimination for omics |
title_fullStr | dRFEtools: dynamic recursive feature elimination for omics |
title_full_unstemmed | dRFEtools: dynamic recursive feature elimination for omics |
title_short | dRFEtools: dynamic recursive feature elimination for omics |
title_sort | drfetools: dynamic recursive feature elimination for omics |
topic | Applications Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10471895/ https://www.ncbi.nlm.nih.gov/pubmed/37632789 http://dx.doi.org/10.1093/bioinformatics/btad513 |
work_keys_str_mv | AT benjaminkynonjm drfetoolsdynamicrecursivefeatureeliminationforomics AT katipallitarun drfetoolsdynamicrecursivefeatureeliminationforomics AT paquolaapuacm drfetoolsdynamicrecursivefeatureeliminationforomics |