Cargando…
Feature Selection in High-Dimensional Models via EBIC with Energy Distance Correlation
In this paper, the LASSO method with extended Bayesian information criteria (EBIC) for feature selection in high-dimensional models is studied. We propose the use of the energy distance correlation in place of the ordinary correlation coefficient to measure the dependence of two variables. The energ...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9857644/ https://www.ncbi.nlm.nih.gov/pubmed/36673154 http://dx.doi.org/10.3390/e25010014 |
_version_ | 1784873914987446272 |
---|---|
author | Ocloo, Isaac Xoese Chen, Hanfeng |
author_facet | Ocloo, Isaac Xoese Chen, Hanfeng |
author_sort | Ocloo, Isaac Xoese |
collection | PubMed |
description | In this paper, the LASSO method with extended Bayesian information criteria (EBIC) for feature selection in high-dimensional models is studied. We propose the use of the energy distance correlation in place of the ordinary correlation coefficient to measure the dependence of two variables. The energy distance correlation detects linear and non-linear association between two variables, unlike the ordinary correlation coefficient, which detects only linear association. EBIC is adopted as the stopping criterion. It is shown that the new method is more powerful than Luo and Chen’s method for feature selection. This is demonstrated by simulation studies and illustrated by a real-life example. It is also proved that the new algorithm is selection-consistent. |
format | Online Article Text |
id | pubmed-9857644 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-98576442023-01-21 Feature Selection in High-Dimensional Models via EBIC with Energy Distance Correlation Ocloo, Isaac Xoese Chen, Hanfeng Entropy (Basel) Article In this paper, the LASSO method with extended Bayesian information criteria (EBIC) for feature selection in high-dimensional models is studied. We propose the use of the energy distance correlation in place of the ordinary correlation coefficient to measure the dependence of two variables. The energy distance correlation detects linear and non-linear association between two variables, unlike the ordinary correlation coefficient, which detects only linear association. EBIC is adopted as the stopping criterion. It is shown that the new method is more powerful than Luo and Chen’s method for feature selection. This is demonstrated by simulation studies and illustrated by a real-life example. It is also proved that the new algorithm is selection-consistent. MDPI 2022-12-21 /pmc/articles/PMC9857644/ /pubmed/36673154 http://dx.doi.org/10.3390/e25010014 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Ocloo, Isaac Xoese Chen, Hanfeng Feature Selection in High-Dimensional Models via EBIC with Energy Distance Correlation |
title | Feature Selection in High-Dimensional Models via EBIC with Energy Distance Correlation |
title_full | Feature Selection in High-Dimensional Models via EBIC with Energy Distance Correlation |
title_fullStr | Feature Selection in High-Dimensional Models via EBIC with Energy Distance Correlation |
title_full_unstemmed | Feature Selection in High-Dimensional Models via EBIC with Energy Distance Correlation |
title_short | Feature Selection in High-Dimensional Models via EBIC with Energy Distance Correlation |
title_sort | feature selection in high-dimensional models via ebic with energy distance correlation |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9857644/ https://www.ncbi.nlm.nih.gov/pubmed/36673154 http://dx.doi.org/10.3390/e25010014 |
work_keys_str_mv | AT oclooisaacxoese featureselectioninhighdimensionalmodelsviaebicwithenergydistancecorrelation AT chenhanfeng featureselectioninhighdimensionalmodelsviaebicwithenergydistancecorrelation |