Cargando…
Feature Ranking and Screening for Class-Imbalanced Metabolomics Data Based on Rank Aggregation Coupled with Re-Balance
Feature screening is an important and challenging topic in current class-imbalance learning. Most of the existing feature screening algorithms in class-imbalance learning are based on filtering techniques. However, the variable rankings obtained by various filtering techniques are generally differen...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8232202/ https://www.ncbi.nlm.nih.gov/pubmed/34198638 http://dx.doi.org/10.3390/metabo11060389 |
_version_ | 1783713586307137536 |
---|---|
author | Fu, Guang-Hui Wang, Jia-Bao Zong, Min-Jie Yi, Lun-Zhao |
author_facet | Fu, Guang-Hui Wang, Jia-Bao Zong, Min-Jie Yi, Lun-Zhao |
author_sort | Fu, Guang-Hui |
collection | PubMed |
description | Feature screening is an important and challenging topic in current class-imbalance learning. Most of the existing feature screening algorithms in class-imbalance learning are based on filtering techniques. However, the variable rankings obtained by various filtering techniques are generally different, and this inconsistency among different variable ranking methods is usually ignored in practice. To address this problem, we propose a simple strategy called rank aggregation with re-balance (RAR) for finding key variables from class-imbalanced data. RAR fuses each rank to generate a synthetic rank that takes every ranking into account. The class-imbalanced data are modified via different re-sampling procedures, and RAR is performed in this balanced situation. Five class-imbalanced real datasets and their re-balanced ones are employed to test the RAR’s performance, and RAR is compared with several popular feature screening methods. The result shows that RAR is highly competitive and almost better than single filtering screening in terms of several assessing metrics. Performing re-balanced pretreatment is hugely effective in rank aggregation when the data are class-imbalanced. |
format | Online Article Text |
id | pubmed-8232202 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-82322022021-06-26 Feature Ranking and Screening for Class-Imbalanced Metabolomics Data Based on Rank Aggregation Coupled with Re-Balance Fu, Guang-Hui Wang, Jia-Bao Zong, Min-Jie Yi, Lun-Zhao Metabolites Article Feature screening is an important and challenging topic in current class-imbalance learning. Most of the existing feature screening algorithms in class-imbalance learning are based on filtering techniques. However, the variable rankings obtained by various filtering techniques are generally different, and this inconsistency among different variable ranking methods is usually ignored in practice. To address this problem, we propose a simple strategy called rank aggregation with re-balance (RAR) for finding key variables from class-imbalanced data. RAR fuses each rank to generate a synthetic rank that takes every ranking into account. The class-imbalanced data are modified via different re-sampling procedures, and RAR is performed in this balanced situation. Five class-imbalanced real datasets and their re-balanced ones are employed to test the RAR’s performance, and RAR is compared with several popular feature screening methods. The result shows that RAR is highly competitive and almost better than single filtering screening in terms of several assessing metrics. Performing re-balanced pretreatment is hugely effective in rank aggregation when the data are class-imbalanced. MDPI 2021-06-14 /pmc/articles/PMC8232202/ /pubmed/34198638 http://dx.doi.org/10.3390/metabo11060389 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Fu, Guang-Hui Wang, Jia-Bao Zong, Min-Jie Yi, Lun-Zhao Feature Ranking and Screening for Class-Imbalanced Metabolomics Data Based on Rank Aggregation Coupled with Re-Balance |
title | Feature Ranking and Screening for Class-Imbalanced Metabolomics Data Based on Rank Aggregation Coupled with Re-Balance |
title_full | Feature Ranking and Screening for Class-Imbalanced Metabolomics Data Based on Rank Aggregation Coupled with Re-Balance |
title_fullStr | Feature Ranking and Screening for Class-Imbalanced Metabolomics Data Based on Rank Aggregation Coupled with Re-Balance |
title_full_unstemmed | Feature Ranking and Screening for Class-Imbalanced Metabolomics Data Based on Rank Aggregation Coupled with Re-Balance |
title_short | Feature Ranking and Screening for Class-Imbalanced Metabolomics Data Based on Rank Aggregation Coupled with Re-Balance |
title_sort | feature ranking and screening for class-imbalanced metabolomics data based on rank aggregation coupled with re-balance |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8232202/ https://www.ncbi.nlm.nih.gov/pubmed/34198638 http://dx.doi.org/10.3390/metabo11060389 |
work_keys_str_mv | AT fuguanghui featurerankingandscreeningforclassimbalancedmetabolomicsdatabasedonrankaggregationcoupledwithrebalance AT wangjiabao featurerankingandscreeningforclassimbalancedmetabolomicsdatabasedonrankaggregationcoupledwithrebalance AT zongminjie featurerankingandscreeningforclassimbalancedmetabolomicsdatabasedonrankaggregationcoupledwithrebalance AT yilunzhao featurerankingandscreeningforclassimbalancedmetabolomicsdatabasedonrankaggregationcoupledwithrebalance |