Cargando…

Zoo: Selecting Transcriptomic and Methylomic Biomarkers by Ensembling Animal-Inspired Swarm Intelligence Feature Selection Algorithms

Biological omics data such as transcriptomes and methylomes have the inherent “large p small n” paradigm, i.e., the number of features is much larger than that of the samples. A feature selection (FS) algorithm selects a subset of the transcriptomic or methylomic biomarkers in order to build a bette...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Yuanyuan, Huang, Lan, Zhou, Fengfeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8621246/
https://www.ncbi.nlm.nih.gov/pubmed/34828418
http://dx.doi.org/10.3390/genes12111814
_version_ 1784605411655024640
author Han, Yuanyuan
Huang, Lan
Zhou, Fengfeng
author_facet Han, Yuanyuan
Huang, Lan
Zhou, Fengfeng
author_sort Han, Yuanyuan
collection PubMed
description Biological omics data such as transcriptomes and methylomes have the inherent “large p small n” paradigm, i.e., the number of features is much larger than that of the samples. A feature selection (FS) algorithm selects a subset of the transcriptomic or methylomic biomarkers in order to build a better prediction model. The hidden patterns in the FS solution space make it challenging to achieve a feature subset with satisfying prediction performances. Swarm intelligence (SI) algorithms mimic the target searching behaviors of various animals and have demonstrated promising capabilities in selecting features with good machine learning performances. Our study revealed that different SI-based feature selection algorithms contributed complementary searching capabilities in the FS solution space, and their collaboration generated a better feature subset than the individual SI feature selection algorithms. Nine SI-based feature selection algorithms were integrated to vote for the selected features, which were further refined by the dynamic recursive feature elimination framework. In most cases, the proposed Zoo algorithm outperformed the existing feature selection algorithms on transcriptomics and methylomics datasets.
format Online
Article
Text
id pubmed-8621246
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86212462021-11-27 Zoo: Selecting Transcriptomic and Methylomic Biomarkers by Ensembling Animal-Inspired Swarm Intelligence Feature Selection Algorithms Han, Yuanyuan Huang, Lan Zhou, Fengfeng Genes (Basel) Technical Note Biological omics data such as transcriptomes and methylomes have the inherent “large p small n” paradigm, i.e., the number of features is much larger than that of the samples. A feature selection (FS) algorithm selects a subset of the transcriptomic or methylomic biomarkers in order to build a better prediction model. The hidden patterns in the FS solution space make it challenging to achieve a feature subset with satisfying prediction performances. Swarm intelligence (SI) algorithms mimic the target searching behaviors of various animals and have demonstrated promising capabilities in selecting features with good machine learning performances. Our study revealed that different SI-based feature selection algorithms contributed complementary searching capabilities in the FS solution space, and their collaboration generated a better feature subset than the individual SI feature selection algorithms. Nine SI-based feature selection algorithms were integrated to vote for the selected features, which were further refined by the dynamic recursive feature elimination framework. In most cases, the proposed Zoo algorithm outperformed the existing feature selection algorithms on transcriptomics and methylomics datasets. MDPI 2021-11-18 /pmc/articles/PMC8621246/ /pubmed/34828418 http://dx.doi.org/10.3390/genes12111814 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Technical Note
Han, Yuanyuan
Huang, Lan
Zhou, Fengfeng
Zoo: Selecting Transcriptomic and Methylomic Biomarkers by Ensembling Animal-Inspired Swarm Intelligence Feature Selection Algorithms
title Zoo: Selecting Transcriptomic and Methylomic Biomarkers by Ensembling Animal-Inspired Swarm Intelligence Feature Selection Algorithms
title_full Zoo: Selecting Transcriptomic and Methylomic Biomarkers by Ensembling Animal-Inspired Swarm Intelligence Feature Selection Algorithms
title_fullStr Zoo: Selecting Transcriptomic and Methylomic Biomarkers by Ensembling Animal-Inspired Swarm Intelligence Feature Selection Algorithms
title_full_unstemmed Zoo: Selecting Transcriptomic and Methylomic Biomarkers by Ensembling Animal-Inspired Swarm Intelligence Feature Selection Algorithms
title_short Zoo: Selecting Transcriptomic and Methylomic Biomarkers by Ensembling Animal-Inspired Swarm Intelligence Feature Selection Algorithms
title_sort zoo: selecting transcriptomic and methylomic biomarkers by ensembling animal-inspired swarm intelligence feature selection algorithms
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8621246/
https://www.ncbi.nlm.nih.gov/pubmed/34828418
http://dx.doi.org/10.3390/genes12111814
work_keys_str_mv AT hanyuanyuan zooselectingtranscriptomicandmethylomicbiomarkersbyensemblinganimalinspiredswarmintelligencefeatureselectionalgorithms
AT huanglan zooselectingtranscriptomicandmethylomicbiomarkersbyensemblinganimalinspiredswarmintelligencefeatureselectionalgorithms
AT zhoufengfeng zooselectingtranscriptomicandmethylomicbiomarkersbyensemblinganimalinspiredswarmintelligencefeatureselectionalgorithms