Cargando…

EnRank: An Ensemble Method to Detect Pulmonary Hypertension Biomarkers Based on Feature Selection and Machine Learning Models

Pulmonary hypertension (PH) is a common disease that affects the normal functioning of the human pulmonary arteries. The peripheral blood mononuclear cells (PMBCs) served as an ideal source for a minimally invasive disease diagnosis. This study hypothesized that the transcriptional fluctuations in t...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Xiangju, Zhang, Yu, Fu, Chunli, Zhang, Ruochi, Zhou, Fengfeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8110930/
https://www.ncbi.nlm.nih.gov/pubmed/33986767
http://dx.doi.org/10.3389/fgene.2021.636429
_version_ 1783690396154462208
author Liu, Xiangju
Zhang, Yu
Fu, Chunli
Zhang, Ruochi
Zhou, Fengfeng
author_facet Liu, Xiangju
Zhang, Yu
Fu, Chunli
Zhang, Ruochi
Zhou, Fengfeng
author_sort Liu, Xiangju
collection PubMed
description Pulmonary hypertension (PH) is a common disease that affects the normal functioning of the human pulmonary arteries. The peripheral blood mononuclear cells (PMBCs) served as an ideal source for a minimally invasive disease diagnosis. This study hypothesized that the transcriptional fluctuations in the PMBCs exposed to the PH arteries may stably reflect the disease. However, the dimension of a human transcriptome is much higher than the number of samples in all the existing datasets. So, an ensemble feature selection algorithm, EnRank, was proposed to integrate the ranking information of four popular feature selection algorithms, i.e., T-test (Ttest), Chi-squared test (Chi2), ridge regression (Ridge), and Least Absolute Shrinkage and Selection Operator (Lasso). Our results suggested that the EnRank-detected biomarkers provided useful information from these four feature selection algorithms and achieved very good prediction accuracy in predicting the PH patients. Many of the EnRank-detected biomarkers were also supported by the literature.
format Online
Article
Text
id pubmed-8110930
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-81109302021-05-12 EnRank: An Ensemble Method to Detect Pulmonary Hypertension Biomarkers Based on Feature Selection and Machine Learning Models Liu, Xiangju Zhang, Yu Fu, Chunli Zhang, Ruochi Zhou, Fengfeng Front Genet Genetics Pulmonary hypertension (PH) is a common disease that affects the normal functioning of the human pulmonary arteries. The peripheral blood mononuclear cells (PMBCs) served as an ideal source for a minimally invasive disease diagnosis. This study hypothesized that the transcriptional fluctuations in the PMBCs exposed to the PH arteries may stably reflect the disease. However, the dimension of a human transcriptome is much higher than the number of samples in all the existing datasets. So, an ensemble feature selection algorithm, EnRank, was proposed to integrate the ranking information of four popular feature selection algorithms, i.e., T-test (Ttest), Chi-squared test (Chi2), ridge regression (Ridge), and Least Absolute Shrinkage and Selection Operator (Lasso). Our results suggested that the EnRank-detected biomarkers provided useful information from these four feature selection algorithms and achieved very good prediction accuracy in predicting the PH patients. Many of the EnRank-detected biomarkers were also supported by the literature. Frontiers Media S.A. 2021-04-27 /pmc/articles/PMC8110930/ /pubmed/33986767 http://dx.doi.org/10.3389/fgene.2021.636429 Text en Copyright © 2021 Liu, Zhang, Fu, Zhang and Zhou. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Liu, Xiangju
Zhang, Yu
Fu, Chunli
Zhang, Ruochi
Zhou, Fengfeng
EnRank: An Ensemble Method to Detect Pulmonary Hypertension Biomarkers Based on Feature Selection and Machine Learning Models
title EnRank: An Ensemble Method to Detect Pulmonary Hypertension Biomarkers Based on Feature Selection and Machine Learning Models
title_full EnRank: An Ensemble Method to Detect Pulmonary Hypertension Biomarkers Based on Feature Selection and Machine Learning Models
title_fullStr EnRank: An Ensemble Method to Detect Pulmonary Hypertension Biomarkers Based on Feature Selection and Machine Learning Models
title_full_unstemmed EnRank: An Ensemble Method to Detect Pulmonary Hypertension Biomarkers Based on Feature Selection and Machine Learning Models
title_short EnRank: An Ensemble Method to Detect Pulmonary Hypertension Biomarkers Based on Feature Selection and Machine Learning Models
title_sort enrank: an ensemble method to detect pulmonary hypertension biomarkers based on feature selection and machine learning models
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8110930/
https://www.ncbi.nlm.nih.gov/pubmed/33986767
http://dx.doi.org/10.3389/fgene.2021.636429
work_keys_str_mv AT liuxiangju enrankanensemblemethodtodetectpulmonaryhypertensionbiomarkersbasedonfeatureselectionandmachinelearningmodels
AT zhangyu enrankanensemblemethodtodetectpulmonaryhypertensionbiomarkersbasedonfeatureselectionandmachinelearningmodels
AT fuchunli enrankanensemblemethodtodetectpulmonaryhypertensionbiomarkersbasedonfeatureselectionandmachinelearningmodels
AT zhangruochi enrankanensemblemethodtodetectpulmonaryhypertensionbiomarkersbasedonfeatureselectionandmachinelearningmodels
AT zhoufengfeng enrankanensemblemethodtodetectpulmonaryhypertensionbiomarkersbasedonfeatureselectionandmachinelearningmodels