Cargando…

Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles

BACKGROUND: This study aimed to screen the feature modules and characteristic genes related to ulcerative colitis (UC) and construct a support vector machine (SVM) classifier to distinguish UC patients. METHODS: Four datasets that contained UC and control samples were obtained from the Gene Expressi...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Yingbo, Liu, Xiumin, Dong, Hongmei, Wen, Dacheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8556884/
https://www.ncbi.nlm.nih.gov/pubmed/34717557
http://dx.doi.org/10.1186/s12876-021-01940-0
_version_ 1784592263435780096
author Han, Yingbo
Liu, Xiumin
Dong, Hongmei
Wen, Dacheng
author_facet Han, Yingbo
Liu, Xiumin
Dong, Hongmei
Wen, Dacheng
author_sort Han, Yingbo
collection PubMed
description BACKGROUND: This study aimed to screen the feature modules and characteristic genes related to ulcerative colitis (UC) and construct a support vector machine (SVM) classifier to distinguish UC patients. METHODS: Four datasets that contained UC and control samples were obtained from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) with consistency were screened via the MetaDE method. The weighted gene coexpression network (WGCNA) was used to distinguish significant modules based on the four datasets. The protein–protein interaction network was established based on intersection genes. Enrichment analysis of Gene Ontology (GO) biological processes (BPs) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment were established based on DAVID. An SVM combined with recursive feature elimination was also applied to construct a disease classifier for the disease diagnosis of UC patients. The efficacy of the SVM classifier was evaluated through receiver operating characteristic curves. RESULTS: Twelve highly preserved modules were obtained using the WGCNA, and 2009 DEGs with significant consistency were selected using the MetaDE method. Sixteen significantly related GO BPs and 12 KEGG pathways were obtained, such as cytokine-cytokine receptor interaction, cell adhesion molecules, and leukocyte transendothelial migration. Subsequently, 41 genes were used to construct an SVM classifier, such as CXCL1, CCR2, IL1B, and IL1A. The area under the curve (AUC) was 0.999 in the training dataset, whereas the AUC was 0.886, 0.790, and 0.819 in the validation set (GSE65114, GSE37283, and GSE36807, respectively). CONCLUSIONS: An SVM classifier based on feature genes might correctly identify healthy people or UC patients. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12876-021-01940-0.
format Online
Article
Text
id pubmed-8556884
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-85568842021-11-01 Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles Han, Yingbo Liu, Xiumin Dong, Hongmei Wen, Dacheng BMC Gastroenterol Research Article BACKGROUND: This study aimed to screen the feature modules and characteristic genes related to ulcerative colitis (UC) and construct a support vector machine (SVM) classifier to distinguish UC patients. METHODS: Four datasets that contained UC and control samples were obtained from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) with consistency were screened via the MetaDE method. The weighted gene coexpression network (WGCNA) was used to distinguish significant modules based on the four datasets. The protein–protein interaction network was established based on intersection genes. Enrichment analysis of Gene Ontology (GO) biological processes (BPs) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment were established based on DAVID. An SVM combined with recursive feature elimination was also applied to construct a disease classifier for the disease diagnosis of UC patients. The efficacy of the SVM classifier was evaluated through receiver operating characteristic curves. RESULTS: Twelve highly preserved modules were obtained using the WGCNA, and 2009 DEGs with significant consistency were selected using the MetaDE method. Sixteen significantly related GO BPs and 12 KEGG pathways were obtained, such as cytokine-cytokine receptor interaction, cell adhesion molecules, and leukocyte transendothelial migration. Subsequently, 41 genes were used to construct an SVM classifier, such as CXCL1, CCR2, IL1B, and IL1A. The area under the curve (AUC) was 0.999 in the training dataset, whereas the AUC was 0.886, 0.790, and 0.819 in the validation set (GSE65114, GSE37283, and GSE36807, respectively). CONCLUSIONS: An SVM classifier based on feature genes might correctly identify healthy people or UC patients. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12876-021-01940-0. BioMed Central 2021-10-30 /pmc/articles/PMC8556884/ /pubmed/34717557 http://dx.doi.org/10.1186/s12876-021-01940-0 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Han, Yingbo
Liu, Xiumin
Dong, Hongmei
Wen, Dacheng
Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles
title Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles
title_full Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles
title_fullStr Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles
title_full_unstemmed Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles
title_short Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles
title_sort screening of characteristic genes in ulcerative colitis by integrating gene expression profiles
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8556884/
https://www.ncbi.nlm.nih.gov/pubmed/34717557
http://dx.doi.org/10.1186/s12876-021-01940-0
work_keys_str_mv AT hanyingbo screeningofcharacteristicgenesinulcerativecolitisbyintegratinggeneexpressionprofiles
AT liuxiumin screeningofcharacteristicgenesinulcerativecolitisbyintegratinggeneexpressionprofiles
AT donghongmei screeningofcharacteristicgenesinulcerativecolitisbyintegratinggeneexpressionprofiles
AT wendacheng screeningofcharacteristicgenesinulcerativecolitisbyintegratinggeneexpressionprofiles