Cargando…
Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles
BACKGROUND: This study aimed to screen the feature modules and characteristic genes related to ulcerative colitis (UC) and construct a support vector machine (SVM) classifier to distinguish UC patients. METHODS: Four datasets that contained UC and control samples were obtained from the Gene Expressi...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8556884/ https://www.ncbi.nlm.nih.gov/pubmed/34717557 http://dx.doi.org/10.1186/s12876-021-01940-0 |
_version_ | 1784592263435780096 |
---|---|
author | Han, Yingbo Liu, Xiumin Dong, Hongmei Wen, Dacheng |
author_facet | Han, Yingbo Liu, Xiumin Dong, Hongmei Wen, Dacheng |
author_sort | Han, Yingbo |
collection | PubMed |
description | BACKGROUND: This study aimed to screen the feature modules and characteristic genes related to ulcerative colitis (UC) and construct a support vector machine (SVM) classifier to distinguish UC patients. METHODS: Four datasets that contained UC and control samples were obtained from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) with consistency were screened via the MetaDE method. The weighted gene coexpression network (WGCNA) was used to distinguish significant modules based on the four datasets. The protein–protein interaction network was established based on intersection genes. Enrichment analysis of Gene Ontology (GO) biological processes (BPs) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment were established based on DAVID. An SVM combined with recursive feature elimination was also applied to construct a disease classifier for the disease diagnosis of UC patients. The efficacy of the SVM classifier was evaluated through receiver operating characteristic curves. RESULTS: Twelve highly preserved modules were obtained using the WGCNA, and 2009 DEGs with significant consistency were selected using the MetaDE method. Sixteen significantly related GO BPs and 12 KEGG pathways were obtained, such as cytokine-cytokine receptor interaction, cell adhesion molecules, and leukocyte transendothelial migration. Subsequently, 41 genes were used to construct an SVM classifier, such as CXCL1, CCR2, IL1B, and IL1A. The area under the curve (AUC) was 0.999 in the training dataset, whereas the AUC was 0.886, 0.790, and 0.819 in the validation set (GSE65114, GSE37283, and GSE36807, respectively). CONCLUSIONS: An SVM classifier based on feature genes might correctly identify healthy people or UC patients. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12876-021-01940-0. |
format | Online Article Text |
id | pubmed-8556884 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-85568842021-11-01 Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles Han, Yingbo Liu, Xiumin Dong, Hongmei Wen, Dacheng BMC Gastroenterol Research Article BACKGROUND: This study aimed to screen the feature modules and characteristic genes related to ulcerative colitis (UC) and construct a support vector machine (SVM) classifier to distinguish UC patients. METHODS: Four datasets that contained UC and control samples were obtained from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) with consistency were screened via the MetaDE method. The weighted gene coexpression network (WGCNA) was used to distinguish significant modules based on the four datasets. The protein–protein interaction network was established based on intersection genes. Enrichment analysis of Gene Ontology (GO) biological processes (BPs) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment were established based on DAVID. An SVM combined with recursive feature elimination was also applied to construct a disease classifier for the disease diagnosis of UC patients. The efficacy of the SVM classifier was evaluated through receiver operating characteristic curves. RESULTS: Twelve highly preserved modules were obtained using the WGCNA, and 2009 DEGs with significant consistency were selected using the MetaDE method. Sixteen significantly related GO BPs and 12 KEGG pathways were obtained, such as cytokine-cytokine receptor interaction, cell adhesion molecules, and leukocyte transendothelial migration. Subsequently, 41 genes were used to construct an SVM classifier, such as CXCL1, CCR2, IL1B, and IL1A. The area under the curve (AUC) was 0.999 in the training dataset, whereas the AUC was 0.886, 0.790, and 0.819 in the validation set (GSE65114, GSE37283, and GSE36807, respectively). CONCLUSIONS: An SVM classifier based on feature genes might correctly identify healthy people or UC patients. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12876-021-01940-0. BioMed Central 2021-10-30 /pmc/articles/PMC8556884/ /pubmed/34717557 http://dx.doi.org/10.1186/s12876-021-01940-0 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Han, Yingbo Liu, Xiumin Dong, Hongmei Wen, Dacheng Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles |
title | Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles |
title_full | Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles |
title_fullStr | Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles |
title_full_unstemmed | Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles |
title_short | Screening of characteristic genes in ulcerative colitis by integrating gene expression profiles |
title_sort | screening of characteristic genes in ulcerative colitis by integrating gene expression profiles |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8556884/ https://www.ncbi.nlm.nih.gov/pubmed/34717557 http://dx.doi.org/10.1186/s12876-021-01940-0 |
work_keys_str_mv | AT hanyingbo screeningofcharacteristicgenesinulcerativecolitisbyintegratinggeneexpressionprofiles AT liuxiumin screeningofcharacteristicgenesinulcerativecolitisbyintegratinggeneexpressionprofiles AT donghongmei screeningofcharacteristicgenesinulcerativecolitisbyintegratinggeneexpressionprofiles AT wendacheng screeningofcharacteristicgenesinulcerativecolitisbyintegratinggeneexpressionprofiles |