Cargando…

Using Machine Learning Methods to Study Colorectal Cancer Tumor Micro-Environment and Its Biomarkers

Colorectal cancer (CRC) is a leading cause of cancer deaths worldwide, and the identification of biomarkers can improve early detection and personalized treatment. In this study, RNA-seq data and gene chip data from TCGA and GEO were used to explore potential biomarkers for CRC. The SMOTE method was...

Descripción completa

Detalles Bibliográficos
Autores principales: Wei, Wei, Li, Yixue, Huang, Tao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10342679/
https://www.ncbi.nlm.nih.gov/pubmed/37446311
http://dx.doi.org/10.3390/ijms241311133
_version_ 1785072557005733888
author Wei, Wei
Li, Yixue
Huang, Tao
author_facet Wei, Wei
Li, Yixue
Huang, Tao
author_sort Wei, Wei
collection PubMed
description Colorectal cancer (CRC) is a leading cause of cancer deaths worldwide, and the identification of biomarkers can improve early detection and personalized treatment. In this study, RNA-seq data and gene chip data from TCGA and GEO were used to explore potential biomarkers for CRC. The SMOTE method was used to address class imbalance, and four feature selection algorithms (MCFS, Borota, mRMR, and LightGBM) were used to select genes from the gene expression matrix. Four machine learning algorithms (SVM, XGBoost, RF, and kNN) were then employed to obtain the optimal number of genes for model construction. Through interpretable machine learning (IML), co-predictive networks were generated to identify rules and uncover underlying relationships among the selected genes. Survival analysis revealed that INHBA, FNBP1, PDE9A, HIST1H2BG, and CADM3 were significantly correlated with prognosis in CRC patients. In addition, the CIBERSORT algorithm was used to investigate the proportion of immune cells in CRC tissues, and gene mutation rates for the five selected biomarkers were explored. The biomarkers identified in this study have significant implications for the development of personalized therapies and could ultimately lead to improved clinical outcomes for CRC patients.
format Online
Article
Text
id pubmed-10342679
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103426792023-07-14 Using Machine Learning Methods to Study Colorectal Cancer Tumor Micro-Environment and Its Biomarkers Wei, Wei Li, Yixue Huang, Tao Int J Mol Sci Article Colorectal cancer (CRC) is a leading cause of cancer deaths worldwide, and the identification of biomarkers can improve early detection and personalized treatment. In this study, RNA-seq data and gene chip data from TCGA and GEO were used to explore potential biomarkers for CRC. The SMOTE method was used to address class imbalance, and four feature selection algorithms (MCFS, Borota, mRMR, and LightGBM) were used to select genes from the gene expression matrix. Four machine learning algorithms (SVM, XGBoost, RF, and kNN) were then employed to obtain the optimal number of genes for model construction. Through interpretable machine learning (IML), co-predictive networks were generated to identify rules and uncover underlying relationships among the selected genes. Survival analysis revealed that INHBA, FNBP1, PDE9A, HIST1H2BG, and CADM3 were significantly correlated with prognosis in CRC patients. In addition, the CIBERSORT algorithm was used to investigate the proportion of immune cells in CRC tissues, and gene mutation rates for the five selected biomarkers were explored. The biomarkers identified in this study have significant implications for the development of personalized therapies and could ultimately lead to improved clinical outcomes for CRC patients. MDPI 2023-07-06 /pmc/articles/PMC10342679/ /pubmed/37446311 http://dx.doi.org/10.3390/ijms241311133 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wei, Wei
Li, Yixue
Huang, Tao
Using Machine Learning Methods to Study Colorectal Cancer Tumor Micro-Environment and Its Biomarkers
title Using Machine Learning Methods to Study Colorectal Cancer Tumor Micro-Environment and Its Biomarkers
title_full Using Machine Learning Methods to Study Colorectal Cancer Tumor Micro-Environment and Its Biomarkers
title_fullStr Using Machine Learning Methods to Study Colorectal Cancer Tumor Micro-Environment and Its Biomarkers
title_full_unstemmed Using Machine Learning Methods to Study Colorectal Cancer Tumor Micro-Environment and Its Biomarkers
title_short Using Machine Learning Methods to Study Colorectal Cancer Tumor Micro-Environment and Its Biomarkers
title_sort using machine learning methods to study colorectal cancer tumor micro-environment and its biomarkers
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10342679/
https://www.ncbi.nlm.nih.gov/pubmed/37446311
http://dx.doi.org/10.3390/ijms241311133
work_keys_str_mv AT weiwei usingmachinelearningmethodstostudycolorectalcancertumormicroenvironmentanditsbiomarkers
AT liyixue usingmachinelearningmethodstostudycolorectalcancertumormicroenvironmentanditsbiomarkers
AT huangtao usingmachinelearningmethodstostudycolorectalcancertumormicroenvironmentanditsbiomarkers