Cargando…
Applying machine learning algorithms to develop a survival prediction model for lung adenocarcinoma based on genes related to fatty acid metabolism
Background: The progression of lung adenocarcinoma (LUAD) may be related to abnormal fatty acid metabolism (FAM). The present study investigated the relationship between FAM-related genes and LUAD prognosis. Methods: LUAD samples from The Cancer Genome Atlas were collected. The scores of FAM-associa...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10619909/ https://www.ncbi.nlm.nih.gov/pubmed/37920207 http://dx.doi.org/10.3389/fphar.2023.1260742 |
_version_ | 1785130093783285760 |
---|---|
author | Cong, Dan Zhao, Yanan Zhang, Wenlong Li, Jun Bai, Yuansong |
author_facet | Cong, Dan Zhao, Yanan Zhang, Wenlong Li, Jun Bai, Yuansong |
author_sort | Cong, Dan |
collection | PubMed |
description | Background: The progression of lung adenocarcinoma (LUAD) may be related to abnormal fatty acid metabolism (FAM). The present study investigated the relationship between FAM-related genes and LUAD prognosis. Methods: LUAD samples from The Cancer Genome Atlas were collected. The scores of FAM-associated pathways from the Kyoto Encyclopedia of Genes and Genomes website were calculated using the single sample gene set enrichment analysis. ConsensusClusterPlus and cumulative distribution function were used to classify molecular subtypes for LUAD. Key genes were obtained using limma package, Cox regression analysis, and six machine learning algorithms (GBM, LASSO, XGBoost, SVM, random forest, and decision trees), and a RiskScore model was established. According to the RiskScore model and clinical features, a nomogram was developed and evaluated for its prediction performance using a calibration curve. Differences in immune abnormalities among patients with different subtypes and RiskScores were analyzed by the Estimation of STromal and Immune cells in MAlignant Tumours using Expression data, CIBERSORT, and single sample gene set enrichment analysis. Patients’ drug sensitivity was predicted by the pRRophetic package in R language. Results: LUAD samples had lower scores of FAM-related pathways. Three molecular subtypes (C1, C2, and C3) were defined. Analysis on differential prognosis showed that the C1 subtype had the most favorable prognosis, followed by the C2 subtype, and the C3 subtype had the worst prognosis. The C3 subtype had lower immune infiltration. A total of 12 key genes (SLC2A1, PKP2, FAM83A, TCN1, MS4A1, CLIC6, UBE2S, RRM2, CDC45, IGF2BP1, ANGPTL4, and CD109) were screened and used to develop a RiskScore model. Survival chance of patients in the high-RiskScore group was significantly lower. The low-RiskScore group showed higher immune score and higher expression of most immune checkpoint genes. Patients with a high RiskScore were more likely to benefit from the six anticancer drugs we screened in this study. Conclusion: We developed a RiskScore model using FAM-related genes to help predict LUAD prognosis and develop new targeted drugs. |
format | Online Article Text |
id | pubmed-10619909 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-106199092023-11-02 Applying machine learning algorithms to develop a survival prediction model for lung adenocarcinoma based on genes related to fatty acid metabolism Cong, Dan Zhao, Yanan Zhang, Wenlong Li, Jun Bai, Yuansong Front Pharmacol Pharmacology Background: The progression of lung adenocarcinoma (LUAD) may be related to abnormal fatty acid metabolism (FAM). The present study investigated the relationship between FAM-related genes and LUAD prognosis. Methods: LUAD samples from The Cancer Genome Atlas were collected. The scores of FAM-associated pathways from the Kyoto Encyclopedia of Genes and Genomes website were calculated using the single sample gene set enrichment analysis. ConsensusClusterPlus and cumulative distribution function were used to classify molecular subtypes for LUAD. Key genes were obtained using limma package, Cox regression analysis, and six machine learning algorithms (GBM, LASSO, XGBoost, SVM, random forest, and decision trees), and a RiskScore model was established. According to the RiskScore model and clinical features, a nomogram was developed and evaluated for its prediction performance using a calibration curve. Differences in immune abnormalities among patients with different subtypes and RiskScores were analyzed by the Estimation of STromal and Immune cells in MAlignant Tumours using Expression data, CIBERSORT, and single sample gene set enrichment analysis. Patients’ drug sensitivity was predicted by the pRRophetic package in R language. Results: LUAD samples had lower scores of FAM-related pathways. Three molecular subtypes (C1, C2, and C3) were defined. Analysis on differential prognosis showed that the C1 subtype had the most favorable prognosis, followed by the C2 subtype, and the C3 subtype had the worst prognosis. The C3 subtype had lower immune infiltration. A total of 12 key genes (SLC2A1, PKP2, FAM83A, TCN1, MS4A1, CLIC6, UBE2S, RRM2, CDC45, IGF2BP1, ANGPTL4, and CD109) were screened and used to develop a RiskScore model. Survival chance of patients in the high-RiskScore group was significantly lower. The low-RiskScore group showed higher immune score and higher expression of most immune checkpoint genes. Patients with a high RiskScore were more likely to benefit from the six anticancer drugs we screened in this study. Conclusion: We developed a RiskScore model using FAM-related genes to help predict LUAD prognosis and develop new targeted drugs. Frontiers Media S.A. 2023-10-17 /pmc/articles/PMC10619909/ /pubmed/37920207 http://dx.doi.org/10.3389/fphar.2023.1260742 Text en Copyright © 2023 Cong, Zhao, Zhang, Li and Bai. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Pharmacology Cong, Dan Zhao, Yanan Zhang, Wenlong Li, Jun Bai, Yuansong Applying machine learning algorithms to develop a survival prediction model for lung adenocarcinoma based on genes related to fatty acid metabolism |
title | Applying machine learning algorithms to develop a survival prediction model for lung adenocarcinoma based on genes related to fatty acid metabolism |
title_full | Applying machine learning algorithms to develop a survival prediction model for lung adenocarcinoma based on genes related to fatty acid metabolism |
title_fullStr | Applying machine learning algorithms to develop a survival prediction model for lung adenocarcinoma based on genes related to fatty acid metabolism |
title_full_unstemmed | Applying machine learning algorithms to develop a survival prediction model for lung adenocarcinoma based on genes related to fatty acid metabolism |
title_short | Applying machine learning algorithms to develop a survival prediction model for lung adenocarcinoma based on genes related to fatty acid metabolism |
title_sort | applying machine learning algorithms to develop a survival prediction model for lung adenocarcinoma based on genes related to fatty acid metabolism |
topic | Pharmacology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10619909/ https://www.ncbi.nlm.nih.gov/pubmed/37920207 http://dx.doi.org/10.3389/fphar.2023.1260742 |
work_keys_str_mv | AT congdan applyingmachinelearningalgorithmstodevelopasurvivalpredictionmodelforlungadenocarcinomabasedongenesrelatedtofattyacidmetabolism AT zhaoyanan applyingmachinelearningalgorithmstodevelopasurvivalpredictionmodelforlungadenocarcinomabasedongenesrelatedtofattyacidmetabolism AT zhangwenlong applyingmachinelearningalgorithmstodevelopasurvivalpredictionmodelforlungadenocarcinomabasedongenesrelatedtofattyacidmetabolism AT lijun applyingmachinelearningalgorithmstodevelopasurvivalpredictionmodelforlungadenocarcinomabasedongenesrelatedtofattyacidmetabolism AT baiyuansong applyingmachinelearningalgorithmstodevelopasurvivalpredictionmodelforlungadenocarcinomabasedongenesrelatedtofattyacidmetabolism |