Cargando…

A large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies

BACKGROUND: Predicting lung adenocarcinoma (LUAD) risk is crucial in determining further treatment strategies. Molecular biomarkers may improve risk stratification for LUAD. METHODS: We analyzed the gene expression profiles of LUAD patients from The Cancer Genome Atlas (TCGA) and Gene Expression Omn...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Yin, Ge, Di, Gu, Jie, Xu, Fengkai, Zhu, Qiaoliang, Lu, Chunlai
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2019
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6729062/ https://www.ncbi.nlm.nih.gov/pubmed/31488089 http://dx.doi.org/10.1186/s12885-019-6101-7

_version_	1783449536836927488
author	Li, Yin Ge, Di Gu, Jie Xu, Fengkai Zhu, Qiaoliang Lu, Chunlai
author_facet	Li, Yin Ge, Di Gu, Jie Xu, Fengkai Zhu, Qiaoliang Lu, Chunlai
author_sort	Li, Yin
collection	PubMed
description	BACKGROUND: Predicting lung adenocarcinoma (LUAD) risk is crucial in determining further treatment strategies. Molecular biomarkers may improve risk stratification for LUAD. METHODS: We analyzed the gene expression profiles of LUAD patients from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO). We initially used three distinct algorithms (sigFeature, random forest, and univariate Cox regression) to evaluate each gene’s prognostic relevance. Survival related genes were then fitted into the least absolute shrinkage and selection operator (LASSO) model to build a risk prediction model for LUAD. After 100,000 times of calculation and model construction, a 16-gene-based prediction model capable of classifying LUAD patients into high-risk and low-risk groups was successfully built. RESULTS: Using a combined strategy, we initially identified 2472 significant survival-related genes. Functional enrichment analysis demonstrated these genes’ relevance to tumor initiation and progression. Using the LASSO method, we successfully built a reliable risk prediction model. The risk model was validated in two external sets and an independent set. The expression of these 16 genes was highly correlated with patients’ risk. High-risk group patients witnessed poorer recurrence-free survival (RFS) and overall survival (OS) compared to low-risk group patients. Moreover, stratification analysis and decision curve analysis (DCA) confirmed the independence and potential translational value of this predictive tool. We also built a nomogram comprising risk model and stage to predict OS for LUAD patients. CONCLUSIONS: Our risk model may serve as a practical and reliable prognosis predictive tool for LUAD and could provide novel insights into the understanding of the molecular mechanism of this disease. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12885-019-6101-7) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-6729062
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-67290622019-09-12 A large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies Li, Yin Ge, Di Gu, Jie Xu, Fengkai Zhu, Qiaoliang Lu, Chunlai BMC Cancer Research Article BACKGROUND: Predicting lung adenocarcinoma (LUAD) risk is crucial in determining further treatment strategies. Molecular biomarkers may improve risk stratification for LUAD. METHODS: We analyzed the gene expression profiles of LUAD patients from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO). We initially used three distinct algorithms (sigFeature, random forest, and univariate Cox regression) to evaluate each gene’s prognostic relevance. Survival related genes were then fitted into the least absolute shrinkage and selection operator (LASSO) model to build a risk prediction model for LUAD. After 100,000 times of calculation and model construction, a 16-gene-based prediction model capable of classifying LUAD patients into high-risk and low-risk groups was successfully built. RESULTS: Using a combined strategy, we initially identified 2472 significant survival-related genes. Functional enrichment analysis demonstrated these genes’ relevance to tumor initiation and progression. Using the LASSO method, we successfully built a reliable risk prediction model. The risk model was validated in two external sets and an independent set. The expression of these 16 genes was highly correlated with patients’ risk. High-risk group patients witnessed poorer recurrence-free survival (RFS) and overall survival (OS) compared to low-risk group patients. Moreover, stratification analysis and decision curve analysis (DCA) confirmed the independence and potential translational value of this predictive tool. We also built a nomogram comprising risk model and stage to predict OS for LUAD patients. CONCLUSIONS: Our risk model may serve as a practical and reliable prognosis predictive tool for LUAD and could provide novel insights into the understanding of the molecular mechanism of this disease. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12885-019-6101-7) contains supplementary material, which is available to authorized users. BioMed Central 2019-09-05 /pmc/articles/PMC6729062/ /pubmed/31488089 http://dx.doi.org/10.1186/s12885-019-6101-7 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article Li, Yin Ge, Di Gu, Jie Xu, Fengkai Zhu, Qiaoliang Lu, Chunlai A large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies
title	A large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies
title_full	A large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies
title_fullStr	A large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies
title_full_unstemmed	A large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies
title_short	A large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies
title_sort	large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6729062/ https://www.ncbi.nlm.nih.gov/pubmed/31488089 http://dx.doi.org/10.1186/s12885-019-6101-7
work_keys_str_mv	AT liyin alargecohortstudyidentifyinganovelprognosispredictionmodelforlungadenocarcinomathroughmachinelearningstrategies AT gedi alargecohortstudyidentifyinganovelprognosispredictionmodelforlungadenocarcinomathroughmachinelearningstrategies AT gujie alargecohortstudyidentifyinganovelprognosispredictionmodelforlungadenocarcinomathroughmachinelearningstrategies AT xufengkai alargecohortstudyidentifyinganovelprognosispredictionmodelforlungadenocarcinomathroughmachinelearningstrategies AT zhuqiaoliang alargecohortstudyidentifyinganovelprognosispredictionmodelforlungadenocarcinomathroughmachinelearningstrategies AT luchunlai alargecohortstudyidentifyinganovelprognosispredictionmodelforlungadenocarcinomathroughmachinelearningstrategies AT liyin largecohortstudyidentifyinganovelprognosispredictionmodelforlungadenocarcinomathroughmachinelearningstrategies AT gedi largecohortstudyidentifyinganovelprognosispredictionmodelforlungadenocarcinomathroughmachinelearningstrategies AT gujie largecohortstudyidentifyinganovelprognosispredictionmodelforlungadenocarcinomathroughmachinelearningstrategies AT xufengkai largecohortstudyidentifyinganovelprognosispredictionmodelforlungadenocarcinomathroughmachinelearningstrategies AT zhuqiaoliang largecohortstudyidentifyinganovelprognosispredictionmodelforlungadenocarcinomathroughmachinelearningstrategies AT luchunlai largecohortstudyidentifyinganovelprognosispredictionmodelforlungadenocarcinomathroughmachinelearningstrategies

A large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies

Ejemplares similares