Cargando…

Association of specific gene mutations derived from machine learning with survival in lung adenocarcinoma

Lung cancer is the second most common cancer in the United States and the leading cause of mortality in cancer patients. Biomarkers predicting survival of patients with lung cancer have a profound effect on patient prognosis and treatment. However, predictive biomarkers for survival and their releva...

Descripción completa

Detalles Bibliográficos
Autores principales: Cho, Han-Jun, Lee, Soonchul, Ji, Young Geon, Lee, Dong Hyeon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6231670/
https://www.ncbi.nlm.nih.gov/pubmed/30419062
http://dx.doi.org/10.1371/journal.pone.0207204
_version_ 1783370275543318528
author Cho, Han-Jun
Lee, Soonchul
Ji, Young Geon
Lee, Dong Hyeon
author_facet Cho, Han-Jun
Lee, Soonchul
Ji, Young Geon
Lee, Dong Hyeon
author_sort Cho, Han-Jun
collection PubMed
description Lung cancer is the second most common cancer in the United States and the leading cause of mortality in cancer patients. Biomarkers predicting survival of patients with lung cancer have a profound effect on patient prognosis and treatment. However, predictive biomarkers for survival and their relevance for lung cancer are not been well known yet. The objective of this study was to perform machine learning with data from The Cancer Genome Atlas of patients with lung adenocarcinoma (LUAD) to find survival-specific gene mutations that could be used as survival-predicting biomarkers. To identify survival-specific mutations according to various clinical factors, four feature selection methods (information gain, chi-squared test, minimum redundancy maximum relevance, and correlation) were used. Extracted survival-specific mutations of LUAD were applied individually or as a group for Kaplan-Meier survival analysis. Mutations in MMRN2 and GMPPA were significantly associated with patient mortality while those in ZNF560 and SETX were associated with patient survival. Mutations in DNAJC2 and MMRN2 showed significant negative association with overall survival while mutations in ZNF560 showed significant positive association with overall survival. Mutations in MMRN2 showed significant negative association with disease-free survival while mutations in DRD3 and ZNF560 showed positive associated with disease-free survival. Mutations in DRD3, SETX, and ZNF560 showed significant positive association with survival in patients with LUAD while the opposite was true for mutations in DNAJC2, GMPPA, and MMRN2. These gene mutations were also found in other cohorts of LUAD, lung squamous cell carcinoma, and small cell lung cancer. In LUAD of Pan-Lung Cancer cohort, mutations in GMPPA, DNAJC2, and MMRN2 showed significant negative associations with survival of patients while mutations in DRD3 and SETX showed significant positive association with survival. In this study, machine learning was conducted to obtain information necessary to discover specific gene mutations associated with the survival of patients with LUAD. Mutations in the above six genes could predict survival rate and disease-free survival rate in patients with LUAD. Thus, they are important biomarker candidates for prognosis.
format Online
Article
Text
id pubmed-6231670
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-62316702018-11-19 Association of specific gene mutations derived from machine learning with survival in lung adenocarcinoma Cho, Han-Jun Lee, Soonchul Ji, Young Geon Lee, Dong Hyeon PLoS One Research Article Lung cancer is the second most common cancer in the United States and the leading cause of mortality in cancer patients. Biomarkers predicting survival of patients with lung cancer have a profound effect on patient prognosis and treatment. However, predictive biomarkers for survival and their relevance for lung cancer are not been well known yet. The objective of this study was to perform machine learning with data from The Cancer Genome Atlas of patients with lung adenocarcinoma (LUAD) to find survival-specific gene mutations that could be used as survival-predicting biomarkers. To identify survival-specific mutations according to various clinical factors, four feature selection methods (information gain, chi-squared test, minimum redundancy maximum relevance, and correlation) were used. Extracted survival-specific mutations of LUAD were applied individually or as a group for Kaplan-Meier survival analysis. Mutations in MMRN2 and GMPPA were significantly associated with patient mortality while those in ZNF560 and SETX were associated with patient survival. Mutations in DNAJC2 and MMRN2 showed significant negative association with overall survival while mutations in ZNF560 showed significant positive association with overall survival. Mutations in MMRN2 showed significant negative association with disease-free survival while mutations in DRD3 and ZNF560 showed positive associated with disease-free survival. Mutations in DRD3, SETX, and ZNF560 showed significant positive association with survival in patients with LUAD while the opposite was true for mutations in DNAJC2, GMPPA, and MMRN2. These gene mutations were also found in other cohorts of LUAD, lung squamous cell carcinoma, and small cell lung cancer. In LUAD of Pan-Lung Cancer cohort, mutations in GMPPA, DNAJC2, and MMRN2 showed significant negative associations with survival of patients while mutations in DRD3 and SETX showed significant positive association with survival. In this study, machine learning was conducted to obtain information necessary to discover specific gene mutations associated with the survival of patients with LUAD. Mutations in the above six genes could predict survival rate and disease-free survival rate in patients with LUAD. Thus, they are important biomarker candidates for prognosis. Public Library of Science 2018-11-12 /pmc/articles/PMC6231670/ /pubmed/30419062 http://dx.doi.org/10.1371/journal.pone.0207204 Text en © 2018 Cho et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Cho, Han-Jun
Lee, Soonchul
Ji, Young Geon
Lee, Dong Hyeon
Association of specific gene mutations derived from machine learning with survival in lung adenocarcinoma
title Association of specific gene mutations derived from machine learning with survival in lung adenocarcinoma
title_full Association of specific gene mutations derived from machine learning with survival in lung adenocarcinoma
title_fullStr Association of specific gene mutations derived from machine learning with survival in lung adenocarcinoma
title_full_unstemmed Association of specific gene mutations derived from machine learning with survival in lung adenocarcinoma
title_short Association of specific gene mutations derived from machine learning with survival in lung adenocarcinoma
title_sort association of specific gene mutations derived from machine learning with survival in lung adenocarcinoma
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6231670/
https://www.ncbi.nlm.nih.gov/pubmed/30419062
http://dx.doi.org/10.1371/journal.pone.0207204
work_keys_str_mv AT chohanjun associationofspecificgenemutationsderivedfrommachinelearningwithsurvivalinlungadenocarcinoma
AT leesoonchul associationofspecificgenemutationsderivedfrommachinelearningwithsurvivalinlungadenocarcinoma
AT jiyounggeon associationofspecificgenemutationsderivedfrommachinelearningwithsurvivalinlungadenocarcinoma
AT leedonghyeon associationofspecificgenemutationsderivedfrommachinelearningwithsurvivalinlungadenocarcinoma