Cargando…

Identification of feature genes for smoking-related lung adenocarcinoma based on gene expression profile data

This study aimed to identify the genes and pathways associated with smoking-related lung adenocarcinoma. Three lung adenocarcinoma associated datasets (GSE43458, GSE10072, and GSE50081), the subjects of which included smokers and nonsmokers, were downloaded to screen the differentially expressed fea...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Ying, Ni, Ran, Zhang, Hui, Miao, Lijun, Wang, Jing, Jia, Wenqing, Wang, Yuanyuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Dove Medical Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5153289/
https://www.ncbi.nlm.nih.gov/pubmed/27994470
http://dx.doi.org/10.2147/OTT.S114230
_version_ 1782474667413471232
author Liu, Ying
Ni, Ran
Zhang, Hui
Miao, Lijun
Wang, Jing
Jia, Wenqing
Wang, Yuanyuan
author_facet Liu, Ying
Ni, Ran
Zhang, Hui
Miao, Lijun
Wang, Jing
Jia, Wenqing
Wang, Yuanyuan
author_sort Liu, Ying
collection PubMed
description This study aimed to identify the genes and pathways associated with smoking-related lung adenocarcinoma. Three lung adenocarcinoma associated datasets (GSE43458, GSE10072, and GSE50081), the subjects of which included smokers and nonsmokers, were downloaded to screen the differentially expressed feature genes between smokers and nonsmokers. Based on the identified feature genes, we constructed the protein–protein interaction (PPI) network and optimized feature genes using closeness centrality (CC) algorithm. Then, the support vector machine (SVM) classification model was constructed based on the feature genes with higher CC values. Finally, pathway enrichment analysis of the feature genes was performed. A total of 213 down-regulated and 83 up-regulated differentially expressed genes were identified. In the constructed PPI network, the top ten nodes with higher degrees and CC values included ANK3, EPHA4, FGFR2, etc. The SVM classifier was constructed with 27 feature genes, which could accurately identify smokers and nonsmokers. Pathways enrichment analysis for the 27 feature genes revealed that they were significantly enriched in five pathways, including proteoglycans in cancer (EGFR, SDC4, SDC2, etc.), and Ras signaling pathway (FGFR2, PLA2G1B, EGFR, etc.). The 27 feature genes, such as EPHA4, FGFR2, and EGFR for SVM classifier construction and cancer-related pathways of Ras signaling pathway and proteoglycans in cancer may play key roles in the progression and development of smoking-related lung adenocarcinoma.
format Online
Article
Text
id pubmed-5153289
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Dove Medical Press
record_format MEDLINE/PubMed
spelling pubmed-51532892016-12-19 Identification of feature genes for smoking-related lung adenocarcinoma based on gene expression profile data Liu, Ying Ni, Ran Zhang, Hui Miao, Lijun Wang, Jing Jia, Wenqing Wang, Yuanyuan Onco Targets Ther Original Research This study aimed to identify the genes and pathways associated with smoking-related lung adenocarcinoma. Three lung adenocarcinoma associated datasets (GSE43458, GSE10072, and GSE50081), the subjects of which included smokers and nonsmokers, were downloaded to screen the differentially expressed feature genes between smokers and nonsmokers. Based on the identified feature genes, we constructed the protein–protein interaction (PPI) network and optimized feature genes using closeness centrality (CC) algorithm. Then, the support vector machine (SVM) classification model was constructed based on the feature genes with higher CC values. Finally, pathway enrichment analysis of the feature genes was performed. A total of 213 down-regulated and 83 up-regulated differentially expressed genes were identified. In the constructed PPI network, the top ten nodes with higher degrees and CC values included ANK3, EPHA4, FGFR2, etc. The SVM classifier was constructed with 27 feature genes, which could accurately identify smokers and nonsmokers. Pathways enrichment analysis for the 27 feature genes revealed that they were significantly enriched in five pathways, including proteoglycans in cancer (EGFR, SDC4, SDC2, etc.), and Ras signaling pathway (FGFR2, PLA2G1B, EGFR, etc.). The 27 feature genes, such as EPHA4, FGFR2, and EGFR for SVM classifier construction and cancer-related pathways of Ras signaling pathway and proteoglycans in cancer may play key roles in the progression and development of smoking-related lung adenocarcinoma. Dove Medical Press 2016-12-07 /pmc/articles/PMC5153289/ /pubmed/27994470 http://dx.doi.org/10.2147/OTT.S114230 Text en © 2016 Liu et al. This work is published and licensed by Dove Medical Press Limited The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed.
spellingShingle Original Research
Liu, Ying
Ni, Ran
Zhang, Hui
Miao, Lijun
Wang, Jing
Jia, Wenqing
Wang, Yuanyuan
Identification of feature genes for smoking-related lung adenocarcinoma based on gene expression profile data
title Identification of feature genes for smoking-related lung adenocarcinoma based on gene expression profile data
title_full Identification of feature genes for smoking-related lung adenocarcinoma based on gene expression profile data
title_fullStr Identification of feature genes for smoking-related lung adenocarcinoma based on gene expression profile data
title_full_unstemmed Identification of feature genes for smoking-related lung adenocarcinoma based on gene expression profile data
title_short Identification of feature genes for smoking-related lung adenocarcinoma based on gene expression profile data
title_sort identification of feature genes for smoking-related lung adenocarcinoma based on gene expression profile data
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5153289/
https://www.ncbi.nlm.nih.gov/pubmed/27994470
http://dx.doi.org/10.2147/OTT.S114230
work_keys_str_mv AT liuying identificationoffeaturegenesforsmokingrelatedlungadenocarcinomabasedongeneexpressionprofiledata
AT niran identificationoffeaturegenesforsmokingrelatedlungadenocarcinomabasedongeneexpressionprofiledata
AT zhanghui identificationoffeaturegenesforsmokingrelatedlungadenocarcinomabasedongeneexpressionprofiledata
AT miaolijun identificationoffeaturegenesforsmokingrelatedlungadenocarcinomabasedongeneexpressionprofiledata
AT wangjing identificationoffeaturegenesforsmokingrelatedlungadenocarcinomabasedongeneexpressionprofiledata
AT jiawenqing identificationoffeaturegenesforsmokingrelatedlungadenocarcinomabasedongeneexpressionprofiledata
AT wangyuanyuan identificationoffeaturegenesforsmokingrelatedlungadenocarcinomabasedongeneexpressionprofiledata