Cargando…

Genomic analyses based on pulmonary adenocarcinoma in situ reveal early lung cancer signature

BACKGROUND: Non-small cell lung cancer (NSCLC) represents more than about 80% of the lung cancer. The early stages of NSCLC can be treated with complete resection with a good prognosis. However, most cases are detected at late stage of the disease. The average survival rate of the patients with inva...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Dan, Yang, William, Zhang, Yifan, Yang, Jack Y, Guan, Renchu, Xu, Dong, Yang, Mary Qu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6245590/
https://www.ncbi.nlm.nih.gov/pubmed/30453959
http://dx.doi.org/10.1186/s12920-018-0413-3
_version_ 1783372270166605824
author Li, Dan
Yang, William
Zhang, Yifan
Yang, Jack Y
Guan, Renchu
Xu, Dong
Yang, Mary Qu
author_facet Li, Dan
Yang, William
Zhang, Yifan
Yang, Jack Y
Guan, Renchu
Xu, Dong
Yang, Mary Qu
author_sort Li, Dan
collection PubMed
description BACKGROUND: Non-small cell lung cancer (NSCLC) represents more than about 80% of the lung cancer. The early stages of NSCLC can be treated with complete resection with a good prognosis. However, most cases are detected at late stage of the disease. The average survival rate of the patients with invasive lung cancer is only about 4%. Adenocarcinoma in situ (AIS) is an intermediate subtype of lung adenocarcinoma that exhibits early stage growth patterns but can develop into invasion. METHODS: In this study, we used RNA-seq data from normal, AIS, and invasive lung cancer tissues to identify a gene module that represents the distinguishing characteristics of AIS as AIS-specific genes. Two differential expression analysis algorithms were employed to identify the AIS-specific genes. Then, the subset of the best performed AIS-specific genes for the early lung cancer prediction were selected by random forest. Finally, the performances of the early lung cancer prediction were assessed using random forest, support vector machine (SVM) and artificial neural networks (ANNs) on four independent early lung cancer datasets including one tumor-educated blood platelets (TEPs) dataset. RESULTS: Based on the differential expression analysis, 107 AIS-specific genes that consisted of 93 protein-coding genes and 14 long non-coding RNAs (lncRNAs) were identified. The significant functions associated with these genes include angiogenesis and ECM-receptor interaction, which are highly related to cancer development and contribute to the smoking-free lung cancers. Moreover, 12 of the AIS-specific lncRNAs are involved in lung cancer progression by potentially regulating the ECM-receptor interaction pathway. The feature selection by random forest identified 20 of the AIS-specific genes as early stage lung cancer signatures using the dataset obtained from The Cancer Genome Atlas (TCGA) lung adenocarcinoma samples. Of the 20 signatures, two were lncRNAs, BLACAT1 and CTD-2527I21.15 which have been reported to be associated with bladder cancer, colorectal cancer and breast cancer. In blind classification for three independent tissue sample datasets, these signature genes consistently yielded about 98% accuracy for distinguishing early stage lung cancer from normal cases. However, the prediction accuracy for the blood platelets samples was only 64.35% (sensitivity 78.1%, specificity 50.59%, and AUROC 0.747). CONCLUSIONS: The comparison of AIS with normal and invasive tumor revealed diseases-specific genes and offered new insights into the mechanism underlying AIS progression into an invasive tumor. These genes can also serve as the signatures for early diagnosis of lung cancer with high accuracy. The expression profile of gene signatures identified from tissue cancer samples yielded remarkable early cancer prediction for tissues samples, however, relatively lower accuracy for boold platelets samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12920-018-0413-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6245590
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-62455902018-11-26 Genomic analyses based on pulmonary adenocarcinoma in situ reveal early lung cancer signature Li, Dan Yang, William Zhang, Yifan Yang, Jack Y Guan, Renchu Xu, Dong Yang, Mary Qu BMC Med Genomics Research BACKGROUND: Non-small cell lung cancer (NSCLC) represents more than about 80% of the lung cancer. The early stages of NSCLC can be treated with complete resection with a good prognosis. However, most cases are detected at late stage of the disease. The average survival rate of the patients with invasive lung cancer is only about 4%. Adenocarcinoma in situ (AIS) is an intermediate subtype of lung adenocarcinoma that exhibits early stage growth patterns but can develop into invasion. METHODS: In this study, we used RNA-seq data from normal, AIS, and invasive lung cancer tissues to identify a gene module that represents the distinguishing characteristics of AIS as AIS-specific genes. Two differential expression analysis algorithms were employed to identify the AIS-specific genes. Then, the subset of the best performed AIS-specific genes for the early lung cancer prediction were selected by random forest. Finally, the performances of the early lung cancer prediction were assessed using random forest, support vector machine (SVM) and artificial neural networks (ANNs) on four independent early lung cancer datasets including one tumor-educated blood platelets (TEPs) dataset. RESULTS: Based on the differential expression analysis, 107 AIS-specific genes that consisted of 93 protein-coding genes and 14 long non-coding RNAs (lncRNAs) were identified. The significant functions associated with these genes include angiogenesis and ECM-receptor interaction, which are highly related to cancer development and contribute to the smoking-free lung cancers. Moreover, 12 of the AIS-specific lncRNAs are involved in lung cancer progression by potentially regulating the ECM-receptor interaction pathway. The feature selection by random forest identified 20 of the AIS-specific genes as early stage lung cancer signatures using the dataset obtained from The Cancer Genome Atlas (TCGA) lung adenocarcinoma samples. Of the 20 signatures, two were lncRNAs, BLACAT1 and CTD-2527I21.15 which have been reported to be associated with bladder cancer, colorectal cancer and breast cancer. In blind classification for three independent tissue sample datasets, these signature genes consistently yielded about 98% accuracy for distinguishing early stage lung cancer from normal cases. However, the prediction accuracy for the blood platelets samples was only 64.35% (sensitivity 78.1%, specificity 50.59%, and AUROC 0.747). CONCLUSIONS: The comparison of AIS with normal and invasive tumor revealed diseases-specific genes and offered new insights into the mechanism underlying AIS progression into an invasive tumor. These genes can also serve as the signatures for early diagnosis of lung cancer with high accuracy. The expression profile of gene signatures identified from tissue cancer samples yielded remarkable early cancer prediction for tissues samples, however, relatively lower accuracy for boold platelets samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12920-018-0413-3) contains supplementary material, which is available to authorized users. BioMed Central 2018-11-20 /pmc/articles/PMC6245590/ /pubmed/30453959 http://dx.doi.org/10.1186/s12920-018-0413-3 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Li, Dan
Yang, William
Zhang, Yifan
Yang, Jack Y
Guan, Renchu
Xu, Dong
Yang, Mary Qu
Genomic analyses based on pulmonary adenocarcinoma in situ reveal early lung cancer signature
title Genomic analyses based on pulmonary adenocarcinoma in situ reveal early lung cancer signature
title_full Genomic analyses based on pulmonary adenocarcinoma in situ reveal early lung cancer signature
title_fullStr Genomic analyses based on pulmonary adenocarcinoma in situ reveal early lung cancer signature
title_full_unstemmed Genomic analyses based on pulmonary adenocarcinoma in situ reveal early lung cancer signature
title_short Genomic analyses based on pulmonary adenocarcinoma in situ reveal early lung cancer signature
title_sort genomic analyses based on pulmonary adenocarcinoma in situ reveal early lung cancer signature
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6245590/
https://www.ncbi.nlm.nih.gov/pubmed/30453959
http://dx.doi.org/10.1186/s12920-018-0413-3
work_keys_str_mv AT lidan genomicanalysesbasedonpulmonaryadenocarcinomainsiturevealearlylungcancersignature
AT yangwilliam genomicanalysesbasedonpulmonaryadenocarcinomainsiturevealearlylungcancersignature
AT zhangyifan genomicanalysesbasedonpulmonaryadenocarcinomainsiturevealearlylungcancersignature
AT yangjacky genomicanalysesbasedonpulmonaryadenocarcinomainsiturevealearlylungcancersignature
AT guanrenchu genomicanalysesbasedonpulmonaryadenocarcinomainsiturevealearlylungcancersignature
AT xudong genomicanalysesbasedonpulmonaryadenocarcinomainsiturevealearlylungcancersignature
AT yangmaryqu genomicanalysesbasedonpulmonaryadenocarcinomainsiturevealearlylungcancersignature