Cargando…

On Predicting lung cancer subtypes using ‘omic’ data from tumor and tumor-adjacent histologically-normal tissue

BACKGROUND: Adenocarcinoma (ADC) and squamous cell carcinoma (SCC) are the most prevalent histological types among lung cancers. Distinguishing between these subtypes is critically important because they have different implications for prognosis and treatment. Normally, histopathological analyses ar...

Descripción completa

Detalles Bibliográficos
Autores principales: Pineda, Arturo López, Ogoe, Henry Ato, Balasubramanian, Jeya Balaji, Rangel Escareño, Claudia, Visweswaran, Shyam, Herman, James Gordon, Gopalakrishnan, Vanathi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4778315/
https://www.ncbi.nlm.nih.gov/pubmed/26944944
http://dx.doi.org/10.1186/s12885-016-2223-3
_version_ 1782419442404163584
author Pineda, Arturo López
Ogoe, Henry Ato
Balasubramanian, Jeya Balaji
Rangel Escareño, Claudia
Visweswaran, Shyam
Herman, James Gordon
Gopalakrishnan, Vanathi
author_facet Pineda, Arturo López
Ogoe, Henry Ato
Balasubramanian, Jeya Balaji
Rangel Escareño, Claudia
Visweswaran, Shyam
Herman, James Gordon
Gopalakrishnan, Vanathi
author_sort Pineda, Arturo López
collection PubMed
description BACKGROUND: Adenocarcinoma (ADC) and squamous cell carcinoma (SCC) are the most prevalent histological types among lung cancers. Distinguishing between these subtypes is critically important because they have different implications for prognosis and treatment. Normally, histopathological analyses are used to distinguish between the two, where the tissue samples are collected based on small endoscopic samples or needle aspirations. However, the lack of cell architecture in these small tissue samples hampers the process of distinguishing between the two subtypes. Molecular profiling can also be used to discriminate between the two lung cancer subtypes, on condition that the biopsy is composed of at least 50 % of tumor cells. However, for some cases, the tissue composition of a biopsy might be a mix of tumor and tumor-adjacent histologically normal tissue (TAHN). When this happens, a new biopsy is required, with associated cost, risks and discomfort to the patient. To avoid this problem, we hypothesize that a computational method can distinguish between lung cancer subtypes given tumor and TAHN tissue. METHODS: Using publicly available datasets for gene expression and DNA methylation, we applied four classification tasks, depending on the possible combinations of tumor and TAHN tissue. First, we used a feature selector (ReliefF/Limma) to select relevant variables, which were then used to build a simple naïve Bayes classification model. Then, we evaluated the classification performance of our models by measuring the area under the receiver operating characteristic curve (AUC). Finally, we analyzed the relevance of the selected genes using hierarchical clustering and IPA® software for gene functional analysis. RESULTS: All Bayesian models achieved high classification performance (AUC > 0.94), which were confirmed by hierarchical cluster analysis. From the genes selected, 25 (93 %) were found to be related to cancer (19 were associated with ADC or SCC), confirming the biological relevance of our method. CONCLUSIONS: The results from this study confirm that computational methods using tumor and TAHN tissue can serve as a prognostic tool for lung cancer subtype classification. Our study complements results from other studies where TAHN tissue has been used as prognostic tool for prostate cancer. The clinical implications of this finding could greatly benefit lung cancer patients. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12885-016-2223-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4778315
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47783152016-03-05 On Predicting lung cancer subtypes using ‘omic’ data from tumor and tumor-adjacent histologically-normal tissue Pineda, Arturo López Ogoe, Henry Ato Balasubramanian, Jeya Balaji Rangel Escareño, Claudia Visweswaran, Shyam Herman, James Gordon Gopalakrishnan, Vanathi BMC Cancer Research Article BACKGROUND: Adenocarcinoma (ADC) and squamous cell carcinoma (SCC) are the most prevalent histological types among lung cancers. Distinguishing between these subtypes is critically important because they have different implications for prognosis and treatment. Normally, histopathological analyses are used to distinguish between the two, where the tissue samples are collected based on small endoscopic samples or needle aspirations. However, the lack of cell architecture in these small tissue samples hampers the process of distinguishing between the two subtypes. Molecular profiling can also be used to discriminate between the two lung cancer subtypes, on condition that the biopsy is composed of at least 50 % of tumor cells. However, for some cases, the tissue composition of a biopsy might be a mix of tumor and tumor-adjacent histologically normal tissue (TAHN). When this happens, a new biopsy is required, with associated cost, risks and discomfort to the patient. To avoid this problem, we hypothesize that a computational method can distinguish between lung cancer subtypes given tumor and TAHN tissue. METHODS: Using publicly available datasets for gene expression and DNA methylation, we applied four classification tasks, depending on the possible combinations of tumor and TAHN tissue. First, we used a feature selector (ReliefF/Limma) to select relevant variables, which were then used to build a simple naïve Bayes classification model. Then, we evaluated the classification performance of our models by measuring the area under the receiver operating characteristic curve (AUC). Finally, we analyzed the relevance of the selected genes using hierarchical clustering and IPA® software for gene functional analysis. RESULTS: All Bayesian models achieved high classification performance (AUC > 0.94), which were confirmed by hierarchical cluster analysis. From the genes selected, 25 (93 %) were found to be related to cancer (19 were associated with ADC or SCC), confirming the biological relevance of our method. CONCLUSIONS: The results from this study confirm that computational methods using tumor and TAHN tissue can serve as a prognostic tool for lung cancer subtype classification. Our study complements results from other studies where TAHN tissue has been used as prognostic tool for prostate cancer. The clinical implications of this finding could greatly benefit lung cancer patients. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12885-016-2223-3) contains supplementary material, which is available to authorized users. BioMed Central 2016-03-04 /pmc/articles/PMC4778315/ /pubmed/26944944 http://dx.doi.org/10.1186/s12885-016-2223-3 Text en © Pineda et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Pineda, Arturo López
Ogoe, Henry Ato
Balasubramanian, Jeya Balaji
Rangel Escareño, Claudia
Visweswaran, Shyam
Herman, James Gordon
Gopalakrishnan, Vanathi
On Predicting lung cancer subtypes using ‘omic’ data from tumor and tumor-adjacent histologically-normal tissue
title On Predicting lung cancer subtypes using ‘omic’ data from tumor and tumor-adjacent histologically-normal tissue
title_full On Predicting lung cancer subtypes using ‘omic’ data from tumor and tumor-adjacent histologically-normal tissue
title_fullStr On Predicting lung cancer subtypes using ‘omic’ data from tumor and tumor-adjacent histologically-normal tissue
title_full_unstemmed On Predicting lung cancer subtypes using ‘omic’ data from tumor and tumor-adjacent histologically-normal tissue
title_short On Predicting lung cancer subtypes using ‘omic’ data from tumor and tumor-adjacent histologically-normal tissue
title_sort on predicting lung cancer subtypes using ‘omic’ data from tumor and tumor-adjacent histologically-normal tissue
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4778315/
https://www.ncbi.nlm.nih.gov/pubmed/26944944
http://dx.doi.org/10.1186/s12885-016-2223-3
work_keys_str_mv AT pinedaarturolopez onpredictinglungcancersubtypesusingomicdatafromtumorandtumoradjacenthistologicallynormaltissue
AT ogoehenryato onpredictinglungcancersubtypesusingomicdatafromtumorandtumoradjacenthistologicallynormaltissue
AT balasubramanianjeyabalaji onpredictinglungcancersubtypesusingomicdatafromtumorandtumoradjacenthistologicallynormaltissue
AT rangelescarenoclaudia onpredictinglungcancersubtypesusingomicdatafromtumorandtumoradjacenthistologicallynormaltissue
AT visweswaranshyam onpredictinglungcancersubtypesusingomicdatafromtumorandtumoradjacenthistologicallynormaltissue
AT hermanjamesgordon onpredictinglungcancersubtypesusingomicdatafromtumorandtumoradjacenthistologicallynormaltissue
AT gopalakrishnanvanathi onpredictinglungcancersubtypesusingomicdatafromtumorandtumoradjacenthistologicallynormaltissue