Cargando…

Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data

BACKGROUND: Few studies have investigated prognostic biomarkers of distant metastases of lung cancer. One of the central difficulties in identifying biomarkers from microarray data is the availability of only a small number of samples, which results overtraining. Recently obtained evidence reveals t...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Hui-Ling, Wu, Yu-Chung, Su, Li-Jen, Huang, Yun-Ju, Charoenkwan, Phasit, Chen, Wen-Liang, Lee, Hua-Chin, Chu, William Cheng-Chung, Ho, Shinn-Ying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4349617/
https://www.ncbi.nlm.nih.gov/pubmed/25881029
http://dx.doi.org/10.1186/s12859-015-0463-x
_version_ 1782360050080153600
author Huang, Hui-Ling
Wu, Yu-Chung
Su, Li-Jen
Huang, Yun-Ju
Charoenkwan, Phasit
Chen, Wen-Liang
Lee, Hua-Chin
Chu, William Cheng-Chung
Ho, Shinn-Ying
author_facet Huang, Hui-Ling
Wu, Yu-Chung
Su, Li-Jen
Huang, Yun-Ju
Charoenkwan, Phasit
Chen, Wen-Liang
Lee, Hua-Chin
Chu, William Cheng-Chung
Ho, Shinn-Ying
author_sort Huang, Hui-Ling
collection PubMed
description BACKGROUND: Few studies have investigated prognostic biomarkers of distant metastases of lung cancer. One of the central difficulties in identifying biomarkers from microarray data is the availability of only a small number of samples, which results overtraining. Recently obtained evidence reveals that epithelial–mesenchymal transition (EMT) of tumor cells causes metastasis, which is detrimental to patients’ survival. RESULTS: This work proposes a novel optimization approach to discovering EMT-related prognostic biomarkers to predict the distant metastasis of lung cancer using both microarray and survival data. This weighted objective function maximizes both the accuracy of prediction of distant metastasis and the area between the disease-free survival curves of the non-distant and distant metastases. Seventy-eight patients with lung cancer and a follow-up time of 120 months are used to identify a set of gene markers and an independent cohort of 26 patients is used to evaluate the identified biomarkers. The medical records of the 78 patients show a significant difference between the disease-free survival times of the 37 non-distant- and the 41 distant-metastasis patients. The experimental results thus obtained are as follows. 1) The use of disease-free survival curves can compensate for the shortcoming of insufficient samples and greatly increase the test accuracy by 11.10%; and 2) the support vector machine with a set of 17 transcripts, such as CCL16 and CDKN2AIP, can yield a leave-one-out cross-validation accuracy of 93.59%, a test accuracy of 76.92%, a large disease-free survival area of 74.81%, and a mean survival prediction error of 3.99 months. The identified putative biomarkers are examined using related studies and signaling pathways to reveal the potential effectiveness of the biomarkers in prospective confirmatory studies. CONCLUSIONS: The proposed new optimization approach to identifying prognostic biomarkers by combining multiple sources of data (microarray and survival) can facilitate the accurate selection of biomarkers that are most relevant to the disease while solving the problem of insufficient samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0463-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4349617
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43496172015-03-05 Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data Huang, Hui-Ling Wu, Yu-Chung Su, Li-Jen Huang, Yun-Ju Charoenkwan, Phasit Chen, Wen-Liang Lee, Hua-Chin Chu, William Cheng-Chung Ho, Shinn-Ying BMC Bioinformatics Methodology Article BACKGROUND: Few studies have investigated prognostic biomarkers of distant metastases of lung cancer. One of the central difficulties in identifying biomarkers from microarray data is the availability of only a small number of samples, which results overtraining. Recently obtained evidence reveals that epithelial–mesenchymal transition (EMT) of tumor cells causes metastasis, which is detrimental to patients’ survival. RESULTS: This work proposes a novel optimization approach to discovering EMT-related prognostic biomarkers to predict the distant metastasis of lung cancer using both microarray and survival data. This weighted objective function maximizes both the accuracy of prediction of distant metastasis and the area between the disease-free survival curves of the non-distant and distant metastases. Seventy-eight patients with lung cancer and a follow-up time of 120 months are used to identify a set of gene markers and an independent cohort of 26 patients is used to evaluate the identified biomarkers. The medical records of the 78 patients show a significant difference between the disease-free survival times of the 37 non-distant- and the 41 distant-metastasis patients. The experimental results thus obtained are as follows. 1) The use of disease-free survival curves can compensate for the shortcoming of insufficient samples and greatly increase the test accuracy by 11.10%; and 2) the support vector machine with a set of 17 transcripts, such as CCL16 and CDKN2AIP, can yield a leave-one-out cross-validation accuracy of 93.59%, a test accuracy of 76.92%, a large disease-free survival area of 74.81%, and a mean survival prediction error of 3.99 months. The identified putative biomarkers are examined using related studies and signaling pathways to reveal the potential effectiveness of the biomarkers in prospective confirmatory studies. CONCLUSIONS: The proposed new optimization approach to identifying prognostic biomarkers by combining multiple sources of data (microarray and survival) can facilitate the accurate selection of biomarkers that are most relevant to the disease while solving the problem of insufficient samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0463-x) contains supplementary material, which is available to authorized users. BioMed Central 2015-02-21 /pmc/articles/PMC4349617/ /pubmed/25881029 http://dx.doi.org/10.1186/s12859-015-0463-x Text en © Huang et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Huang, Hui-Ling
Wu, Yu-Chung
Su, Li-Jen
Huang, Yun-Ju
Charoenkwan, Phasit
Chen, Wen-Liang
Lee, Hua-Chin
Chu, William Cheng-Chung
Ho, Shinn-Ying
Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data
title Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data
title_full Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data
title_fullStr Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data
title_full_unstemmed Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data
title_short Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data
title_sort discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4349617/
https://www.ncbi.nlm.nih.gov/pubmed/25881029
http://dx.doi.org/10.1186/s12859-015-0463-x
work_keys_str_mv AT huanghuiling discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata
AT wuyuchung discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata
AT sulijen discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata
AT huangyunju discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata
AT charoenkwanphasit discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata
AT chenwenliang discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata
AT leehuachin discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata
AT chuwilliamchengchung discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata
AT hoshinnying discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata