Cargando…
Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data
BACKGROUND: Few studies have investigated prognostic biomarkers of distant metastases of lung cancer. One of the central difficulties in identifying biomarkers from microarray data is the availability of only a small number of samples, which results overtraining. Recently obtained evidence reveals t...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4349617/ https://www.ncbi.nlm.nih.gov/pubmed/25881029 http://dx.doi.org/10.1186/s12859-015-0463-x |
_version_ | 1782360050080153600 |
---|---|
author | Huang, Hui-Ling Wu, Yu-Chung Su, Li-Jen Huang, Yun-Ju Charoenkwan, Phasit Chen, Wen-Liang Lee, Hua-Chin Chu, William Cheng-Chung Ho, Shinn-Ying |
author_facet | Huang, Hui-Ling Wu, Yu-Chung Su, Li-Jen Huang, Yun-Ju Charoenkwan, Phasit Chen, Wen-Liang Lee, Hua-Chin Chu, William Cheng-Chung Ho, Shinn-Ying |
author_sort | Huang, Hui-Ling |
collection | PubMed |
description | BACKGROUND: Few studies have investigated prognostic biomarkers of distant metastases of lung cancer. One of the central difficulties in identifying biomarkers from microarray data is the availability of only a small number of samples, which results overtraining. Recently obtained evidence reveals that epithelial–mesenchymal transition (EMT) of tumor cells causes metastasis, which is detrimental to patients’ survival. RESULTS: This work proposes a novel optimization approach to discovering EMT-related prognostic biomarkers to predict the distant metastasis of lung cancer using both microarray and survival data. This weighted objective function maximizes both the accuracy of prediction of distant metastasis and the area between the disease-free survival curves of the non-distant and distant metastases. Seventy-eight patients with lung cancer and a follow-up time of 120 months are used to identify a set of gene markers and an independent cohort of 26 patients is used to evaluate the identified biomarkers. The medical records of the 78 patients show a significant difference between the disease-free survival times of the 37 non-distant- and the 41 distant-metastasis patients. The experimental results thus obtained are as follows. 1) The use of disease-free survival curves can compensate for the shortcoming of insufficient samples and greatly increase the test accuracy by 11.10%; and 2) the support vector machine with a set of 17 transcripts, such as CCL16 and CDKN2AIP, can yield a leave-one-out cross-validation accuracy of 93.59%, a test accuracy of 76.92%, a large disease-free survival area of 74.81%, and a mean survival prediction error of 3.99 months. The identified putative biomarkers are examined using related studies and signaling pathways to reveal the potential effectiveness of the biomarkers in prospective confirmatory studies. CONCLUSIONS: The proposed new optimization approach to identifying prognostic biomarkers by combining multiple sources of data (microarray and survival) can facilitate the accurate selection of biomarkers that are most relevant to the disease while solving the problem of insufficient samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0463-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4349617 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43496172015-03-05 Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data Huang, Hui-Ling Wu, Yu-Chung Su, Li-Jen Huang, Yun-Ju Charoenkwan, Phasit Chen, Wen-Liang Lee, Hua-Chin Chu, William Cheng-Chung Ho, Shinn-Ying BMC Bioinformatics Methodology Article BACKGROUND: Few studies have investigated prognostic biomarkers of distant metastases of lung cancer. One of the central difficulties in identifying biomarkers from microarray data is the availability of only a small number of samples, which results overtraining. Recently obtained evidence reveals that epithelial–mesenchymal transition (EMT) of tumor cells causes metastasis, which is detrimental to patients’ survival. RESULTS: This work proposes a novel optimization approach to discovering EMT-related prognostic biomarkers to predict the distant metastasis of lung cancer using both microarray and survival data. This weighted objective function maximizes both the accuracy of prediction of distant metastasis and the area between the disease-free survival curves of the non-distant and distant metastases. Seventy-eight patients with lung cancer and a follow-up time of 120 months are used to identify a set of gene markers and an independent cohort of 26 patients is used to evaluate the identified biomarkers. The medical records of the 78 patients show a significant difference between the disease-free survival times of the 37 non-distant- and the 41 distant-metastasis patients. The experimental results thus obtained are as follows. 1) The use of disease-free survival curves can compensate for the shortcoming of insufficient samples and greatly increase the test accuracy by 11.10%; and 2) the support vector machine with a set of 17 transcripts, such as CCL16 and CDKN2AIP, can yield a leave-one-out cross-validation accuracy of 93.59%, a test accuracy of 76.92%, a large disease-free survival area of 74.81%, and a mean survival prediction error of 3.99 months. The identified putative biomarkers are examined using related studies and signaling pathways to reveal the potential effectiveness of the biomarkers in prospective confirmatory studies. CONCLUSIONS: The proposed new optimization approach to identifying prognostic biomarkers by combining multiple sources of data (microarray and survival) can facilitate the accurate selection of biomarkers that are most relevant to the disease while solving the problem of insufficient samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0463-x) contains supplementary material, which is available to authorized users. BioMed Central 2015-02-21 /pmc/articles/PMC4349617/ /pubmed/25881029 http://dx.doi.org/10.1186/s12859-015-0463-x Text en © Huang et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Huang, Hui-Ling Wu, Yu-Chung Su, Li-Jen Huang, Yun-Ju Charoenkwan, Phasit Chen, Wen-Liang Lee, Hua-Chin Chu, William Cheng-Chung Ho, Shinn-Ying Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data |
title | Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data |
title_full | Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data |
title_fullStr | Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data |
title_full_unstemmed | Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data |
title_short | Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data |
title_sort | discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4349617/ https://www.ncbi.nlm.nih.gov/pubmed/25881029 http://dx.doi.org/10.1186/s12859-015-0463-x |
work_keys_str_mv | AT huanghuiling discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata AT wuyuchung discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata AT sulijen discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata AT huangyunju discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata AT charoenkwanphasit discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata AT chenwenliang discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata AT leehuachin discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata AT chuwilliamchengchung discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata AT hoshinnying discoveryofprognosticbiomarkersforpredictinglungcancermetastasisusingmicroarrayandsurvivaldata |