Cargando…
Computer-Aided Diagnosis for Early-Stage Lung Cancer Based on Longitudinal and Balanced Data
BACKGROUND: Lung cancer is one of the most common forms of cancer resulting in over a million deaths per year worldwide. Typically, the problem can be approached by developing more discriminative diagnosis methods. In this paper, computer-aided diagnosis was used to facilitate the prediction of char...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3655169/ https://www.ncbi.nlm.nih.gov/pubmed/23691066 http://dx.doi.org/10.1371/journal.pone.0063559 |
_version_ | 1782269838570291200 |
---|---|
author | Sun, Tao Zhang, Regina Wang, Jingjing Li, Xia Guo, Xiuhua |
author_facet | Sun, Tao Zhang, Regina Wang, Jingjing Li, Xia Guo, Xiuhua |
author_sort | Sun, Tao |
collection | PubMed |
description | BACKGROUND: Lung cancer is one of the most common forms of cancer resulting in over a million deaths per year worldwide. Typically, the problem can be approached by developing more discriminative diagnosis methods. In this paper, computer-aided diagnosis was used to facilitate the prediction of characteristics of solitary pulmonary nodules in CT of lungs to diagnose early-stage lung cancer. METHODS: The synthetic minority over-sampling technique (SMOTE) was used to account for raw data in order to balance the original training data set. Curvelet-transformation textural features, together with 3 patient demographic characteristics, and 9 morphological features were used to establish a support vector machine (SVM) prediction model. Longitudinal data as the test data set was used to evaluate the classification performance of predicting early-stage lung cancer. RESULTS: Using the SMOTE as a pre-processing procedure, the original training data was balanced with a ratio of malignant to benign cases of 1∶1. Accuracy based on cross-evaluation for the original unbalanced data and balanced data was 80% and 97%, respectively. Based on Curvelet-transformation textural features and other features, the SVM prediction model had good classification performance for early-stage lung cancer, with an area under the curve of the SVMs of 0.949 (P<0.001). Textural feature (standard deviation) showed benign cases had a higher change in the follow-up period than malignant cases. CONCLUSIONS: With textural features extracted from a Curvelet transformation and other parameters, a sensitive support vector machine prediction model can increase the rate of diagnosis for early-stage lung cancer. This scheme can be used as an auxiliary tool to differentiate between benign and malignant early-stage lung cancers in CT images. |
format | Online Article Text |
id | pubmed-3655169 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-36551692013-05-20 Computer-Aided Diagnosis for Early-Stage Lung Cancer Based on Longitudinal and Balanced Data Sun, Tao Zhang, Regina Wang, Jingjing Li, Xia Guo, Xiuhua PLoS One Research Article BACKGROUND: Lung cancer is one of the most common forms of cancer resulting in over a million deaths per year worldwide. Typically, the problem can be approached by developing more discriminative diagnosis methods. In this paper, computer-aided diagnosis was used to facilitate the prediction of characteristics of solitary pulmonary nodules in CT of lungs to diagnose early-stage lung cancer. METHODS: The synthetic minority over-sampling technique (SMOTE) was used to account for raw data in order to balance the original training data set. Curvelet-transformation textural features, together with 3 patient demographic characteristics, and 9 morphological features were used to establish a support vector machine (SVM) prediction model. Longitudinal data as the test data set was used to evaluate the classification performance of predicting early-stage lung cancer. RESULTS: Using the SMOTE as a pre-processing procedure, the original training data was balanced with a ratio of malignant to benign cases of 1∶1. Accuracy based on cross-evaluation for the original unbalanced data and balanced data was 80% and 97%, respectively. Based on Curvelet-transformation textural features and other features, the SVM prediction model had good classification performance for early-stage lung cancer, with an area under the curve of the SVMs of 0.949 (P<0.001). Textural feature (standard deviation) showed benign cases had a higher change in the follow-up period than malignant cases. CONCLUSIONS: With textural features extracted from a Curvelet transformation and other parameters, a sensitive support vector machine prediction model can increase the rate of diagnosis for early-stage lung cancer. This scheme can be used as an auxiliary tool to differentiate between benign and malignant early-stage lung cancers in CT images. Public Library of Science 2013-05-15 /pmc/articles/PMC3655169/ /pubmed/23691066 http://dx.doi.org/10.1371/journal.pone.0063559 Text en © 2013 Sun et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Sun, Tao Zhang, Regina Wang, Jingjing Li, Xia Guo, Xiuhua Computer-Aided Diagnosis for Early-Stage Lung Cancer Based on Longitudinal and Balanced Data |
title | Computer-Aided Diagnosis for Early-Stage Lung Cancer Based on Longitudinal and Balanced Data |
title_full | Computer-Aided Diagnosis for Early-Stage Lung Cancer Based on Longitudinal and Balanced Data |
title_fullStr | Computer-Aided Diagnosis for Early-Stage Lung Cancer Based on Longitudinal and Balanced Data |
title_full_unstemmed | Computer-Aided Diagnosis for Early-Stage Lung Cancer Based on Longitudinal and Balanced Data |
title_short | Computer-Aided Diagnosis for Early-Stage Lung Cancer Based on Longitudinal and Balanced Data |
title_sort | computer-aided diagnosis for early-stage lung cancer based on longitudinal and balanced data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3655169/ https://www.ncbi.nlm.nih.gov/pubmed/23691066 http://dx.doi.org/10.1371/journal.pone.0063559 |
work_keys_str_mv | AT suntao computeraideddiagnosisforearlystagelungcancerbasedonlongitudinalandbalanceddata AT zhangregina computeraideddiagnosisforearlystagelungcancerbasedonlongitudinalandbalanceddata AT wangjingjing computeraideddiagnosisforearlystagelungcancerbasedonlongitudinalandbalanceddata AT lixia computeraideddiagnosisforearlystagelungcancerbasedonlongitudinalandbalanceddata AT guoxiuhua computeraideddiagnosisforearlystagelungcancerbasedonlongitudinalandbalanceddata |