Cargando…

Computer-Aided Diagnosis for Early-Stage Lung Cancer Based on Longitudinal and Balanced Data

BACKGROUND: Lung cancer is one of the most common forms of cancer resulting in over a million deaths per year worldwide. Typically, the problem can be approached by developing more discriminative diagnosis methods. In this paper, computer-aided diagnosis was used to facilitate the prediction of char...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Tao, Zhang, Regina, Wang, Jingjing, Li, Xia, Guo, Xiuhua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3655169/
https://www.ncbi.nlm.nih.gov/pubmed/23691066
http://dx.doi.org/10.1371/journal.pone.0063559
_version_ 1782269838570291200
author Sun, Tao
Zhang, Regina
Wang, Jingjing
Li, Xia
Guo, Xiuhua
author_facet Sun, Tao
Zhang, Regina
Wang, Jingjing
Li, Xia
Guo, Xiuhua
author_sort Sun, Tao
collection PubMed
description BACKGROUND: Lung cancer is one of the most common forms of cancer resulting in over a million deaths per year worldwide. Typically, the problem can be approached by developing more discriminative diagnosis methods. In this paper, computer-aided diagnosis was used to facilitate the prediction of characteristics of solitary pulmonary nodules in CT of lungs to diagnose early-stage lung cancer. METHODS: The synthetic minority over-sampling technique (SMOTE) was used to account for raw data in order to balance the original training data set. Curvelet-transformation textural features, together with 3 patient demographic characteristics, and 9 morphological features were used to establish a support vector machine (SVM) prediction model. Longitudinal data as the test data set was used to evaluate the classification performance of predicting early-stage lung cancer. RESULTS: Using the SMOTE as a pre-processing procedure, the original training data was balanced with a ratio of malignant to benign cases of 1∶1. Accuracy based on cross-evaluation for the original unbalanced data and balanced data was 80% and 97%, respectively. Based on Curvelet-transformation textural features and other features, the SVM prediction model had good classification performance for early-stage lung cancer, with an area under the curve of the SVMs of 0.949 (P<0.001). Textural feature (standard deviation) showed benign cases had a higher change in the follow-up period than malignant cases. CONCLUSIONS: With textural features extracted from a Curvelet transformation and other parameters, a sensitive support vector machine prediction model can increase the rate of diagnosis for early-stage lung cancer. This scheme can be used as an auxiliary tool to differentiate between benign and malignant early-stage lung cancers in CT images.
format Online
Article
Text
id pubmed-3655169
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36551692013-05-20 Computer-Aided Diagnosis for Early-Stage Lung Cancer Based on Longitudinal and Balanced Data Sun, Tao Zhang, Regina Wang, Jingjing Li, Xia Guo, Xiuhua PLoS One Research Article BACKGROUND: Lung cancer is one of the most common forms of cancer resulting in over a million deaths per year worldwide. Typically, the problem can be approached by developing more discriminative diagnosis methods. In this paper, computer-aided diagnosis was used to facilitate the prediction of characteristics of solitary pulmonary nodules in CT of lungs to diagnose early-stage lung cancer. METHODS: The synthetic minority over-sampling technique (SMOTE) was used to account for raw data in order to balance the original training data set. Curvelet-transformation textural features, together with 3 patient demographic characteristics, and 9 morphological features were used to establish a support vector machine (SVM) prediction model. Longitudinal data as the test data set was used to evaluate the classification performance of predicting early-stage lung cancer. RESULTS: Using the SMOTE as a pre-processing procedure, the original training data was balanced with a ratio of malignant to benign cases of 1∶1. Accuracy based on cross-evaluation for the original unbalanced data and balanced data was 80% and 97%, respectively. Based on Curvelet-transformation textural features and other features, the SVM prediction model had good classification performance for early-stage lung cancer, with an area under the curve of the SVMs of 0.949 (P<0.001). Textural feature (standard deviation) showed benign cases had a higher change in the follow-up period than malignant cases. CONCLUSIONS: With textural features extracted from a Curvelet transformation and other parameters, a sensitive support vector machine prediction model can increase the rate of diagnosis for early-stage lung cancer. This scheme can be used as an auxiliary tool to differentiate between benign and malignant early-stage lung cancers in CT images. Public Library of Science 2013-05-15 /pmc/articles/PMC3655169/ /pubmed/23691066 http://dx.doi.org/10.1371/journal.pone.0063559 Text en © 2013 Sun et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Sun, Tao
Zhang, Regina
Wang, Jingjing
Li, Xia
Guo, Xiuhua
Computer-Aided Diagnosis for Early-Stage Lung Cancer Based on Longitudinal and Balanced Data
title Computer-Aided Diagnosis for Early-Stage Lung Cancer Based on Longitudinal and Balanced Data
title_full Computer-Aided Diagnosis for Early-Stage Lung Cancer Based on Longitudinal and Balanced Data
title_fullStr Computer-Aided Diagnosis for Early-Stage Lung Cancer Based on Longitudinal and Balanced Data
title_full_unstemmed Computer-Aided Diagnosis for Early-Stage Lung Cancer Based on Longitudinal and Balanced Data
title_short Computer-Aided Diagnosis for Early-Stage Lung Cancer Based on Longitudinal and Balanced Data
title_sort computer-aided diagnosis for early-stage lung cancer based on longitudinal and balanced data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3655169/
https://www.ncbi.nlm.nih.gov/pubmed/23691066
http://dx.doi.org/10.1371/journal.pone.0063559
work_keys_str_mv AT suntao computeraideddiagnosisforearlystagelungcancerbasedonlongitudinalandbalanceddata
AT zhangregina computeraideddiagnosisforearlystagelungcancerbasedonlongitudinalandbalanceddata
AT wangjingjing computeraideddiagnosisforearlystagelungcancerbasedonlongitudinalandbalanceddata
AT lixia computeraideddiagnosisforearlystagelungcancerbasedonlongitudinalandbalanceddata
AT guoxiuhua computeraideddiagnosisforearlystagelungcancerbasedonlongitudinalandbalanceddata