Cargando…

Computer-Aided Lung Nodule Recognition by SVM Classifier Based on Combination of Random Undersampling and SMOTE

In lung cancer computer-aided detection/diagnosis (CAD) systems, classification of regions of interest (ROI) is often used to detect/diagnose lung nodule accurately. However, problems of unbalanced datasets often have detrimental effects on the performance of classification. In this paper, both mino...

Descripción completa

Detalles Bibliográficos
Autores principales: Sui, Yuan, Wei, Ying, Zhao, Dazhe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4419492/
https://www.ncbi.nlm.nih.gov/pubmed/25977704
http://dx.doi.org/10.1155/2015/368674
_version_ 1782369585761091584
author Sui, Yuan
Wei, Ying
Zhao, Dazhe
author_facet Sui, Yuan
Wei, Ying
Zhao, Dazhe
author_sort Sui, Yuan
collection PubMed
description In lung cancer computer-aided detection/diagnosis (CAD) systems, classification of regions of interest (ROI) is often used to detect/diagnose lung nodule accurately. However, problems of unbalanced datasets often have detrimental effects on the performance of classification. In this paper, both minority and majority classes are resampled to increase the generalization ability. We propose a novel SVM classifier combined with random undersampling (RU) and SMOTE for lung nodule recognition. The combinations of the two resampling methods not only achieve a balanced training samples but also remove noise and duplicate information in the training sample and retain useful information to improve the effective data utilization, hence improving performance of SVM algorithm for pulmonary nodules classification under the unbalanced data. Eight features including 2D and 3D features are extracted for training and classification. Experimental results show that for different sizes of training datasets our RU-SMOTE-SVM classifier gets the highest classification accuracy among the four kinds of classifiers, and the average classification accuracy is more than 92.94%.
format Online
Article
Text
id pubmed-4419492
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-44194922015-05-14 Computer-Aided Lung Nodule Recognition by SVM Classifier Based on Combination of Random Undersampling and SMOTE Sui, Yuan Wei, Ying Zhao, Dazhe Comput Math Methods Med Research Article In lung cancer computer-aided detection/diagnosis (CAD) systems, classification of regions of interest (ROI) is often used to detect/diagnose lung nodule accurately. However, problems of unbalanced datasets often have detrimental effects on the performance of classification. In this paper, both minority and majority classes are resampled to increase the generalization ability. We propose a novel SVM classifier combined with random undersampling (RU) and SMOTE for lung nodule recognition. The combinations of the two resampling methods not only achieve a balanced training samples but also remove noise and duplicate information in the training sample and retain useful information to improve the effective data utilization, hence improving performance of SVM algorithm for pulmonary nodules classification under the unbalanced data. Eight features including 2D and 3D features are extracted for training and classification. Experimental results show that for different sizes of training datasets our RU-SMOTE-SVM classifier gets the highest classification accuracy among the four kinds of classifiers, and the average classification accuracy is more than 92.94%. Hindawi Publishing Corporation 2015 2015-04-06 /pmc/articles/PMC4419492/ /pubmed/25977704 http://dx.doi.org/10.1155/2015/368674 Text en Copyright © 2015 Yuan Sui et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Sui, Yuan
Wei, Ying
Zhao, Dazhe
Computer-Aided Lung Nodule Recognition by SVM Classifier Based on Combination of Random Undersampling and SMOTE
title Computer-Aided Lung Nodule Recognition by SVM Classifier Based on Combination of Random Undersampling and SMOTE
title_full Computer-Aided Lung Nodule Recognition by SVM Classifier Based on Combination of Random Undersampling and SMOTE
title_fullStr Computer-Aided Lung Nodule Recognition by SVM Classifier Based on Combination of Random Undersampling and SMOTE
title_full_unstemmed Computer-Aided Lung Nodule Recognition by SVM Classifier Based on Combination of Random Undersampling and SMOTE
title_short Computer-Aided Lung Nodule Recognition by SVM Classifier Based on Combination of Random Undersampling and SMOTE
title_sort computer-aided lung nodule recognition by svm classifier based on combination of random undersampling and smote
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4419492/
https://www.ncbi.nlm.nih.gov/pubmed/25977704
http://dx.doi.org/10.1155/2015/368674
work_keys_str_mv AT suiyuan computeraidedlungnodulerecognitionbysvmclassifierbasedoncombinationofrandomundersamplingandsmote
AT weiying computeraidedlungnodulerecognitionbysvmclassifierbasedoncombinationofrandomundersamplingandsmote
AT zhaodazhe computeraidedlungnodulerecognitionbysvmclassifierbasedoncombinationofrandomundersamplingandsmote