Cargando…

AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM

BACKGROUND: Digital mammography is one of the most promising options to diagnose breast cancer which is the most common cancer in women. However, its effectiveness is enfeebled due to the difficulty in distinguishing actual cancer lesions from benign abnormalities, which results in unnecessary biops...

Descripción completa

Detalles Bibliográficos
Autores principales: Yoon, Sejong, Kim, Saejoon
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2773916/
https://www.ncbi.nlm.nih.gov/pubmed/19891795
http://dx.doi.org/10.1186/1472-6947-9-S1-S1
_version_ 1782173908601929728
author Yoon, Sejong
Kim, Saejoon
author_facet Yoon, Sejong
Kim, Saejoon
author_sort Yoon, Sejong
collection PubMed
description BACKGROUND: Digital mammography is one of the most promising options to diagnose breast cancer which is the most common cancer in women. However, its effectiveness is enfeebled due to the difficulty in distinguishing actual cancer lesions from benign abnormalities, which results in unnecessary biopsy referrals. To overcome this issue, computer aided diagnosis (CADx) using machine learning techniques have been studied worldwide. Since this is a classification problem and the number of features obtainable from a mammogram image is infinite, a feature selection method that is tailored for use in the CADx systems is needed. METHODS: We propose a feature selection method based on multiple support vector machine recursive feature elimination (MSVM-RFE). We compared our method with four previously proposed feature selection methods which use support vector machine as the base classifier. Experiments were performed on lesions extracted from the Digital Database of Screening Mammography, the largest public digital mammography database available. We measured average accuracy over 5-fold cross validation on the 8 datasets we extracted. RESULTS: Selecting from 8 features, conventional algorithms like SVM-RFE and multiple SVM-RFE showed slightly better performance than others. However, when selecting from 22 features, our proposed modified multiple SVM-RFE using boosting outperformed or was at least competitive to all others. CONCLUSION: Our modified method may be a possible alternative to SVM-RFE or the original MSVM-RFE in many cases of interest. In the future, we need a specific method to effectively combine models trained during the feature selection process and a way to combine feature subsets generated from individual SVM-RFE instances.
format Text
id pubmed-2773916
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27739162009-11-07 AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM Yoon, Sejong Kim, Saejoon BMC Med Inform Decis Mak Research BACKGROUND: Digital mammography is one of the most promising options to diagnose breast cancer which is the most common cancer in women. However, its effectiveness is enfeebled due to the difficulty in distinguishing actual cancer lesions from benign abnormalities, which results in unnecessary biopsy referrals. To overcome this issue, computer aided diagnosis (CADx) using machine learning techniques have been studied worldwide. Since this is a classification problem and the number of features obtainable from a mammogram image is infinite, a feature selection method that is tailored for use in the CADx systems is needed. METHODS: We propose a feature selection method based on multiple support vector machine recursive feature elimination (MSVM-RFE). We compared our method with four previously proposed feature selection methods which use support vector machine as the base classifier. Experiments were performed on lesions extracted from the Digital Database of Screening Mammography, the largest public digital mammography database available. We measured average accuracy over 5-fold cross validation on the 8 datasets we extracted. RESULTS: Selecting from 8 features, conventional algorithms like SVM-RFE and multiple SVM-RFE showed slightly better performance than others. However, when selecting from 22 features, our proposed modified multiple SVM-RFE using boosting outperformed or was at least competitive to all others. CONCLUSION: Our modified method may be a possible alternative to SVM-RFE or the original MSVM-RFE in many cases of interest. In the future, we need a specific method to effectively combine models trained during the feature selection process and a way to combine feature subsets generated from individual SVM-RFE instances. BioMed Central 2009-11-03 /pmc/articles/PMC2773916/ /pubmed/19891795 http://dx.doi.org/10.1186/1472-6947-9-S1-S1 Text en Copyright © 2009 Yoon and Kim; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Yoon, Sejong
Kim, Saejoon
AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM
title AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM
title_full AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM
title_fullStr AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM
title_full_unstemmed AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM
title_short AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM
title_sort adaboost-based multiple svm-rfe for classification of mammograms in ddsm
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2773916/
https://www.ncbi.nlm.nih.gov/pubmed/19891795
http://dx.doi.org/10.1186/1472-6947-9-S1-S1
work_keys_str_mv AT yoonsejong adaboostbasedmultiplesvmrfeforclassificationofmammogramsinddsm
AT kimsaejoon adaboostbasedmultiplesvmrfeforclassificationofmammogramsinddsm