Cargando…
AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM
BACKGROUND: Digital mammography is one of the most promising options to diagnose breast cancer which is the most common cancer in women. However, its effectiveness is enfeebled due to the difficulty in distinguishing actual cancer lesions from benign abnormalities, which results in unnecessary biops...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2773916/ https://www.ncbi.nlm.nih.gov/pubmed/19891795 http://dx.doi.org/10.1186/1472-6947-9-S1-S1 |
_version_ | 1782173908601929728 |
---|---|
author | Yoon, Sejong Kim, Saejoon |
author_facet | Yoon, Sejong Kim, Saejoon |
author_sort | Yoon, Sejong |
collection | PubMed |
description | BACKGROUND: Digital mammography is one of the most promising options to diagnose breast cancer which is the most common cancer in women. However, its effectiveness is enfeebled due to the difficulty in distinguishing actual cancer lesions from benign abnormalities, which results in unnecessary biopsy referrals. To overcome this issue, computer aided diagnosis (CADx) using machine learning techniques have been studied worldwide. Since this is a classification problem and the number of features obtainable from a mammogram image is infinite, a feature selection method that is tailored for use in the CADx systems is needed. METHODS: We propose a feature selection method based on multiple support vector machine recursive feature elimination (MSVM-RFE). We compared our method with four previously proposed feature selection methods which use support vector machine as the base classifier. Experiments were performed on lesions extracted from the Digital Database of Screening Mammography, the largest public digital mammography database available. We measured average accuracy over 5-fold cross validation on the 8 datasets we extracted. RESULTS: Selecting from 8 features, conventional algorithms like SVM-RFE and multiple SVM-RFE showed slightly better performance than others. However, when selecting from 22 features, our proposed modified multiple SVM-RFE using boosting outperformed or was at least competitive to all others. CONCLUSION: Our modified method may be a possible alternative to SVM-RFE or the original MSVM-RFE in many cases of interest. In the future, we need a specific method to effectively combine models trained during the feature selection process and a way to combine feature subsets generated from individual SVM-RFE instances. |
format | Text |
id | pubmed-2773916 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-27739162009-11-07 AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM Yoon, Sejong Kim, Saejoon BMC Med Inform Decis Mak Research BACKGROUND: Digital mammography is one of the most promising options to diagnose breast cancer which is the most common cancer in women. However, its effectiveness is enfeebled due to the difficulty in distinguishing actual cancer lesions from benign abnormalities, which results in unnecessary biopsy referrals. To overcome this issue, computer aided diagnosis (CADx) using machine learning techniques have been studied worldwide. Since this is a classification problem and the number of features obtainable from a mammogram image is infinite, a feature selection method that is tailored for use in the CADx systems is needed. METHODS: We propose a feature selection method based on multiple support vector machine recursive feature elimination (MSVM-RFE). We compared our method with four previously proposed feature selection methods which use support vector machine as the base classifier. Experiments were performed on lesions extracted from the Digital Database of Screening Mammography, the largest public digital mammography database available. We measured average accuracy over 5-fold cross validation on the 8 datasets we extracted. RESULTS: Selecting from 8 features, conventional algorithms like SVM-RFE and multiple SVM-RFE showed slightly better performance than others. However, when selecting from 22 features, our proposed modified multiple SVM-RFE using boosting outperformed or was at least competitive to all others. CONCLUSION: Our modified method may be a possible alternative to SVM-RFE or the original MSVM-RFE in many cases of interest. In the future, we need a specific method to effectively combine models trained during the feature selection process and a way to combine feature subsets generated from individual SVM-RFE instances. BioMed Central 2009-11-03 /pmc/articles/PMC2773916/ /pubmed/19891795 http://dx.doi.org/10.1186/1472-6947-9-S1-S1 Text en Copyright © 2009 Yoon and Kim; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Yoon, Sejong Kim, Saejoon AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM |
title | AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM |
title_full | AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM |
title_fullStr | AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM |
title_full_unstemmed | AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM |
title_short | AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM |
title_sort | adaboost-based multiple svm-rfe for classification of mammograms in ddsm |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2773916/ https://www.ncbi.nlm.nih.gov/pubmed/19891795 http://dx.doi.org/10.1186/1472-6947-9-S1-S1 |
work_keys_str_mv | AT yoonsejong adaboostbasedmultiplesvmrfeforclassificationofmammogramsinddsm AT kimsaejoon adaboostbasedmultiplesvmrfeforclassificationofmammogramsinddsm |