Cargando…

SVM and SVM Ensembles in Breast Cancer Prediction

Breast cancer is an all too common disease in women, making how to effectively predict it an active research problem. A number of statistical and machine learning techniques have been employed to develop various breast cancer prediction models. Among them, support vector machines (SVM) have been sho...

Descripción completa

Detalles Bibliográficos
Autores principales:	Huang, Min-Wei, Chen, Chih-Wen, Lin, Wei-Chao, Ke, Shih-Wen, Tsai, Chih-Fong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2017
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5217832/ https://www.ncbi.nlm.nih.gov/pubmed/28060807 http://dx.doi.org/10.1371/journal.pone.0161501

_version_	1782492163870818304
author	Huang, Min-Wei Chen, Chih-Wen Lin, Wei-Chao Ke, Shih-Wen Tsai, Chih-Fong
author_facet	Huang, Min-Wei Chen, Chih-Wen Lin, Wei-Chao Ke, Shih-Wen Tsai, Chih-Fong
author_sort	Huang, Min-Wei
collection	PubMed
description	Breast cancer is an all too common disease in women, making how to effectively predict it an active research problem. A number of statistical and machine learning techniques have been employed to develop various breast cancer prediction models. Among them, support vector machines (SVM) have been shown to outperform many related techniques. To construct the SVM classifier, it is first necessary to decide the kernel function, and different kernel functions can result in different prediction performance. However, there have been very few studies focused on examining the prediction performances of SVM based on different kernel functions. Moreover, it is unknown whether SVM classifier ensembles which have been proposed to improve the performance of single classifiers can outperform single SVM classifiers in terms of breast cancer prediction. Therefore, the aim of this paper is to fully assess the prediction performance of SVM and SVM ensembles over small and large scale breast cancer datasets. The classification accuracy, ROC, F-measure, and computational times of training SVM and SVM ensembles are compared. The experimental results show that linear kernel based SVM ensembles based on the bagging method and RBF kernel based SVM ensembles with the boosting method can be the better choices for a small scale dataset, where feature selection should be performed in the data pre-processing stage. For a large scale dataset, RBF kernel based SVM ensembles based on boosting perform better than the other classifiers.
format	Online Article Text
id	pubmed-5217832
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-52178322017-01-19 SVM and SVM Ensembles in Breast Cancer Prediction Huang, Min-Wei Chen, Chih-Wen Lin, Wei-Chao Ke, Shih-Wen Tsai, Chih-Fong PLoS One Research Article Breast cancer is an all too common disease in women, making how to effectively predict it an active research problem. A number of statistical and machine learning techniques have been employed to develop various breast cancer prediction models. Among them, support vector machines (SVM) have been shown to outperform many related techniques. To construct the SVM classifier, it is first necessary to decide the kernel function, and different kernel functions can result in different prediction performance. However, there have been very few studies focused on examining the prediction performances of SVM based on different kernel functions. Moreover, it is unknown whether SVM classifier ensembles which have been proposed to improve the performance of single classifiers can outperform single SVM classifiers in terms of breast cancer prediction. Therefore, the aim of this paper is to fully assess the prediction performance of SVM and SVM ensembles over small and large scale breast cancer datasets. The classification accuracy, ROC, F-measure, and computational times of training SVM and SVM ensembles are compared. The experimental results show that linear kernel based SVM ensembles based on the bagging method and RBF kernel based SVM ensembles with the boosting method can be the better choices for a small scale dataset, where feature selection should be performed in the data pre-processing stage. For a large scale dataset, RBF kernel based SVM ensembles based on boosting perform better than the other classifiers. Public Library of Science 2017-01-06 /pmc/articles/PMC5217832/ /pubmed/28060807 http://dx.doi.org/10.1371/journal.pone.0161501 Text en © 2017 Huang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Huang, Min-Wei Chen, Chih-Wen Lin, Wei-Chao Ke, Shih-Wen Tsai, Chih-Fong SVM and SVM Ensembles in Breast Cancer Prediction
title	SVM and SVM Ensembles in Breast Cancer Prediction
title_full	SVM and SVM Ensembles in Breast Cancer Prediction
title_fullStr	SVM and SVM Ensembles in Breast Cancer Prediction
title_full_unstemmed	SVM and SVM Ensembles in Breast Cancer Prediction
title_short	SVM and SVM Ensembles in Breast Cancer Prediction
title_sort	svm and svm ensembles in breast cancer prediction
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5217832/ https://www.ncbi.nlm.nih.gov/pubmed/28060807 http://dx.doi.org/10.1371/journal.pone.0161501
work_keys_str_mv	AT huangminwei svmandsvmensemblesinbreastcancerprediction AT chenchihwen svmandsvmensemblesinbreastcancerprediction AT linweichao svmandsvmensemblesinbreastcancerprediction AT keshihwen svmandsvmensemblesinbreastcancerprediction AT tsaichihfong svmandsvmensemblesinbreastcancerprediction

SVM and SVM Ensembles in Breast Cancer Prediction

Ejemplares similares