Cargando…

Computer-aided diagnosis of lung nodule using gradient tree boosting and Bayesian optimization

We aimed to evaluate a computer-aided diagnosis (CADx) system for lung nodule classification focussing on (i) usefulness of the conventional CADx system (hand-crafted imaging feature + machine learning algorithm), (ii) comparison between support vector machine (SVM) and gradient tree boosting (XGBoo...

Descripción completa

Detalles Bibliográficos
Autores principales: Nishio, Mizuho, Nishizawa, Mitsuo, Sugiyama, Osamu, Kojima, Ryosuke, Yakami, Masahiro, Kuroda, Tomohiro, Togashi, Kaori
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5908232/
https://www.ncbi.nlm.nih.gov/pubmed/29672639
http://dx.doi.org/10.1371/journal.pone.0195875
_version_ 1783315684333191168
author Nishio, Mizuho
Nishizawa, Mitsuo
Sugiyama, Osamu
Kojima, Ryosuke
Yakami, Masahiro
Kuroda, Tomohiro
Togashi, Kaori
author_facet Nishio, Mizuho
Nishizawa, Mitsuo
Sugiyama, Osamu
Kojima, Ryosuke
Yakami, Masahiro
Kuroda, Tomohiro
Togashi, Kaori
author_sort Nishio, Mizuho
collection PubMed
description We aimed to evaluate a computer-aided diagnosis (CADx) system for lung nodule classification focussing on (i) usefulness of the conventional CADx system (hand-crafted imaging feature + machine learning algorithm), (ii) comparison between support vector machine (SVM) and gradient tree boosting (XGBoost) as machine learning algorithms, and (iii) effectiveness of parameter optimization using Bayesian optimization and random search. Data on 99 lung nodules (62 lung cancers and 37 benign lung nodules) were included from public databases of CT images. A variant of the local binary pattern was used for calculating a feature vector. SVM or XGBoost was trained using the feature vector and its corresponding label. Tree Parzen Estimator (TPE) was used as Bayesian optimization for parameters of SVM and XGBoost. Random search was done for comparison with TPE. Leave-one-out cross-validation was used for optimizing and evaluating the performance of our CADx system. Performance was evaluated using area under the curve (AUC) of receiver operating characteristic analysis. AUC was calculated 10 times, and its average was obtained. The best averaged AUC of SVM and XGBoost was 0.850 and 0.896, respectively; both were obtained using TPE. XGBoost was generally superior to SVM. Optimal parameters for achieving high AUC were obtained with fewer numbers of trials when using TPE, compared with random search. Bayesian optimization of SVM and XGBoost parameters was more efficient than random search. Based on observer study, AUC values of two board-certified radiologists were 0.898 and 0.822. The results show that diagnostic accuracy of our CADx system was comparable to that of radiologists with respect to classifying lung nodules.
format Online
Article
Text
id pubmed-5908232
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-59082322018-05-05 Computer-aided diagnosis of lung nodule using gradient tree boosting and Bayesian optimization Nishio, Mizuho Nishizawa, Mitsuo Sugiyama, Osamu Kojima, Ryosuke Yakami, Masahiro Kuroda, Tomohiro Togashi, Kaori PLoS One Research Article We aimed to evaluate a computer-aided diagnosis (CADx) system for lung nodule classification focussing on (i) usefulness of the conventional CADx system (hand-crafted imaging feature + machine learning algorithm), (ii) comparison between support vector machine (SVM) and gradient tree boosting (XGBoost) as machine learning algorithms, and (iii) effectiveness of parameter optimization using Bayesian optimization and random search. Data on 99 lung nodules (62 lung cancers and 37 benign lung nodules) were included from public databases of CT images. A variant of the local binary pattern was used for calculating a feature vector. SVM or XGBoost was trained using the feature vector and its corresponding label. Tree Parzen Estimator (TPE) was used as Bayesian optimization for parameters of SVM and XGBoost. Random search was done for comparison with TPE. Leave-one-out cross-validation was used for optimizing and evaluating the performance of our CADx system. Performance was evaluated using area under the curve (AUC) of receiver operating characteristic analysis. AUC was calculated 10 times, and its average was obtained. The best averaged AUC of SVM and XGBoost was 0.850 and 0.896, respectively; both were obtained using TPE. XGBoost was generally superior to SVM. Optimal parameters for achieving high AUC were obtained with fewer numbers of trials when using TPE, compared with random search. Bayesian optimization of SVM and XGBoost parameters was more efficient than random search. Based on observer study, AUC values of two board-certified radiologists were 0.898 and 0.822. The results show that diagnostic accuracy of our CADx system was comparable to that of radiologists with respect to classifying lung nodules. Public Library of Science 2018-04-19 /pmc/articles/PMC5908232/ /pubmed/29672639 http://dx.doi.org/10.1371/journal.pone.0195875 Text en © 2018 Nishio et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Nishio, Mizuho
Nishizawa, Mitsuo
Sugiyama, Osamu
Kojima, Ryosuke
Yakami, Masahiro
Kuroda, Tomohiro
Togashi, Kaori
Computer-aided diagnosis of lung nodule using gradient tree boosting and Bayesian optimization
title Computer-aided diagnosis of lung nodule using gradient tree boosting and Bayesian optimization
title_full Computer-aided diagnosis of lung nodule using gradient tree boosting and Bayesian optimization
title_fullStr Computer-aided diagnosis of lung nodule using gradient tree boosting and Bayesian optimization
title_full_unstemmed Computer-aided diagnosis of lung nodule using gradient tree boosting and Bayesian optimization
title_short Computer-aided diagnosis of lung nodule using gradient tree boosting and Bayesian optimization
title_sort computer-aided diagnosis of lung nodule using gradient tree boosting and bayesian optimization
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5908232/
https://www.ncbi.nlm.nih.gov/pubmed/29672639
http://dx.doi.org/10.1371/journal.pone.0195875
work_keys_str_mv AT nishiomizuho computeraideddiagnosisoflungnoduleusinggradienttreeboostingandbayesianoptimization
AT nishizawamitsuo computeraideddiagnosisoflungnoduleusinggradienttreeboostingandbayesianoptimization
AT sugiyamaosamu computeraideddiagnosisoflungnoduleusinggradienttreeboostingandbayesianoptimization
AT kojimaryosuke computeraideddiagnosisoflungnoduleusinggradienttreeboostingandbayesianoptimization
AT yakamimasahiro computeraideddiagnosisoflungnoduleusinggradienttreeboostingandbayesianoptimization
AT kurodatomohiro computeraideddiagnosisoflungnoduleusinggradienttreeboostingandbayesianoptimization
AT togashikaori computeraideddiagnosisoflungnoduleusinggradienttreeboostingandbayesianoptimization