Cargando…

The Challenge of Choosing the Best Classification Method in Radiomic Analyses: Recommendations and Applications to Lung Cancer CT Images

SIMPLE SUMMARY: Radiomics aims to extract high-dimensional features from clinical images and associate them to clinical outcomes. These associations may be further investigated with machine learning models; however, guidelines on the most suitable method to support clinical decisions are still missi...

Descripción completa

Detalles Bibliográficos
Autores principales: Corso, Federica, Tini, Giulia, Lo Presti, Giuliana, Garau, Noemi, De Angelis, Simone Pietro, Bellerba, Federica, Rinaldi, Lisa, Botta, Francesca, Rizzo, Stefania, Origgi, Daniela, Paganelli, Chiara, Cremonesi, Marta, Rampinelli, Cristiano, Bellomi, Massimo, Mazzarella, Luca, Pelicci, Pier Giuseppe, Gandini, Sara, Raimondi, Sara
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8234634/
https://www.ncbi.nlm.nih.gov/pubmed/34205631
http://dx.doi.org/10.3390/cancers13123088
_version_ 1783714129561780224
author Corso, Federica
Tini, Giulia
Lo Presti, Giuliana
Garau, Noemi
De Angelis, Simone Pietro
Bellerba, Federica
Rinaldi, Lisa
Botta, Francesca
Rizzo, Stefania
Origgi, Daniela
Paganelli, Chiara
Cremonesi, Marta
Rampinelli, Cristiano
Bellomi, Massimo
Mazzarella, Luca
Pelicci, Pier Giuseppe
Gandini, Sara
Raimondi, Sara
author_facet Corso, Federica
Tini, Giulia
Lo Presti, Giuliana
Garau, Noemi
De Angelis, Simone Pietro
Bellerba, Federica
Rinaldi, Lisa
Botta, Francesca
Rizzo, Stefania
Origgi, Daniela
Paganelli, Chiara
Cremonesi, Marta
Rampinelli, Cristiano
Bellomi, Massimo
Mazzarella, Luca
Pelicci, Pier Giuseppe
Gandini, Sara
Raimondi, Sara
author_sort Corso, Federica
collection PubMed
description SIMPLE SUMMARY: Radiomics aims to extract high-dimensional features from clinical images and associate them to clinical outcomes. These associations may be further investigated with machine learning models; however, guidelines on the most suitable method to support clinical decisions are still missing. To improve the reliability and the accuracy of radiomic features in the prediction of a binary variable in a lung cancer setting, we compared several machine learning classifiers and feature selection methods on simulated data. These account for important characteristics that may vary in real clinical datasets: sample size, outcome balancing and association strength between radiomic features and outcome variables. We were able to suggest the most suitable classifiers for each studied case and to evaluate the impact of method choices. Our work highlights the importance of these choices in radiomic analyses and provides guidelines on how to select the best models for the data at hand. ABSTRACT: Radiomics uses high-dimensional sets of imaging features to predict biological characteristics of tumors and clinical outcomes. The choice of the algorithm used to analyze radiomic features and perform predictions has a high impact on the results, thus the identification of adequate machine learning methods for radiomic applications is crucial. In this study we aim to identify suitable approaches of analysis for radiomic-based binary predictions, according to sample size, outcome balancing and the features–outcome association strength. Simulated data were obtained reproducing the correlation structure among 168 radiomic features extracted from Computed Tomography images of 270 Non-Small-Cell Lung Cancer (NSCLC) patients and the associated to lymph node status. Performances of six classifiers combined with six feature selection (FS) methods were assessed on the simulated data using AUC (Area Under the Receiver Operating Characteristics Curves), sensitivity, and specificity. For all the FS methods and regardless of the association strength, the tree-based classifiers Random Forest and Extreme Gradient Boosting obtained good performances (AUC ≥ 0.73), showing the best trade-off between sensitivity and specificity. On small samples, performances were generally lower than in large–medium samples and with larger variations. FS methods generally did not improve performances. Thus, in radiomic studies, we suggest evaluating the choice of FS and classifiers, considering specific sample size, balancing, and association strength.
format Online
Article
Text
id pubmed-8234634
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-82346342021-06-27 The Challenge of Choosing the Best Classification Method in Radiomic Analyses: Recommendations and Applications to Lung Cancer CT Images Corso, Federica Tini, Giulia Lo Presti, Giuliana Garau, Noemi De Angelis, Simone Pietro Bellerba, Federica Rinaldi, Lisa Botta, Francesca Rizzo, Stefania Origgi, Daniela Paganelli, Chiara Cremonesi, Marta Rampinelli, Cristiano Bellomi, Massimo Mazzarella, Luca Pelicci, Pier Giuseppe Gandini, Sara Raimondi, Sara Cancers (Basel) Article SIMPLE SUMMARY: Radiomics aims to extract high-dimensional features from clinical images and associate them to clinical outcomes. These associations may be further investigated with machine learning models; however, guidelines on the most suitable method to support clinical decisions are still missing. To improve the reliability and the accuracy of radiomic features in the prediction of a binary variable in a lung cancer setting, we compared several machine learning classifiers and feature selection methods on simulated data. These account for important characteristics that may vary in real clinical datasets: sample size, outcome balancing and association strength between radiomic features and outcome variables. We were able to suggest the most suitable classifiers for each studied case and to evaluate the impact of method choices. Our work highlights the importance of these choices in radiomic analyses and provides guidelines on how to select the best models for the data at hand. ABSTRACT: Radiomics uses high-dimensional sets of imaging features to predict biological characteristics of tumors and clinical outcomes. The choice of the algorithm used to analyze radiomic features and perform predictions has a high impact on the results, thus the identification of adequate machine learning methods for radiomic applications is crucial. In this study we aim to identify suitable approaches of analysis for radiomic-based binary predictions, according to sample size, outcome balancing and the features–outcome association strength. Simulated data were obtained reproducing the correlation structure among 168 radiomic features extracted from Computed Tomography images of 270 Non-Small-Cell Lung Cancer (NSCLC) patients and the associated to lymph node status. Performances of six classifiers combined with six feature selection (FS) methods were assessed on the simulated data using AUC (Area Under the Receiver Operating Characteristics Curves), sensitivity, and specificity. For all the FS methods and regardless of the association strength, the tree-based classifiers Random Forest and Extreme Gradient Boosting obtained good performances (AUC ≥ 0.73), showing the best trade-off between sensitivity and specificity. On small samples, performances were generally lower than in large–medium samples and with larger variations. FS methods generally did not improve performances. Thus, in radiomic studies, we suggest evaluating the choice of FS and classifiers, considering specific sample size, balancing, and association strength. MDPI 2021-06-21 /pmc/articles/PMC8234634/ /pubmed/34205631 http://dx.doi.org/10.3390/cancers13123088 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Corso, Federica
Tini, Giulia
Lo Presti, Giuliana
Garau, Noemi
De Angelis, Simone Pietro
Bellerba, Federica
Rinaldi, Lisa
Botta, Francesca
Rizzo, Stefania
Origgi, Daniela
Paganelli, Chiara
Cremonesi, Marta
Rampinelli, Cristiano
Bellomi, Massimo
Mazzarella, Luca
Pelicci, Pier Giuseppe
Gandini, Sara
Raimondi, Sara
The Challenge of Choosing the Best Classification Method in Radiomic Analyses: Recommendations and Applications to Lung Cancer CT Images
title The Challenge of Choosing the Best Classification Method in Radiomic Analyses: Recommendations and Applications to Lung Cancer CT Images
title_full The Challenge of Choosing the Best Classification Method in Radiomic Analyses: Recommendations and Applications to Lung Cancer CT Images
title_fullStr The Challenge of Choosing the Best Classification Method in Radiomic Analyses: Recommendations and Applications to Lung Cancer CT Images
title_full_unstemmed The Challenge of Choosing the Best Classification Method in Radiomic Analyses: Recommendations and Applications to Lung Cancer CT Images
title_short The Challenge of Choosing the Best Classification Method in Radiomic Analyses: Recommendations and Applications to Lung Cancer CT Images
title_sort challenge of choosing the best classification method in radiomic analyses: recommendations and applications to lung cancer ct images
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8234634/
https://www.ncbi.nlm.nih.gov/pubmed/34205631
http://dx.doi.org/10.3390/cancers13123088
work_keys_str_mv AT corsofederica thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT tinigiulia thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT loprestigiuliana thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT garaunoemi thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT deangelissimonepietro thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT bellerbafederica thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT rinaldilisa thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT bottafrancesca thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT rizzostefania thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT origgidaniela thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT paganellichiara thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT cremonesimarta thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT rampinellicristiano thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT bellomimassimo thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT mazzarellaluca thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT peliccipiergiuseppe thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT gandinisara thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT raimondisara thechallengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT corsofederica challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT tinigiulia challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT loprestigiuliana challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT garaunoemi challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT deangelissimonepietro challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT bellerbafederica challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT rinaldilisa challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT bottafrancesca challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT rizzostefania challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT origgidaniela challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT paganellichiara challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT cremonesimarta challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT rampinellicristiano challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT bellomimassimo challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT mazzarellaluca challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT peliccipiergiuseppe challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT gandinisara challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages
AT raimondisara challengeofchoosingthebestclassificationmethodinradiomicanalysesrecommendationsandapplicationstolungcancerctimages