Cargando…

Evaluation of the dependence of radiomic features on the machine learning model

BACKGROUND: In radiomic studies, several models are often trained with different combinations of feature selection methods and classifiers. The features of the best model are usually considered relevant to the problem, and they represent potential biomarkers. Features selected from statistically sim...

Descripción completa

Detalles Bibliográficos
Autor principal:	Demircioğlu, Aydin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer International Publishing 2022
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8873309/ https://www.ncbi.nlm.nih.gov/pubmed/35201534 http://dx.doi.org/10.1186/s13244-022-01170-2

_version_	1784657436211150848
author	Demircioğlu, Aydin
author_facet	Demircioğlu, Aydin
author_sort	Demircioğlu, Aydin
collection	PubMed
description	BACKGROUND: In radiomic studies, several models are often trained with different combinations of feature selection methods and classifiers. The features of the best model are usually considered relevant to the problem, and they represent potential biomarkers. Features selected from statistically similarly performing models are generally not studied. To understand the degree to which the selected features of these statistically similar models differ, 14 publicly available datasets, 8 feature selection methods, and 8 classifiers were used in this retrospective study. For each combination of feature selection and classifier, a model was trained, and its performance was measured with AUC-ROC. The best-performing model was compared to other models using a DeLong test. Models that were statistically similar were compared in terms of their selected features. RESULTS: Approximately 57% of all models analyzed were statistically similar to the best-performing model. Feature selection methods were, in general, relatively unstable (0.58; range 0.35–0.84). The features selected by different models varied largely (0.19; range 0.02–0.42), although the selected features themselves were highly correlated (0.71; range 0.4–0.92). CONCLUSIONS: Feature relevance in radiomics strongly depends on the model used, and statistically similar models will generally identify different features as relevant. Considering features selected by a single model is misleading, and it is often not possible to directly determine whether such features are candidate biomarkers. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13244-022-01170-2.
format	Online Article Text
id	pubmed-8873309
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Springer International Publishing
record_format	MEDLINE/PubMed
spelling	pubmed-88733092022-03-02 Evaluation of the dependence of radiomic features on the machine learning model Demircioğlu, Aydin Insights Imaging Original Article BACKGROUND: In radiomic studies, several models are often trained with different combinations of feature selection methods and classifiers. The features of the best model are usually considered relevant to the problem, and they represent potential biomarkers. Features selected from statistically similarly performing models are generally not studied. To understand the degree to which the selected features of these statistically similar models differ, 14 publicly available datasets, 8 feature selection methods, and 8 classifiers were used in this retrospective study. For each combination of feature selection and classifier, a model was trained, and its performance was measured with AUC-ROC. The best-performing model was compared to other models using a DeLong test. Models that were statistically similar were compared in terms of their selected features. RESULTS: Approximately 57% of all models analyzed were statistically similar to the best-performing model. Feature selection methods were, in general, relatively unstable (0.58; range 0.35–0.84). The features selected by different models varied largely (0.19; range 0.02–0.42), although the selected features themselves were highly correlated (0.71; range 0.4–0.92). CONCLUSIONS: Feature relevance in radiomics strongly depends on the model used, and statistically similar models will generally identify different features as relevant. Considering features selected by a single model is misleading, and it is often not possible to directly determine whether such features are candidate biomarkers. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13244-022-01170-2. Springer International Publishing 2022-02-24 /pmc/articles/PMC8873309/ /pubmed/35201534 http://dx.doi.org/10.1186/s13244-022-01170-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Original Article Demircioğlu, Aydin Evaluation of the dependence of radiomic features on the machine learning model
title	Evaluation of the dependence of radiomic features on the machine learning model
title_full	Evaluation of the dependence of radiomic features on the machine learning model
title_fullStr	Evaluation of the dependence of radiomic features on the machine learning model
title_full_unstemmed	Evaluation of the dependence of radiomic features on the machine learning model
title_short	Evaluation of the dependence of radiomic features on the machine learning model
title_sort	evaluation of the dependence of radiomic features on the machine learning model
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8873309/ https://www.ncbi.nlm.nih.gov/pubmed/35201534 http://dx.doi.org/10.1186/s13244-022-01170-2
work_keys_str_mv	AT demirciogluaydin evaluationofthedependenceofradiomicfeaturesonthemachinelearningmodel

Evaluation of the dependence of radiomic features on the machine learning model

Ejemplares similares