Cargando…

Optimal Statistical Incorporation of Independent Feature Stability Information into Radiomics Studies

Conducting side experiments termed robustness experiments, to identify features that are stable with respect to rescans, annotation, or other confounding effects is an important element in radiomics research. However, the matter of how to include the finding of these experiments into the model build...

Descripción completa

Detalles Bibliográficos
Autores principales: Götz, Michael, Maier-Hein, Klaus H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6971266/
https://www.ncbi.nlm.nih.gov/pubmed/31959832
http://dx.doi.org/10.1038/s41598-020-57739-8
_version_ 1783489688704647168
author Götz, Michael
Maier-Hein, Klaus H.
author_facet Götz, Michael
Maier-Hein, Klaus H.
author_sort Götz, Michael
collection PubMed
description Conducting side experiments termed robustness experiments, to identify features that are stable with respect to rescans, annotation, or other confounding effects is an important element in radiomics research. However, the matter of how to include the finding of these experiments into the model building process still needs to be explored. Three different methods for incorporating prior knowledge into a radiomics modelling process were evaluated: the naïve approach (ignoring feature quality), the most common approach consisting of removing unstable features, and a novel approach using data augmentation for information transfer (DAFIT). Multiple experiments were conducted using both synthetic and publicly available real lung imaging patient data. Ignoring additional information from side experiments resulted in significantly overestimated model performances meaning the estimated mean area under the curve achieved with a model was increased. Removing unstable features improved the performance estimation, while slightly decreasing the model performance, i.e. decreasing the area under curve achieved with the model. The proposed approach was superior both in terms of the estimation of the model performance and the actual model performance. Our experiments show that data augmentation can prevent biases in performance estimation and has several advantages over the plain omission of the unstable feature. The actual gain that can be obtained depends on the quality and applicability of the prior information on the features in the given domain. This will be an important topic of future research.
format Online
Article
Text
id pubmed-6971266
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-69712662020-01-27 Optimal Statistical Incorporation of Independent Feature Stability Information into Radiomics Studies Götz, Michael Maier-Hein, Klaus H. Sci Rep Article Conducting side experiments termed robustness experiments, to identify features that are stable with respect to rescans, annotation, or other confounding effects is an important element in radiomics research. However, the matter of how to include the finding of these experiments into the model building process still needs to be explored. Three different methods for incorporating prior knowledge into a radiomics modelling process were evaluated: the naïve approach (ignoring feature quality), the most common approach consisting of removing unstable features, and a novel approach using data augmentation for information transfer (DAFIT). Multiple experiments were conducted using both synthetic and publicly available real lung imaging patient data. Ignoring additional information from side experiments resulted in significantly overestimated model performances meaning the estimated mean area under the curve achieved with a model was increased. Removing unstable features improved the performance estimation, while slightly decreasing the model performance, i.e. decreasing the area under curve achieved with the model. The proposed approach was superior both in terms of the estimation of the model performance and the actual model performance. Our experiments show that data augmentation can prevent biases in performance estimation and has several advantages over the plain omission of the unstable feature. The actual gain that can be obtained depends on the quality and applicability of the prior information on the features in the given domain. This will be an important topic of future research. Nature Publishing Group UK 2020-01-20 /pmc/articles/PMC6971266/ /pubmed/31959832 http://dx.doi.org/10.1038/s41598-020-57739-8 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Götz, Michael
Maier-Hein, Klaus H.
Optimal Statistical Incorporation of Independent Feature Stability Information into Radiomics Studies
title Optimal Statistical Incorporation of Independent Feature Stability Information into Radiomics Studies
title_full Optimal Statistical Incorporation of Independent Feature Stability Information into Radiomics Studies
title_fullStr Optimal Statistical Incorporation of Independent Feature Stability Information into Radiomics Studies
title_full_unstemmed Optimal Statistical Incorporation of Independent Feature Stability Information into Radiomics Studies
title_short Optimal Statistical Incorporation of Independent Feature Stability Information into Radiomics Studies
title_sort optimal statistical incorporation of independent feature stability information into radiomics studies
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6971266/
https://www.ncbi.nlm.nih.gov/pubmed/31959832
http://dx.doi.org/10.1038/s41598-020-57739-8
work_keys_str_mv AT gotzmichael optimalstatisticalincorporationofindependentfeaturestabilityinformationintoradiomicsstudies
AT maierheinklaush optimalstatisticalincorporationofindependentfeaturestabilityinformationintoradiomicsstudies