Cargando…

Clinical impact of variability on CT radiomics and suggestions for suitable feature selection: a focus on lung cancer

BACKGROUND: Radiomics suffers from feature reproducibility. We studied the variability of radiomics features and the relationship of radiomics features with tumor size and shape to determine guidelines for optimal radiomics study. METHODS: We dealt with 260 lung nodules (180 for training, 80 for tes...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Seung-Hak, Cho, Hwan-ho, Lee, Ho Yun, Park, Hyunjin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6660971/
https://www.ncbi.nlm.nih.gov/pubmed/31349872
http://dx.doi.org/10.1186/s40644-019-0239-z
_version_ 1783439394261172224
author Lee, Seung-Hak
Cho, Hwan-ho
Lee, Ho Yun
Park, Hyunjin
author_facet Lee, Seung-Hak
Cho, Hwan-ho
Lee, Ho Yun
Park, Hyunjin
author_sort Lee, Seung-Hak
collection PubMed
description BACKGROUND: Radiomics suffers from feature reproducibility. We studied the variability of radiomics features and the relationship of radiomics features with tumor size and shape to determine guidelines for optimal radiomics study. METHODS: We dealt with 260 lung nodules (180 for training, 80 for testing) limited to 2 cm or less. We quantified how voxel geometry (isotropic/anisotropic) and the number of histogram bins, factors commonly adjusted in multi-center studies, affect reproducibility. First, features showing high reproducibility between the original and isotropic transformed voxel settings were identified. Second, features showing high reproducibility in various binning settings were identified. Two hundred fifty-two features were computed and features with high intra-correlation coefficient were selected. Features that explained nodule status (benign/malignant) were retained using the least absolute shrinkage selector operator. Common features among different settings were identified, and the final features showing high reproducibility correlated with nodule status were identified. The identified features were used for the random forest classifier to validate the effectiveness of the features. The properties of the uncalculated feature were inspected to suggest a tentative guideline for radiomics studies. RESULTS: Nine features showing high reproducibility for both the original and isotropic voxel settings were selected and used to classify nodule status (AUC 0.659–0.697). Five features showing high reproducibility among different binning settings were selected and used in classification (AUC 0.729–0.748). Some texture features are likely to be successfully computed if a nodule was larger than 1000 mm(3). CONCLUSIONS: Features showing high reproducibility among different settings correlated with nodule status were identified. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s40644-019-0239-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6660971
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-66609712019-08-01 Clinical impact of variability on CT radiomics and suggestions for suitable feature selection: a focus on lung cancer Lee, Seung-Hak Cho, Hwan-ho Lee, Ho Yun Park, Hyunjin Cancer Imaging Research Article BACKGROUND: Radiomics suffers from feature reproducibility. We studied the variability of radiomics features and the relationship of radiomics features with tumor size and shape to determine guidelines for optimal radiomics study. METHODS: We dealt with 260 lung nodules (180 for training, 80 for testing) limited to 2 cm or less. We quantified how voxel geometry (isotropic/anisotropic) and the number of histogram bins, factors commonly adjusted in multi-center studies, affect reproducibility. First, features showing high reproducibility between the original and isotropic transformed voxel settings were identified. Second, features showing high reproducibility in various binning settings were identified. Two hundred fifty-two features were computed and features with high intra-correlation coefficient were selected. Features that explained nodule status (benign/malignant) were retained using the least absolute shrinkage selector operator. Common features among different settings were identified, and the final features showing high reproducibility correlated with nodule status were identified. The identified features were used for the random forest classifier to validate the effectiveness of the features. The properties of the uncalculated feature were inspected to suggest a tentative guideline for radiomics studies. RESULTS: Nine features showing high reproducibility for both the original and isotropic voxel settings were selected and used to classify nodule status (AUC 0.659–0.697). Five features showing high reproducibility among different binning settings were selected and used in classification (AUC 0.729–0.748). Some texture features are likely to be successfully computed if a nodule was larger than 1000 mm(3). CONCLUSIONS: Features showing high reproducibility among different settings correlated with nodule status were identified. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s40644-019-0239-z) contains supplementary material, which is available to authorized users. BioMed Central 2019-07-26 /pmc/articles/PMC6660971/ /pubmed/31349872 http://dx.doi.org/10.1186/s40644-019-0239-z Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Lee, Seung-Hak
Cho, Hwan-ho
Lee, Ho Yun
Park, Hyunjin
Clinical impact of variability on CT radiomics and suggestions for suitable feature selection: a focus on lung cancer
title Clinical impact of variability on CT radiomics and suggestions for suitable feature selection: a focus on lung cancer
title_full Clinical impact of variability on CT radiomics and suggestions for suitable feature selection: a focus on lung cancer
title_fullStr Clinical impact of variability on CT radiomics and suggestions for suitable feature selection: a focus on lung cancer
title_full_unstemmed Clinical impact of variability on CT radiomics and suggestions for suitable feature selection: a focus on lung cancer
title_short Clinical impact of variability on CT radiomics and suggestions for suitable feature selection: a focus on lung cancer
title_sort clinical impact of variability on ct radiomics and suggestions for suitable feature selection: a focus on lung cancer
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6660971/
https://www.ncbi.nlm.nih.gov/pubmed/31349872
http://dx.doi.org/10.1186/s40644-019-0239-z
work_keys_str_mv AT leeseunghak clinicalimpactofvariabilityonctradiomicsandsuggestionsforsuitablefeatureselectionafocusonlungcancer
AT chohwanho clinicalimpactofvariabilityonctradiomicsandsuggestionsforsuitablefeatureselectionafocusonlungcancer
AT leehoyun clinicalimpactofvariabilityonctradiomicsandsuggestionsforsuitablefeatureselectionafocusonlungcancer
AT parkhyunjin clinicalimpactofvariabilityonctradiomicsandsuggestionsforsuitablefeatureselectionafocusonlungcancer