Cargando…

Radiomics Prediction of EGFR Status in Lung Cancer—Our Experience in Using Multiple Feature Extractors and The Cancer Imaging Archive Data

We investigated the performance of multiple radiomics feature extractors/software on predicting epidermal growth factor receptor mutation status in 228 patients with non–small cell lung cancer from publicly available data sets in The Cancer Imaging Archive. The imaging and clinical data were split i...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Lin, Sun, Shawn H., Yang, Hao, E, Linning, Guo, Pingzhen, Schwartz, Lawrence H., Zhao, Binsheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Grapho Publications, LLC 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7289249/
https://www.ncbi.nlm.nih.gov/pubmed/32548300
http://dx.doi.org/10.18383/j.tom.2020.00017
_version_ 1783545424845471744
author Lu, Lin
Sun, Shawn H.
Yang, Hao
E, Linning
Guo, Pingzhen
Schwartz, Lawrence H.
Zhao, Binsheng
author_facet Lu, Lin
Sun, Shawn H.
Yang, Hao
E, Linning
Guo, Pingzhen
Schwartz, Lawrence H.
Zhao, Binsheng
author_sort Lu, Lin
collection PubMed
description We investigated the performance of multiple radiomics feature extractors/software on predicting epidermal growth factor receptor mutation status in 228 patients with non–small cell lung cancer from publicly available data sets in The Cancer Imaging Archive. The imaging and clinical data were split into training (n = 105) and validation cohorts (n = 123). Two of the most cited open-source feature extractors, IBEX (1563 features) and Pyradiomics (1319 features), and our in-house software, Columbia Image Feature Extractor (CIFE) (1160 features), were used to extract radiomics features. Univariate and multivariate analyses were performed sequentially to predict EGFR mutation status using each individual feature extractor. Our univariate analysis integrated an unsupervised clustering method to identify nonredundant and informative candidate features for the creation of prediction models by multivariate analyses. In training, unsupervised clustering-based univariate analysis identified 5, 6, and 4 features from IBEX, Pyradiomics, and CIFE as candidate features, respectively. Multivariate prediction models using these features from IBEX, Pyradiomics, and CIFE yielded similar areas under the receiver operating characteristic curve of 0.68, 0.67, and 0.69. However, in validation, areas under the receiver operating characteristic curve of multivariate prediction models from IBEX, Pyradiomics, and CIFE decreased to 0.54, 0.56 and 0.64, respectively. Different feature extractors select different radiomics features, which leads to prediction models with varying performance. However, correlation between those selected features from different extractors may indicate these features measure similar imaging phenotypes associated with similar biological characteristics. Overall, attention should be paid to the generalizability of individual radiomics features and radiomics prediction models.
format Online
Article
Text
id pubmed-7289249
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Grapho Publications, LLC
record_format MEDLINE/PubMed
spelling pubmed-72892492020-06-15 Radiomics Prediction of EGFR Status in Lung Cancer—Our Experience in Using Multiple Feature Extractors and The Cancer Imaging Archive Data Lu, Lin Sun, Shawn H. Yang, Hao E, Linning Guo, Pingzhen Schwartz, Lawrence H. Zhao, Binsheng Tomography Research Articles We investigated the performance of multiple radiomics feature extractors/software on predicting epidermal growth factor receptor mutation status in 228 patients with non–small cell lung cancer from publicly available data sets in The Cancer Imaging Archive. The imaging and clinical data were split into training (n = 105) and validation cohorts (n = 123). Two of the most cited open-source feature extractors, IBEX (1563 features) and Pyradiomics (1319 features), and our in-house software, Columbia Image Feature Extractor (CIFE) (1160 features), were used to extract radiomics features. Univariate and multivariate analyses were performed sequentially to predict EGFR mutation status using each individual feature extractor. Our univariate analysis integrated an unsupervised clustering method to identify nonredundant and informative candidate features for the creation of prediction models by multivariate analyses. In training, unsupervised clustering-based univariate analysis identified 5, 6, and 4 features from IBEX, Pyradiomics, and CIFE as candidate features, respectively. Multivariate prediction models using these features from IBEX, Pyradiomics, and CIFE yielded similar areas under the receiver operating characteristic curve of 0.68, 0.67, and 0.69. However, in validation, areas under the receiver operating characteristic curve of multivariate prediction models from IBEX, Pyradiomics, and CIFE decreased to 0.54, 0.56 and 0.64, respectively. Different feature extractors select different radiomics features, which leads to prediction models with varying performance. However, correlation between those selected features from different extractors may indicate these features measure similar imaging phenotypes associated with similar biological characteristics. Overall, attention should be paid to the generalizability of individual radiomics features and radiomics prediction models. Grapho Publications, LLC 2020-06 /pmc/articles/PMC7289249/ /pubmed/32548300 http://dx.doi.org/10.18383/j.tom.2020.00017 Text en © 2020 The Authors. Published by Grapho Publications, LLC http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Articles
Lu, Lin
Sun, Shawn H.
Yang, Hao
E, Linning
Guo, Pingzhen
Schwartz, Lawrence H.
Zhao, Binsheng
Radiomics Prediction of EGFR Status in Lung Cancer—Our Experience in Using Multiple Feature Extractors and The Cancer Imaging Archive Data
title Radiomics Prediction of EGFR Status in Lung Cancer—Our Experience in Using Multiple Feature Extractors and The Cancer Imaging Archive Data
title_full Radiomics Prediction of EGFR Status in Lung Cancer—Our Experience in Using Multiple Feature Extractors and The Cancer Imaging Archive Data
title_fullStr Radiomics Prediction of EGFR Status in Lung Cancer—Our Experience in Using Multiple Feature Extractors and The Cancer Imaging Archive Data
title_full_unstemmed Radiomics Prediction of EGFR Status in Lung Cancer—Our Experience in Using Multiple Feature Extractors and The Cancer Imaging Archive Data
title_short Radiomics Prediction of EGFR Status in Lung Cancer—Our Experience in Using Multiple Feature Extractors and The Cancer Imaging Archive Data
title_sort radiomics prediction of egfr status in lung cancer—our experience in using multiple feature extractors and the cancer imaging archive data
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7289249/
https://www.ncbi.nlm.nih.gov/pubmed/32548300
http://dx.doi.org/10.18383/j.tom.2020.00017
work_keys_str_mv AT lulin radiomicspredictionofegfrstatusinlungcancerourexperienceinusingmultiplefeatureextractorsandthecancerimagingarchivedata
AT sunshawnh radiomicspredictionofegfrstatusinlungcancerourexperienceinusingmultiplefeatureextractorsandthecancerimagingarchivedata
AT yanghao radiomicspredictionofegfrstatusinlungcancerourexperienceinusingmultiplefeatureextractorsandthecancerimagingarchivedata
AT elinning radiomicspredictionofegfrstatusinlungcancerourexperienceinusingmultiplefeatureextractorsandthecancerimagingarchivedata
AT guopingzhen radiomicspredictionofegfrstatusinlungcancerourexperienceinusingmultiplefeatureextractorsandthecancerimagingarchivedata
AT schwartzlawrenceh radiomicspredictionofegfrstatusinlungcancerourexperienceinusingmultiplefeatureextractorsandthecancerimagingarchivedata
AT zhaobinsheng radiomicspredictionofegfrstatusinlungcancerourexperienceinusingmultiplefeatureextractorsandthecancerimagingarchivedata