Cargando…
Radiomics Prediction of EGFR Status in Lung Cancer—Our Experience in Using Multiple Feature Extractors and The Cancer Imaging Archive Data
We investigated the performance of multiple radiomics feature extractors/software on predicting epidermal growth factor receptor mutation status in 228 patients with non–small cell lung cancer from publicly available data sets in The Cancer Imaging Archive. The imaging and clinical data were split i...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Grapho Publications, LLC
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7289249/ https://www.ncbi.nlm.nih.gov/pubmed/32548300 http://dx.doi.org/10.18383/j.tom.2020.00017 |
_version_ | 1783545424845471744 |
---|---|
author | Lu, Lin Sun, Shawn H. Yang, Hao E, Linning Guo, Pingzhen Schwartz, Lawrence H. Zhao, Binsheng |
author_facet | Lu, Lin Sun, Shawn H. Yang, Hao E, Linning Guo, Pingzhen Schwartz, Lawrence H. Zhao, Binsheng |
author_sort | Lu, Lin |
collection | PubMed |
description | We investigated the performance of multiple radiomics feature extractors/software on predicting epidermal growth factor receptor mutation status in 228 patients with non–small cell lung cancer from publicly available data sets in The Cancer Imaging Archive. The imaging and clinical data were split into training (n = 105) and validation cohorts (n = 123). Two of the most cited open-source feature extractors, IBEX (1563 features) and Pyradiomics (1319 features), and our in-house software, Columbia Image Feature Extractor (CIFE) (1160 features), were used to extract radiomics features. Univariate and multivariate analyses were performed sequentially to predict EGFR mutation status using each individual feature extractor. Our univariate analysis integrated an unsupervised clustering method to identify nonredundant and informative candidate features for the creation of prediction models by multivariate analyses. In training, unsupervised clustering-based univariate analysis identified 5, 6, and 4 features from IBEX, Pyradiomics, and CIFE as candidate features, respectively. Multivariate prediction models using these features from IBEX, Pyradiomics, and CIFE yielded similar areas under the receiver operating characteristic curve of 0.68, 0.67, and 0.69. However, in validation, areas under the receiver operating characteristic curve of multivariate prediction models from IBEX, Pyradiomics, and CIFE decreased to 0.54, 0.56 and 0.64, respectively. Different feature extractors select different radiomics features, which leads to prediction models with varying performance. However, correlation between those selected features from different extractors may indicate these features measure similar imaging phenotypes associated with similar biological characteristics. Overall, attention should be paid to the generalizability of individual radiomics features and radiomics prediction models. |
format | Online Article Text |
id | pubmed-7289249 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Grapho Publications, LLC |
record_format | MEDLINE/PubMed |
spelling | pubmed-72892492020-06-15 Radiomics Prediction of EGFR Status in Lung Cancer—Our Experience in Using Multiple Feature Extractors and The Cancer Imaging Archive Data Lu, Lin Sun, Shawn H. Yang, Hao E, Linning Guo, Pingzhen Schwartz, Lawrence H. Zhao, Binsheng Tomography Research Articles We investigated the performance of multiple radiomics feature extractors/software on predicting epidermal growth factor receptor mutation status in 228 patients with non–small cell lung cancer from publicly available data sets in The Cancer Imaging Archive. The imaging and clinical data were split into training (n = 105) and validation cohorts (n = 123). Two of the most cited open-source feature extractors, IBEX (1563 features) and Pyradiomics (1319 features), and our in-house software, Columbia Image Feature Extractor (CIFE) (1160 features), were used to extract radiomics features. Univariate and multivariate analyses were performed sequentially to predict EGFR mutation status using each individual feature extractor. Our univariate analysis integrated an unsupervised clustering method to identify nonredundant and informative candidate features for the creation of prediction models by multivariate analyses. In training, unsupervised clustering-based univariate analysis identified 5, 6, and 4 features from IBEX, Pyradiomics, and CIFE as candidate features, respectively. Multivariate prediction models using these features from IBEX, Pyradiomics, and CIFE yielded similar areas under the receiver operating characteristic curve of 0.68, 0.67, and 0.69. However, in validation, areas under the receiver operating characteristic curve of multivariate prediction models from IBEX, Pyradiomics, and CIFE decreased to 0.54, 0.56 and 0.64, respectively. Different feature extractors select different radiomics features, which leads to prediction models with varying performance. However, correlation between those selected features from different extractors may indicate these features measure similar imaging phenotypes associated with similar biological characteristics. Overall, attention should be paid to the generalizability of individual radiomics features and radiomics prediction models. Grapho Publications, LLC 2020-06 /pmc/articles/PMC7289249/ /pubmed/32548300 http://dx.doi.org/10.18383/j.tom.2020.00017 Text en © 2020 The Authors. Published by Grapho Publications, LLC http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Articles Lu, Lin Sun, Shawn H. Yang, Hao E, Linning Guo, Pingzhen Schwartz, Lawrence H. Zhao, Binsheng Radiomics Prediction of EGFR Status in Lung Cancer—Our Experience in Using Multiple Feature Extractors and The Cancer Imaging Archive Data |
title | Radiomics Prediction of EGFR Status in Lung Cancer—Our Experience in Using Multiple Feature Extractors and The Cancer Imaging Archive Data |
title_full | Radiomics Prediction of EGFR Status in Lung Cancer—Our Experience in Using Multiple Feature Extractors and The Cancer Imaging Archive Data |
title_fullStr | Radiomics Prediction of EGFR Status in Lung Cancer—Our Experience in Using Multiple Feature Extractors and The Cancer Imaging Archive Data |
title_full_unstemmed | Radiomics Prediction of EGFR Status in Lung Cancer—Our Experience in Using Multiple Feature Extractors and The Cancer Imaging Archive Data |
title_short | Radiomics Prediction of EGFR Status in Lung Cancer—Our Experience in Using Multiple Feature Extractors and The Cancer Imaging Archive Data |
title_sort | radiomics prediction of egfr status in lung cancer—our experience in using multiple feature extractors and the cancer imaging archive data |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7289249/ https://www.ncbi.nlm.nih.gov/pubmed/32548300 http://dx.doi.org/10.18383/j.tom.2020.00017 |
work_keys_str_mv | AT lulin radiomicspredictionofegfrstatusinlungcancerourexperienceinusingmultiplefeatureextractorsandthecancerimagingarchivedata AT sunshawnh radiomicspredictionofegfrstatusinlungcancerourexperienceinusingmultiplefeatureextractorsandthecancerimagingarchivedata AT yanghao radiomicspredictionofegfrstatusinlungcancerourexperienceinusingmultiplefeatureextractorsandthecancerimagingarchivedata AT elinning radiomicspredictionofegfrstatusinlungcancerourexperienceinusingmultiplefeatureextractorsandthecancerimagingarchivedata AT guopingzhen radiomicspredictionofegfrstatusinlungcancerourexperienceinusingmultiplefeatureextractorsandthecancerimagingarchivedata AT schwartzlawrenceh radiomicspredictionofegfrstatusinlungcancerourexperienceinusingmultiplefeatureextractorsandthecancerimagingarchivedata AT zhaobinsheng radiomicspredictionofegfrstatusinlungcancerourexperienceinusingmultiplefeatureextractorsandthecancerimagingarchivedata |