Cargando…
Impact of feature selection methods and subgroup factors on prognostic analysis with CT-based radiomics in non-small cell lung cancer patients
BACKGROUND: Radiomics is a new technology to noninvasively predict survival prognosis with quantitative features extracted from medical images. Most radiomics-based prognostic studies of non-small-cell lung cancer (NSCLC) patients have used mixed datasets of different subgroups. Therefore, we invest...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8086112/ https://www.ncbi.nlm.nih.gov/pubmed/33931085 http://dx.doi.org/10.1186/s13014-021-01810-9 |
_version_ | 1783686460143042560 |
---|---|
author | Sugai, Yuto Kadoya, Noriyuki Tanaka, Shohei Tanabe, Shunpei Umeda, Mariko Yamamoto, Takaya Takeda, Kazuya Dobashi, Suguru Ohashi, Haruna Takeda, Ken Jingu, Keiichi |
author_facet | Sugai, Yuto Kadoya, Noriyuki Tanaka, Shohei Tanabe, Shunpei Umeda, Mariko Yamamoto, Takaya Takeda, Kazuya Dobashi, Suguru Ohashi, Haruna Takeda, Ken Jingu, Keiichi |
author_sort | Sugai, Yuto |
collection | PubMed |
description | BACKGROUND: Radiomics is a new technology to noninvasively predict survival prognosis with quantitative features extracted from medical images. Most radiomics-based prognostic studies of non-small-cell lung cancer (NSCLC) patients have used mixed datasets of different subgroups. Therefore, we investigated the radiomics-based survival prediction of NSCLC patients by focusing on subgroups with identical characteristics. METHODS: A total of 304 NSCLC (Stages I–IV) patients treated with radiotherapy in our hospital were used. We extracted 107 radiomic features (i.e., 14 shape features, 18 first-order statistical features, and 75 texture features) from the gross tumor volume drawn on the free breathing planning computed tomography image. Three feature selection methods [i.e., test–retest and multiple segmentation (FS1), Pearson's correlation analysis (FS2), and a method that combined FS1 and FS2 (FS3)] were used to clarify how they affect survival prediction performance. Subgroup analysis for each histological subtype and each T stage applied the best selection method for the analysis of All data. We used a least absolute shrinkage and selection operator Cox regression model for all analyses and evaluated prognostic performance using the concordance-index (C-index) and the Kaplan–Meier method. For subgroup analysis, fivefold cross-validation was applied to ensure model reliability. RESULTS: In the analysis of All data, the C-index for the test dataset is 0.62 (FS1), 0.63 (FS2), and 0.62 (FS3). The subgroup analysis indicated that the prediction model based on specific histological subtypes and T stages had a higher C-index for the test dataset than that based on All data (All data, 0.64 vs. SCC(all), 060; ADC(all), 0.69; T1, 0.68; T2, 0.65; T3, 0.66; T4, 0.70). In addition, the prediction models unified for each T stage in histological subtype showed a different trend in the C-index for the test dataset between ADC-related and SCC-related models (ADC(T1)–ADC(T4), 0.72–0.83; SCC(T1)–SCC(T4), 0.58–0.71). CONCLUSIONS: Our results showed that feature selection methods moderately affected the survival prediction performance. In addition, prediction models based on specific subgroups may improve the prediction performance. These results may prove useful for determining the optimal radiomics-based predication model. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13014-021-01810-9. |
format | Online Article Text |
id | pubmed-8086112 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-80861122021-04-30 Impact of feature selection methods and subgroup factors on prognostic analysis with CT-based radiomics in non-small cell lung cancer patients Sugai, Yuto Kadoya, Noriyuki Tanaka, Shohei Tanabe, Shunpei Umeda, Mariko Yamamoto, Takaya Takeda, Kazuya Dobashi, Suguru Ohashi, Haruna Takeda, Ken Jingu, Keiichi Radiat Oncol Research BACKGROUND: Radiomics is a new technology to noninvasively predict survival prognosis with quantitative features extracted from medical images. Most radiomics-based prognostic studies of non-small-cell lung cancer (NSCLC) patients have used mixed datasets of different subgroups. Therefore, we investigated the radiomics-based survival prediction of NSCLC patients by focusing on subgroups with identical characteristics. METHODS: A total of 304 NSCLC (Stages I–IV) patients treated with radiotherapy in our hospital were used. We extracted 107 radiomic features (i.e., 14 shape features, 18 first-order statistical features, and 75 texture features) from the gross tumor volume drawn on the free breathing planning computed tomography image. Three feature selection methods [i.e., test–retest and multiple segmentation (FS1), Pearson's correlation analysis (FS2), and a method that combined FS1 and FS2 (FS3)] were used to clarify how they affect survival prediction performance. Subgroup analysis for each histological subtype and each T stage applied the best selection method for the analysis of All data. We used a least absolute shrinkage and selection operator Cox regression model for all analyses and evaluated prognostic performance using the concordance-index (C-index) and the Kaplan–Meier method. For subgroup analysis, fivefold cross-validation was applied to ensure model reliability. RESULTS: In the analysis of All data, the C-index for the test dataset is 0.62 (FS1), 0.63 (FS2), and 0.62 (FS3). The subgroup analysis indicated that the prediction model based on specific histological subtypes and T stages had a higher C-index for the test dataset than that based on All data (All data, 0.64 vs. SCC(all), 060; ADC(all), 0.69; T1, 0.68; T2, 0.65; T3, 0.66; T4, 0.70). In addition, the prediction models unified for each T stage in histological subtype showed a different trend in the C-index for the test dataset between ADC-related and SCC-related models (ADC(T1)–ADC(T4), 0.72–0.83; SCC(T1)–SCC(T4), 0.58–0.71). CONCLUSIONS: Our results showed that feature selection methods moderately affected the survival prediction performance. In addition, prediction models based on specific subgroups may improve the prediction performance. These results may prove useful for determining the optimal radiomics-based predication model. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13014-021-01810-9. BioMed Central 2021-04-30 /pmc/articles/PMC8086112/ /pubmed/33931085 http://dx.doi.org/10.1186/s13014-021-01810-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Sugai, Yuto Kadoya, Noriyuki Tanaka, Shohei Tanabe, Shunpei Umeda, Mariko Yamamoto, Takaya Takeda, Kazuya Dobashi, Suguru Ohashi, Haruna Takeda, Ken Jingu, Keiichi Impact of feature selection methods and subgroup factors on prognostic analysis with CT-based radiomics in non-small cell lung cancer patients |
title | Impact of feature selection methods and subgroup factors on prognostic analysis with CT-based radiomics in non-small cell lung cancer patients |
title_full | Impact of feature selection methods and subgroup factors on prognostic analysis with CT-based radiomics in non-small cell lung cancer patients |
title_fullStr | Impact of feature selection methods and subgroup factors on prognostic analysis with CT-based radiomics in non-small cell lung cancer patients |
title_full_unstemmed | Impact of feature selection methods and subgroup factors on prognostic analysis with CT-based radiomics in non-small cell lung cancer patients |
title_short | Impact of feature selection methods and subgroup factors on prognostic analysis with CT-based radiomics in non-small cell lung cancer patients |
title_sort | impact of feature selection methods and subgroup factors on prognostic analysis with ct-based radiomics in non-small cell lung cancer patients |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8086112/ https://www.ncbi.nlm.nih.gov/pubmed/33931085 http://dx.doi.org/10.1186/s13014-021-01810-9 |
work_keys_str_mv | AT sugaiyuto impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients AT kadoyanoriyuki impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients AT tanakashohei impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients AT tanabeshunpei impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients AT umedamariko impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients AT yamamototakaya impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients AT takedakazuya impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients AT dobashisuguru impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients AT ohashiharuna impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients AT takedaken impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients AT jingukeiichi impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients |