Cargando…

Impact of feature selection methods and subgroup factors on prognostic analysis with CT-based radiomics in non-small cell lung cancer patients

BACKGROUND: Radiomics is a new technology to noninvasively predict survival prognosis with quantitative features extracted from medical images. Most radiomics-based prognostic studies of non-small-cell lung cancer (NSCLC) patients have used mixed datasets of different subgroups. Therefore, we invest...

Descripción completa

Detalles Bibliográficos
Autores principales: Sugai, Yuto, Kadoya, Noriyuki, Tanaka, Shohei, Tanabe, Shunpei, Umeda, Mariko, Yamamoto, Takaya, Takeda, Kazuya, Dobashi, Suguru, Ohashi, Haruna, Takeda, Ken, Jingu, Keiichi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8086112/
https://www.ncbi.nlm.nih.gov/pubmed/33931085
http://dx.doi.org/10.1186/s13014-021-01810-9
_version_ 1783686460143042560
author Sugai, Yuto
Kadoya, Noriyuki
Tanaka, Shohei
Tanabe, Shunpei
Umeda, Mariko
Yamamoto, Takaya
Takeda, Kazuya
Dobashi, Suguru
Ohashi, Haruna
Takeda, Ken
Jingu, Keiichi
author_facet Sugai, Yuto
Kadoya, Noriyuki
Tanaka, Shohei
Tanabe, Shunpei
Umeda, Mariko
Yamamoto, Takaya
Takeda, Kazuya
Dobashi, Suguru
Ohashi, Haruna
Takeda, Ken
Jingu, Keiichi
author_sort Sugai, Yuto
collection PubMed
description BACKGROUND: Radiomics is a new technology to noninvasively predict survival prognosis with quantitative features extracted from medical images. Most radiomics-based prognostic studies of non-small-cell lung cancer (NSCLC) patients have used mixed datasets of different subgroups. Therefore, we investigated the radiomics-based survival prediction of NSCLC patients by focusing on subgroups with identical characteristics. METHODS: A total of 304 NSCLC (Stages I–IV) patients treated with radiotherapy in our hospital were used. We extracted 107 radiomic features (i.e., 14 shape features, 18 first-order statistical features, and 75 texture features) from the gross tumor volume drawn on the free breathing planning computed tomography image. Three feature selection methods [i.e., test–retest and multiple segmentation (FS1), Pearson's correlation analysis (FS2), and a method that combined FS1 and FS2 (FS3)] were used to clarify how they affect survival prediction performance. Subgroup analysis for each histological subtype and each T stage applied the best selection method for the analysis of All data. We used a least absolute shrinkage and selection operator Cox regression model for all analyses and evaluated prognostic performance using the concordance-index (C-index) and the Kaplan–Meier method. For subgroup analysis, fivefold cross-validation was applied to ensure model reliability. RESULTS: In the analysis of All data, the C-index for the test dataset is 0.62 (FS1), 0.63 (FS2), and 0.62 (FS3). The subgroup analysis indicated that the prediction model based on specific histological subtypes and T stages had a higher C-index for the test dataset than that based on All data (All data, 0.64 vs. SCC(all), 060; ADC(all), 0.69; T1, 0.68; T2, 0.65; T3, 0.66; T4, 0.70). In addition, the prediction models unified for each T stage in histological subtype showed a different trend in the C-index for the test dataset between ADC-related and SCC-related models (ADC(T1)–ADC(T4), 0.72–0.83; SCC(T1)–SCC(T4), 0.58–0.71). CONCLUSIONS: Our results showed that feature selection methods moderately affected the survival prediction performance. In addition, prediction models based on specific subgroups may improve the prediction performance. These results may prove useful for determining the optimal radiomics-based predication model. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13014-021-01810-9.
format Online
Article
Text
id pubmed-8086112
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-80861122021-04-30 Impact of feature selection methods and subgroup factors on prognostic analysis with CT-based radiomics in non-small cell lung cancer patients Sugai, Yuto Kadoya, Noriyuki Tanaka, Shohei Tanabe, Shunpei Umeda, Mariko Yamamoto, Takaya Takeda, Kazuya Dobashi, Suguru Ohashi, Haruna Takeda, Ken Jingu, Keiichi Radiat Oncol Research BACKGROUND: Radiomics is a new technology to noninvasively predict survival prognosis with quantitative features extracted from medical images. Most radiomics-based prognostic studies of non-small-cell lung cancer (NSCLC) patients have used mixed datasets of different subgroups. Therefore, we investigated the radiomics-based survival prediction of NSCLC patients by focusing on subgroups with identical characteristics. METHODS: A total of 304 NSCLC (Stages I–IV) patients treated with radiotherapy in our hospital were used. We extracted 107 radiomic features (i.e., 14 shape features, 18 first-order statistical features, and 75 texture features) from the gross tumor volume drawn on the free breathing planning computed tomography image. Three feature selection methods [i.e., test–retest and multiple segmentation (FS1), Pearson's correlation analysis (FS2), and a method that combined FS1 and FS2 (FS3)] were used to clarify how they affect survival prediction performance. Subgroup analysis for each histological subtype and each T stage applied the best selection method for the analysis of All data. We used a least absolute shrinkage and selection operator Cox regression model for all analyses and evaluated prognostic performance using the concordance-index (C-index) and the Kaplan–Meier method. For subgroup analysis, fivefold cross-validation was applied to ensure model reliability. RESULTS: In the analysis of All data, the C-index for the test dataset is 0.62 (FS1), 0.63 (FS2), and 0.62 (FS3). The subgroup analysis indicated that the prediction model based on specific histological subtypes and T stages had a higher C-index for the test dataset than that based on All data (All data, 0.64 vs. SCC(all), 060; ADC(all), 0.69; T1, 0.68; T2, 0.65; T3, 0.66; T4, 0.70). In addition, the prediction models unified for each T stage in histological subtype showed a different trend in the C-index for the test dataset between ADC-related and SCC-related models (ADC(T1)–ADC(T4), 0.72–0.83; SCC(T1)–SCC(T4), 0.58–0.71). CONCLUSIONS: Our results showed that feature selection methods moderately affected the survival prediction performance. In addition, prediction models based on specific subgroups may improve the prediction performance. These results may prove useful for determining the optimal radiomics-based predication model. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13014-021-01810-9. BioMed Central 2021-04-30 /pmc/articles/PMC8086112/ /pubmed/33931085 http://dx.doi.org/10.1186/s13014-021-01810-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Sugai, Yuto
Kadoya, Noriyuki
Tanaka, Shohei
Tanabe, Shunpei
Umeda, Mariko
Yamamoto, Takaya
Takeda, Kazuya
Dobashi, Suguru
Ohashi, Haruna
Takeda, Ken
Jingu, Keiichi
Impact of feature selection methods and subgroup factors on prognostic analysis with CT-based radiomics in non-small cell lung cancer patients
title Impact of feature selection methods and subgroup factors on prognostic analysis with CT-based radiomics in non-small cell lung cancer patients
title_full Impact of feature selection methods and subgroup factors on prognostic analysis with CT-based radiomics in non-small cell lung cancer patients
title_fullStr Impact of feature selection methods and subgroup factors on prognostic analysis with CT-based radiomics in non-small cell lung cancer patients
title_full_unstemmed Impact of feature selection methods and subgroup factors on prognostic analysis with CT-based radiomics in non-small cell lung cancer patients
title_short Impact of feature selection methods and subgroup factors on prognostic analysis with CT-based radiomics in non-small cell lung cancer patients
title_sort impact of feature selection methods and subgroup factors on prognostic analysis with ct-based radiomics in non-small cell lung cancer patients
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8086112/
https://www.ncbi.nlm.nih.gov/pubmed/33931085
http://dx.doi.org/10.1186/s13014-021-01810-9
work_keys_str_mv AT sugaiyuto impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients
AT kadoyanoriyuki impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients
AT tanakashohei impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients
AT tanabeshunpei impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients
AT umedamariko impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients
AT yamamototakaya impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients
AT takedakazuya impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients
AT dobashisuguru impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients
AT ohashiharuna impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients
AT takedaken impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients
AT jingukeiichi impactoffeatureselectionmethodsandsubgroupfactorsonprognosticanalysiswithctbasedradiomicsinnonsmallcelllungcancerpatients