Cargando…
Imbalanced Data Correction Based PET/CT Radiomics Model for Predicting Lymph Node Metastasis in Clinical Stage T1 Lung Adenocarcinoma
OBJECTIVES: To develop and validate the imbalanced data correction based PET/CT radiomics model for predicting lymph node metastasis (LNM) in clinical stage T1 lung adenocarcinoma (LUAD). METHODS: A total of 183 patients (148/35 non-metastasis/LNM) with pathologically confirmed LUAD were retrospecti...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8831550/ https://www.ncbi.nlm.nih.gov/pubmed/35155231 http://dx.doi.org/10.3389/fonc.2022.788968 |
_version_ | 1784648529655889920 |
---|---|
author | Lv, Jieqin Chen, Xiaohui Liu, Xinran Du, Dongyang Lv, Wenbing Lu, Lijun Wu, Hubing |
author_facet | Lv, Jieqin Chen, Xiaohui Liu, Xinran Du, Dongyang Lv, Wenbing Lu, Lijun Wu, Hubing |
author_sort | Lv, Jieqin |
collection | PubMed |
description | OBJECTIVES: To develop and validate the imbalanced data correction based PET/CT radiomics model for predicting lymph node metastasis (LNM) in clinical stage T1 lung adenocarcinoma (LUAD). METHODS: A total of 183 patients (148/35 non-metastasis/LNM) with pathologically confirmed LUAD were retrospectively included. The cohorts were divided into training vs. validation cohort in a ratio of 7:3. A total of 487 radiomics features were extracted from PET and CT components separately for radiomics model construction. Four clinical features and seven PET/CT radiological features were extracted for traditional model construction. To balance the distribution of majority (non-metastasis) class and minority (LNM) class, the imbalance-adjustment strategies using ten data re-sampling methods were adopted. Three multivariate models (denoted as Traditional, Radiomics, and Combined) were constructed using multivariable logistic regression analysis, where the combined model incorporated all of the significant clinical, radiological, and radiomics features. One hundred times repeated Monte Carlo cross-validation was used to assess the application order of feature selection and imbalance-adjustment strategies in the machine learning pipeline. Prediction performance of each model was evaluated using the area under the receiver operating characteristic curve (AUC) and Geometric mean score (G-mean). RESULTS: A total of 2 clinical parameters, 2 radiological features, 3 PET, and 5 CT radiomics features were significantly associated with LNM. The combined model with Edited Nearest Neighbors (ENN) re-sampling methods showed strong prediction performance than traditional model or radiomics model with the AUC of 0.94 (95%CI = 0.86–0.97) vs. 0.89 (95%CI = 0.79–0.93), 0.92 (95%CI = 0.85–0.97), and G-mean of 0.88 vs. 0.82, 0.80 in the training cohort, and the AUC of 0.75 (95%CI = 0.57–0.91) vs. 0.68 (95%CI = 0.36–0.83), 0.71 (95%CI = 0.48–0.83) and G-mean of 0.76 vs. 0.64, 0.51 in the validation cohort. The combination of performing feature selection before data re-sampling obtains a better result than the reverse combination (AUC 0.76 ± 0.06 vs. 0.70 ± 0.07, p<0.001). CONCLUSIONS: The combined model (consisting of age, histological type, C/T ratio, MATV, and radiomics signature) integrated with ENN re-sampling methods had strong lymph node metastasis prediction performance for imbalance cohorts in clinical stage T1 LUAD. Radiomics signatures extracted from PET/CT images could provide complementary prediction information compared with traditional model. |
format | Online Article Text |
id | pubmed-8831550 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-88315502022-02-12 Imbalanced Data Correction Based PET/CT Radiomics Model for Predicting Lymph Node Metastasis in Clinical Stage T1 Lung Adenocarcinoma Lv, Jieqin Chen, Xiaohui Liu, Xinran Du, Dongyang Lv, Wenbing Lu, Lijun Wu, Hubing Front Oncol Oncology OBJECTIVES: To develop and validate the imbalanced data correction based PET/CT radiomics model for predicting lymph node metastasis (LNM) in clinical stage T1 lung adenocarcinoma (LUAD). METHODS: A total of 183 patients (148/35 non-metastasis/LNM) with pathologically confirmed LUAD were retrospectively included. The cohorts were divided into training vs. validation cohort in a ratio of 7:3. A total of 487 radiomics features were extracted from PET and CT components separately for radiomics model construction. Four clinical features and seven PET/CT radiological features were extracted for traditional model construction. To balance the distribution of majority (non-metastasis) class and minority (LNM) class, the imbalance-adjustment strategies using ten data re-sampling methods were adopted. Three multivariate models (denoted as Traditional, Radiomics, and Combined) were constructed using multivariable logistic regression analysis, where the combined model incorporated all of the significant clinical, radiological, and radiomics features. One hundred times repeated Monte Carlo cross-validation was used to assess the application order of feature selection and imbalance-adjustment strategies in the machine learning pipeline. Prediction performance of each model was evaluated using the area under the receiver operating characteristic curve (AUC) and Geometric mean score (G-mean). RESULTS: A total of 2 clinical parameters, 2 radiological features, 3 PET, and 5 CT radiomics features were significantly associated with LNM. The combined model with Edited Nearest Neighbors (ENN) re-sampling methods showed strong prediction performance than traditional model or radiomics model with the AUC of 0.94 (95%CI = 0.86–0.97) vs. 0.89 (95%CI = 0.79–0.93), 0.92 (95%CI = 0.85–0.97), and G-mean of 0.88 vs. 0.82, 0.80 in the training cohort, and the AUC of 0.75 (95%CI = 0.57–0.91) vs. 0.68 (95%CI = 0.36–0.83), 0.71 (95%CI = 0.48–0.83) and G-mean of 0.76 vs. 0.64, 0.51 in the validation cohort. The combination of performing feature selection before data re-sampling obtains a better result than the reverse combination (AUC 0.76 ± 0.06 vs. 0.70 ± 0.07, p<0.001). CONCLUSIONS: The combined model (consisting of age, histological type, C/T ratio, MATV, and radiomics signature) integrated with ENN re-sampling methods had strong lymph node metastasis prediction performance for imbalance cohorts in clinical stage T1 LUAD. Radiomics signatures extracted from PET/CT images could provide complementary prediction information compared with traditional model. Frontiers Media S.A. 2022-01-28 /pmc/articles/PMC8831550/ /pubmed/35155231 http://dx.doi.org/10.3389/fonc.2022.788968 Text en Copyright © 2022 Lv, Chen, Liu, Du, Lv, Lu and Wu https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Oncology Lv, Jieqin Chen, Xiaohui Liu, Xinran Du, Dongyang Lv, Wenbing Lu, Lijun Wu, Hubing Imbalanced Data Correction Based PET/CT Radiomics Model for Predicting Lymph Node Metastasis in Clinical Stage T1 Lung Adenocarcinoma |
title | Imbalanced Data Correction Based PET/CT Radiomics Model for Predicting Lymph Node Metastasis in Clinical Stage T1 Lung Adenocarcinoma |
title_full | Imbalanced Data Correction Based PET/CT Radiomics Model for Predicting Lymph Node Metastasis in Clinical Stage T1 Lung Adenocarcinoma |
title_fullStr | Imbalanced Data Correction Based PET/CT Radiomics Model for Predicting Lymph Node Metastasis in Clinical Stage T1 Lung Adenocarcinoma |
title_full_unstemmed | Imbalanced Data Correction Based PET/CT Radiomics Model for Predicting Lymph Node Metastasis in Clinical Stage T1 Lung Adenocarcinoma |
title_short | Imbalanced Data Correction Based PET/CT Radiomics Model for Predicting Lymph Node Metastasis in Clinical Stage T1 Lung Adenocarcinoma |
title_sort | imbalanced data correction based pet/ct radiomics model for predicting lymph node metastasis in clinical stage t1 lung adenocarcinoma |
topic | Oncology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8831550/ https://www.ncbi.nlm.nih.gov/pubmed/35155231 http://dx.doi.org/10.3389/fonc.2022.788968 |
work_keys_str_mv | AT lvjieqin imbalanceddatacorrectionbasedpetctradiomicsmodelforpredictinglymphnodemetastasisinclinicalstaget1lungadenocarcinoma AT chenxiaohui imbalanceddatacorrectionbasedpetctradiomicsmodelforpredictinglymphnodemetastasisinclinicalstaget1lungadenocarcinoma AT liuxinran imbalanceddatacorrectionbasedpetctradiomicsmodelforpredictinglymphnodemetastasisinclinicalstaget1lungadenocarcinoma AT dudongyang imbalanceddatacorrectionbasedpetctradiomicsmodelforpredictinglymphnodemetastasisinclinicalstaget1lungadenocarcinoma AT lvwenbing imbalanceddatacorrectionbasedpetctradiomicsmodelforpredictinglymphnodemetastasisinclinicalstaget1lungadenocarcinoma AT lulijun imbalanceddatacorrectionbasedpetctradiomicsmodelforpredictinglymphnodemetastasisinclinicalstaget1lungadenocarcinoma AT wuhubing imbalanceddatacorrectionbasedpetctradiomicsmodelforpredictinglymphnodemetastasisinclinicalstaget1lungadenocarcinoma |