Cargando…

Ensemble Learning for Breast Cancer Lesion Classification: A Pilot Validation Using Correlated Spectroscopic Imaging and Diffusion-Weighted Imaging

The main objective of this work was to evaluate the application of individual and ensemble machine learning models to classify malignant and benign breast masses using features from two-dimensional (2D) correlated spectroscopy spectra extracted from five-dimensional echo-planar correlated spectrosco...

Descripción completa

Detalles Bibliográficos
Autores principales: Joy, Ajin, Lin, Marlene, Joines, Melissa, Saucedo, Andres, Lee-Felker, Stephanie, Baker, Jennifer, Chien, Aichi, Emir, Uzay, Macey, Paul M., Thomas, M. Albert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10385820/
https://www.ncbi.nlm.nih.gov/pubmed/37512542
http://dx.doi.org/10.3390/metabo13070835
_version_ 1785081505079361536
author Joy, Ajin
Lin, Marlene
Joines, Melissa
Saucedo, Andres
Lee-Felker, Stephanie
Baker, Jennifer
Chien, Aichi
Emir, Uzay
Macey, Paul M.
Thomas, M. Albert
author_facet Joy, Ajin
Lin, Marlene
Joines, Melissa
Saucedo, Andres
Lee-Felker, Stephanie
Baker, Jennifer
Chien, Aichi
Emir, Uzay
Macey, Paul M.
Thomas, M. Albert
author_sort Joy, Ajin
collection PubMed
description The main objective of this work was to evaluate the application of individual and ensemble machine learning models to classify malignant and benign breast masses using features from two-dimensional (2D) correlated spectroscopy spectra extracted from five-dimensional echo-planar correlated spectroscopic imaging (5D EP-COSI) and diffusion-weighted imaging (DWI). Twenty-four different metabolite and lipid ratios with respect to diagonal fat peaks (1.4 ppm, 5.4 ppm) from 2D spectra, and water and fat peaks (4.7 ppm, 1.4 ppm) from one-dimensional non-water-suppressed (NWS) spectra were used as the features. Additionally, water fraction, fat fraction and water-to-fat ratios from NWS spectra and apparent diffusion coefficients (ADC) from DWI were included. The nine most important features were identified using recursive feature elimination, sequential forward selection and correlation analysis. XGBoost (AUC: 93.0%, Accuracy: 85.7%, F1-score: 88.9%, Precision: 88.2%, Sensitivity: 90.4%, Specificity: 84.6%) and GradientBoost (AUC: 94.3%, Accuracy: 89.3%, F1-score: 90.7%, Precision: 87.9%, Sensitivity: 94.2%, Specificity: 83.4%) were the best-performing models. Conventional biomarkers like choline, myo-Inositol, and glycine were statistically significant predictors. Key features contributing to the classification were ADC, 2D diagonal peaks at 0.9 ppm, 2.1 ppm, 3.5 ppm, and 5.4 ppm, cross peaks between 1.4 and 0.9 ppm, 4.3 and 4.1 ppm, 2.3 and 1.6 ppm, and the triglyceryl–fat cross peak. The results highlight the contribution of the 2D spectral peaks to the model, and they demonstrate the potential of 5D EP-COSI for early breast cancer detection.
format Online
Article
Text
id pubmed-10385820
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103858202023-07-30 Ensemble Learning for Breast Cancer Lesion Classification: A Pilot Validation Using Correlated Spectroscopic Imaging and Diffusion-Weighted Imaging Joy, Ajin Lin, Marlene Joines, Melissa Saucedo, Andres Lee-Felker, Stephanie Baker, Jennifer Chien, Aichi Emir, Uzay Macey, Paul M. Thomas, M. Albert Metabolites Article The main objective of this work was to evaluate the application of individual and ensemble machine learning models to classify malignant and benign breast masses using features from two-dimensional (2D) correlated spectroscopy spectra extracted from five-dimensional echo-planar correlated spectroscopic imaging (5D EP-COSI) and diffusion-weighted imaging (DWI). Twenty-four different metabolite and lipid ratios with respect to diagonal fat peaks (1.4 ppm, 5.4 ppm) from 2D spectra, and water and fat peaks (4.7 ppm, 1.4 ppm) from one-dimensional non-water-suppressed (NWS) spectra were used as the features. Additionally, water fraction, fat fraction and water-to-fat ratios from NWS spectra and apparent diffusion coefficients (ADC) from DWI were included. The nine most important features were identified using recursive feature elimination, sequential forward selection and correlation analysis. XGBoost (AUC: 93.0%, Accuracy: 85.7%, F1-score: 88.9%, Precision: 88.2%, Sensitivity: 90.4%, Specificity: 84.6%) and GradientBoost (AUC: 94.3%, Accuracy: 89.3%, F1-score: 90.7%, Precision: 87.9%, Sensitivity: 94.2%, Specificity: 83.4%) were the best-performing models. Conventional biomarkers like choline, myo-Inositol, and glycine were statistically significant predictors. Key features contributing to the classification were ADC, 2D diagonal peaks at 0.9 ppm, 2.1 ppm, 3.5 ppm, and 5.4 ppm, cross peaks between 1.4 and 0.9 ppm, 4.3 and 4.1 ppm, 2.3 and 1.6 ppm, and the triglyceryl–fat cross peak. The results highlight the contribution of the 2D spectral peaks to the model, and they demonstrate the potential of 5D EP-COSI for early breast cancer detection. MDPI 2023-07-11 /pmc/articles/PMC10385820/ /pubmed/37512542 http://dx.doi.org/10.3390/metabo13070835 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Joy, Ajin
Lin, Marlene
Joines, Melissa
Saucedo, Andres
Lee-Felker, Stephanie
Baker, Jennifer
Chien, Aichi
Emir, Uzay
Macey, Paul M.
Thomas, M. Albert
Ensemble Learning for Breast Cancer Lesion Classification: A Pilot Validation Using Correlated Spectroscopic Imaging and Diffusion-Weighted Imaging
title Ensemble Learning for Breast Cancer Lesion Classification: A Pilot Validation Using Correlated Spectroscopic Imaging and Diffusion-Weighted Imaging
title_full Ensemble Learning for Breast Cancer Lesion Classification: A Pilot Validation Using Correlated Spectroscopic Imaging and Diffusion-Weighted Imaging
title_fullStr Ensemble Learning for Breast Cancer Lesion Classification: A Pilot Validation Using Correlated Spectroscopic Imaging and Diffusion-Weighted Imaging
title_full_unstemmed Ensemble Learning for Breast Cancer Lesion Classification: A Pilot Validation Using Correlated Spectroscopic Imaging and Diffusion-Weighted Imaging
title_short Ensemble Learning for Breast Cancer Lesion Classification: A Pilot Validation Using Correlated Spectroscopic Imaging and Diffusion-Weighted Imaging
title_sort ensemble learning for breast cancer lesion classification: a pilot validation using correlated spectroscopic imaging and diffusion-weighted imaging
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10385820/
https://www.ncbi.nlm.nih.gov/pubmed/37512542
http://dx.doi.org/10.3390/metabo13070835
work_keys_str_mv AT joyajin ensemblelearningforbreastcancerlesionclassificationapilotvalidationusingcorrelatedspectroscopicimaginganddiffusionweightedimaging
AT linmarlene ensemblelearningforbreastcancerlesionclassificationapilotvalidationusingcorrelatedspectroscopicimaginganddiffusionweightedimaging
AT joinesmelissa ensemblelearningforbreastcancerlesionclassificationapilotvalidationusingcorrelatedspectroscopicimaginganddiffusionweightedimaging
AT saucedoandres ensemblelearningforbreastcancerlesionclassificationapilotvalidationusingcorrelatedspectroscopicimaginganddiffusionweightedimaging
AT leefelkerstephanie ensemblelearningforbreastcancerlesionclassificationapilotvalidationusingcorrelatedspectroscopicimaginganddiffusionweightedimaging
AT bakerjennifer ensemblelearningforbreastcancerlesionclassificationapilotvalidationusingcorrelatedspectroscopicimaginganddiffusionweightedimaging
AT chienaichi ensemblelearningforbreastcancerlesionclassificationapilotvalidationusingcorrelatedspectroscopicimaginganddiffusionweightedimaging
AT emiruzay ensemblelearningforbreastcancerlesionclassificationapilotvalidationusingcorrelatedspectroscopicimaginganddiffusionweightedimaging
AT maceypaulm ensemblelearningforbreastcancerlesionclassificationapilotvalidationusingcorrelatedspectroscopicimaginganddiffusionweightedimaging
AT thomasmalbert ensemblelearningforbreastcancerlesionclassificationapilotvalidationusingcorrelatedspectroscopicimaginganddiffusionweightedimaging