Cargando…
Machine Learning Algorithm for Distinguishing Ductal Carcinoma In Situ from Invasive Breast Cancer
SIMPLE SUMMARY: Breast cancer nowadays is the most common cancer among women. Two types refer to whether cancer has spread or not: Non-invasive and invasive breast cancers. Invasive ductal carcinoma is responsible for approximately 80% of all breast cancers, and ductal carcinoma in situ accounts for...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9139618/ https://www.ncbi.nlm.nih.gov/pubmed/35626043 http://dx.doi.org/10.3390/cancers14102437 |
_version_ | 1784714901488402432 |
---|---|
author | Vy, Vu Pham Thao Yao, Melissa Min-Szu Khanh Le, Nguyen Quoc Chan, Wing P. |
author_facet | Vy, Vu Pham Thao Yao, Melissa Min-Szu Khanh Le, Nguyen Quoc Chan, Wing P. |
author_sort | Vy, Vu Pham Thao |
collection | PubMed |
description | SIMPLE SUMMARY: Breast cancer nowadays is the most common cancer among women. Two types refer to whether cancer has spread or not: Non-invasive and invasive breast cancers. Invasive ductal carcinoma is responsible for approximately 80% of all breast cancers, and ductal carcinoma in situ accounts for the majority of the remainder. Early identification of types of breast cancers provides breast cancer patients with more options for less invasive therapy. Our study aimed to develop a machine-learning classification model to differentiate ductal carcinoma in situ and minimally invasive breast cancer using clinical characteristics, mammography findings, ultrasound findings, and histopathology features. Our model showed that the five most important features were calcifications on mammograms, lymph node presence, microcalcifications on histopathology, the shape of the mass on ultrasound, and the orientation of the mass on ultrasound. ABSTRACT: Purpose: Given that early identification of breast cancer type allows for less-invasive therapies, we aimed to develop a machine learning model to discriminate between ductal carcinoma in situ (DCIS) and minimally invasive breast cancer (MIBC). Methods: In this retrospective study, the health records of 420 women who underwent biopsies between 2010 and 2020 to confirm breast cancer were collected. A trained XGBoost algorithm was used to classify cancers as either DCIS or MIBC using clinical characteristics, mammographic findings, ultrasonographic findings, and histopathological features. Its performance was measured against other methods using area under the receiver operating characteristic curve (AUC), sensitivity, specificity, accuracy, precision, and F1 score. Results: The model was trained using 357 women and tested using 63 women with an overall 420 patients (mean [standard deviation] age, 57.1 [12.0] years). The model performed well when feature importance was determined, reaching an accuracy of 0.84 (95% confidence interval [CI], 0.76–0.91), an AUC of 0.93 (95% CI, 0.87–0.95), a specificity of 0.75 (95% CI, 0.67–0.83), and a sensitivity of 0.91 (95% CI, 0.76–0.94). Conclusion: The XGBoost model, combining clinical, mammographic, ultrasonographic, and histopathologic findings, can be used to discriminate DCIS from MIBC with an accuracy equivalent to that of experienced radiologists, thereby giving patients the widest range of therapeutic options. |
format | Online Article Text |
id | pubmed-9139618 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-91396182022-05-28 Machine Learning Algorithm for Distinguishing Ductal Carcinoma In Situ from Invasive Breast Cancer Vy, Vu Pham Thao Yao, Melissa Min-Szu Khanh Le, Nguyen Quoc Chan, Wing P. Cancers (Basel) Article SIMPLE SUMMARY: Breast cancer nowadays is the most common cancer among women. Two types refer to whether cancer has spread or not: Non-invasive and invasive breast cancers. Invasive ductal carcinoma is responsible for approximately 80% of all breast cancers, and ductal carcinoma in situ accounts for the majority of the remainder. Early identification of types of breast cancers provides breast cancer patients with more options for less invasive therapy. Our study aimed to develop a machine-learning classification model to differentiate ductal carcinoma in situ and minimally invasive breast cancer using clinical characteristics, mammography findings, ultrasound findings, and histopathology features. Our model showed that the five most important features were calcifications on mammograms, lymph node presence, microcalcifications on histopathology, the shape of the mass on ultrasound, and the orientation of the mass on ultrasound. ABSTRACT: Purpose: Given that early identification of breast cancer type allows for less-invasive therapies, we aimed to develop a machine learning model to discriminate between ductal carcinoma in situ (DCIS) and minimally invasive breast cancer (MIBC). Methods: In this retrospective study, the health records of 420 women who underwent biopsies between 2010 and 2020 to confirm breast cancer were collected. A trained XGBoost algorithm was used to classify cancers as either DCIS or MIBC using clinical characteristics, mammographic findings, ultrasonographic findings, and histopathological features. Its performance was measured against other methods using area under the receiver operating characteristic curve (AUC), sensitivity, specificity, accuracy, precision, and F1 score. Results: The model was trained using 357 women and tested using 63 women with an overall 420 patients (mean [standard deviation] age, 57.1 [12.0] years). The model performed well when feature importance was determined, reaching an accuracy of 0.84 (95% confidence interval [CI], 0.76–0.91), an AUC of 0.93 (95% CI, 0.87–0.95), a specificity of 0.75 (95% CI, 0.67–0.83), and a sensitivity of 0.91 (95% CI, 0.76–0.94). Conclusion: The XGBoost model, combining clinical, mammographic, ultrasonographic, and histopathologic findings, can be used to discriminate DCIS from MIBC with an accuracy equivalent to that of experienced radiologists, thereby giving patients the widest range of therapeutic options. MDPI 2022-05-15 /pmc/articles/PMC9139618/ /pubmed/35626043 http://dx.doi.org/10.3390/cancers14102437 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Vy, Vu Pham Thao Yao, Melissa Min-Szu Khanh Le, Nguyen Quoc Chan, Wing P. Machine Learning Algorithm for Distinguishing Ductal Carcinoma In Situ from Invasive Breast Cancer |
title | Machine Learning Algorithm for Distinguishing Ductal Carcinoma In Situ from Invasive Breast Cancer |
title_full | Machine Learning Algorithm for Distinguishing Ductal Carcinoma In Situ from Invasive Breast Cancer |
title_fullStr | Machine Learning Algorithm for Distinguishing Ductal Carcinoma In Situ from Invasive Breast Cancer |
title_full_unstemmed | Machine Learning Algorithm for Distinguishing Ductal Carcinoma In Situ from Invasive Breast Cancer |
title_short | Machine Learning Algorithm for Distinguishing Ductal Carcinoma In Situ from Invasive Breast Cancer |
title_sort | machine learning algorithm for distinguishing ductal carcinoma in situ from invasive breast cancer |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9139618/ https://www.ncbi.nlm.nih.gov/pubmed/35626043 http://dx.doi.org/10.3390/cancers14102437 |
work_keys_str_mv | AT vyvuphamthao machinelearningalgorithmfordistinguishingductalcarcinomainsitufrominvasivebreastcancer AT yaomelissaminszu machinelearningalgorithmfordistinguishingductalcarcinomainsitufrominvasivebreastcancer AT khanhlenguyenquoc machinelearningalgorithmfordistinguishingductalcarcinomainsitufrominvasivebreastcancer AT chanwingp machinelearningalgorithmfordistinguishingductalcarcinomainsitufrominvasivebreastcancer |