Cargando…

Logistic LASSO regression for the diagnosis of breast cancer using clinical demographic data and the BI-RADS lexicon for ultrasonography

PURPOSE: The aim of this study was to compare the performance of image analysis for predicting breast cancer using two distinct regression models and to evaluate the usefulness of incorporating clinical and demographic data (CDD) into the image analysis in order to improve the diagnosis of breast ca...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Sun Mi, Kim, Yongdai, Jeong, Kuhwan, Jeong, Heeyeong, Kim, Jiyoung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korean Society of Ultrasound in Medicine 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5769953/
https://www.ncbi.nlm.nih.gov/pubmed/28618771
http://dx.doi.org/10.14366/usg.16045
_version_ 1783292994070249472
author Kim, Sun Mi
Kim, Yongdai
Jeong, Kuhwan
Jeong, Heeyeong
Kim, Jiyoung
author_facet Kim, Sun Mi
Kim, Yongdai
Jeong, Kuhwan
Jeong, Heeyeong
Kim, Jiyoung
author_sort Kim, Sun Mi
collection PubMed
description PURPOSE: The aim of this study was to compare the performance of image analysis for predicting breast cancer using two distinct regression models and to evaluate the usefulness of incorporating clinical and demographic data (CDD) into the image analysis in order to improve the diagnosis of breast cancer. METHODS: This study included 139 solid masses from 139 patients who underwent a ultrasonography-guided core biopsy and had available CDD between June 2009 and April 2010. Three breast radiologists retrospectively reviewed 139 breast masses and described each lesion using the Breast Imaging Reporting and Data System (BI-RADS) lexicon. We applied and compared two regression methods-stepwise logistic (SL) regression and logistic least absolute shrinkage and selection operator (LASSO) regression-in which the BI-RADS descriptors and CDD were used as covariates. We investigated the performances of these regression methods and the agreement of radiologists in terms of test misclassification error and the area under the curve (AUC) of the tests. RESULTS: Logistic LASSO regression was superior (P<0.05) to SL regression, regardless of whether CDD was included in the covariates, in terms of test misclassification errors (0.234 vs. 0.253, without CDD; 0.196 vs. 0.258, with CDD) and AUC (0.785 vs. 0.759, without CDD; 0.873 vs. 0.735, with CDD). However, it was inferior (P<0.05) to the agreement of three radiologists in terms of test misclassification errors (0.234 vs. 0.168, without CDD; 0.196 vs. 0.088, with CDD) and the AUC without CDD (0.785 vs. 0.844, P<0.001), but was comparable to the AUC with CDD (0.873 vs. 0.880, P=0.141). CONCLUSION: Logistic LASSO regression based on BI-RADS descriptors and CDD showed better performance than SL in predicting the presence of breast cancer. The use of CDD as a supplement to the BI-RADS descriptors significantly improved the prediction of breast cancer using logistic LASSO regression.
format Online
Article
Text
id pubmed-5769953
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Korean Society of Ultrasound in Medicine
record_format MEDLINE/PubMed
spelling pubmed-57699532018-01-19 Logistic LASSO regression for the diagnosis of breast cancer using clinical demographic data and the BI-RADS lexicon for ultrasonography Kim, Sun Mi Kim, Yongdai Jeong, Kuhwan Jeong, Heeyeong Kim, Jiyoung Ultrasonography Original Article PURPOSE: The aim of this study was to compare the performance of image analysis for predicting breast cancer using two distinct regression models and to evaluate the usefulness of incorporating clinical and demographic data (CDD) into the image analysis in order to improve the diagnosis of breast cancer. METHODS: This study included 139 solid masses from 139 patients who underwent a ultrasonography-guided core biopsy and had available CDD between June 2009 and April 2010. Three breast radiologists retrospectively reviewed 139 breast masses and described each lesion using the Breast Imaging Reporting and Data System (BI-RADS) lexicon. We applied and compared two regression methods-stepwise logistic (SL) regression and logistic least absolute shrinkage and selection operator (LASSO) regression-in which the BI-RADS descriptors and CDD were used as covariates. We investigated the performances of these regression methods and the agreement of radiologists in terms of test misclassification error and the area under the curve (AUC) of the tests. RESULTS: Logistic LASSO regression was superior (P<0.05) to SL regression, regardless of whether CDD was included in the covariates, in terms of test misclassification errors (0.234 vs. 0.253, without CDD; 0.196 vs. 0.258, with CDD) and AUC (0.785 vs. 0.759, without CDD; 0.873 vs. 0.735, with CDD). However, it was inferior (P<0.05) to the agreement of three radiologists in terms of test misclassification errors (0.234 vs. 0.168, without CDD; 0.196 vs. 0.088, with CDD) and the AUC without CDD (0.785 vs. 0.844, P<0.001), but was comparable to the AUC with CDD (0.873 vs. 0.880, P=0.141). CONCLUSION: Logistic LASSO regression based on BI-RADS descriptors and CDD showed better performance than SL in predicting the presence of breast cancer. The use of CDD as a supplement to the BI-RADS descriptors significantly improved the prediction of breast cancer using logistic LASSO regression. Korean Society of Ultrasound in Medicine 2018-01 2017-04-14 /pmc/articles/PMC5769953/ /pubmed/28618771 http://dx.doi.org/10.14366/usg.16045 Text en Copyright © 2017 Korean Society of Ultrasound in Medicine (KSUM) This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Kim, Sun Mi
Kim, Yongdai
Jeong, Kuhwan
Jeong, Heeyeong
Kim, Jiyoung
Logistic LASSO regression for the diagnosis of breast cancer using clinical demographic data and the BI-RADS lexicon for ultrasonography
title Logistic LASSO regression for the diagnosis of breast cancer using clinical demographic data and the BI-RADS lexicon for ultrasonography
title_full Logistic LASSO regression for the diagnosis of breast cancer using clinical demographic data and the BI-RADS lexicon for ultrasonography
title_fullStr Logistic LASSO regression for the diagnosis of breast cancer using clinical demographic data and the BI-RADS lexicon for ultrasonography
title_full_unstemmed Logistic LASSO regression for the diagnosis of breast cancer using clinical demographic data and the BI-RADS lexicon for ultrasonography
title_short Logistic LASSO regression for the diagnosis of breast cancer using clinical demographic data and the BI-RADS lexicon for ultrasonography
title_sort logistic lasso regression for the diagnosis of breast cancer using clinical demographic data and the bi-rads lexicon for ultrasonography
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5769953/
https://www.ncbi.nlm.nih.gov/pubmed/28618771
http://dx.doi.org/10.14366/usg.16045
work_keys_str_mv AT kimsunmi logisticlassoregressionforthediagnosisofbreastcancerusingclinicaldemographicdataandthebiradslexiconforultrasonography
AT kimyongdai logisticlassoregressionforthediagnosisofbreastcancerusingclinicaldemographicdataandthebiradslexiconforultrasonography
AT jeongkuhwan logisticlassoregressionforthediagnosisofbreastcancerusingclinicaldemographicdataandthebiradslexiconforultrasonography
AT jeongheeyeong logisticlassoregressionforthediagnosisofbreastcancerusingclinicaldemographicdataandthebiradslexiconforultrasonography
AT kimjiyoung logisticlassoregressionforthediagnosisofbreastcancerusingclinicaldemographicdataandthebiradslexiconforultrasonography