Cargando…

Automatic Labeling of Special Diagnostic Mammography Views from Images and DICOM Headers

Applying state-of-the-art machine learning techniques to medical images requires a thorough selection and normalization of input data. One of such steps in digital mammography screening for breast cancer is the labeling and removal of special diagnostic views, in which diagnostic tools or magnificat...

Descripción completa

Detalles Bibliográficos
Autores principales: Lituiev, Dmytro S., Trivedi, Hari, Panahiazar, Maryam, Norgeot, Beau, Seo, Youngho, Franc, Benjamin, Harnish, Roy, Kawczynski, Michael, Hadley, Dexter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6456464/
https://www.ncbi.nlm.nih.gov/pubmed/30465142
http://dx.doi.org/10.1007/s10278-018-0154-z
_version_ 1783409754995949568
author Lituiev, Dmytro S.
Trivedi, Hari
Panahiazar, Maryam
Norgeot, Beau
Seo, Youngho
Franc, Benjamin
Harnish, Roy
Kawczynski, Michael
Hadley, Dexter
author_facet Lituiev, Dmytro S.
Trivedi, Hari
Panahiazar, Maryam
Norgeot, Beau
Seo, Youngho
Franc, Benjamin
Harnish, Roy
Kawczynski, Michael
Hadley, Dexter
author_sort Lituiev, Dmytro S.
collection PubMed
description Applying state-of-the-art machine learning techniques to medical images requires a thorough selection and normalization of input data. One of such steps in digital mammography screening for breast cancer is the labeling and removal of special diagnostic views, in which diagnostic tools or magnification are applied to assist in assessment of suspicious initial findings. As a common task in medical informatics is prediction of disease and its stage, these special diagnostic views, which are only enriched among the cohort of diseased cases, will bias machine learning disease predictions. In order to automate this process, here, we develop a machine learning pipeline that utilizes both DICOM headers and images to predict such views in an automatic manner, allowing for their removal and the generation of unbiased datasets. We achieve AUC of 99.72% in predicting special mammogram views when combining both types of models. Finally, we apply these models to clean up a dataset of about 772,000 images with expected sensitivity of 99.0%. The pipeline presented in this paper can be applied to other datasets to obtain high-quality image sets suitable to train algorithms for disease detection. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s10278-018-0154-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6456464
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-64564642019-04-26 Automatic Labeling of Special Diagnostic Mammography Views from Images and DICOM Headers Lituiev, Dmytro S. Trivedi, Hari Panahiazar, Maryam Norgeot, Beau Seo, Youngho Franc, Benjamin Harnish, Roy Kawczynski, Michael Hadley, Dexter J Digit Imaging Article Applying state-of-the-art machine learning techniques to medical images requires a thorough selection and normalization of input data. One of such steps in digital mammography screening for breast cancer is the labeling and removal of special diagnostic views, in which diagnostic tools or magnification are applied to assist in assessment of suspicious initial findings. As a common task in medical informatics is prediction of disease and its stage, these special diagnostic views, which are only enriched among the cohort of diseased cases, will bias machine learning disease predictions. In order to automate this process, here, we develop a machine learning pipeline that utilizes both DICOM headers and images to predict such views in an automatic manner, allowing for their removal and the generation of unbiased datasets. We achieve AUC of 99.72% in predicting special mammogram views when combining both types of models. Finally, we apply these models to clean up a dataset of about 772,000 images with expected sensitivity of 99.0%. The pipeline presented in this paper can be applied to other datasets to obtain high-quality image sets suitable to train algorithms for disease detection. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s10278-018-0154-z) contains supplementary material, which is available to authorized users. Springer International Publishing 2018-11-21 2019-04 /pmc/articles/PMC6456464/ /pubmed/30465142 http://dx.doi.org/10.1007/s10278-018-0154-z Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Article
Lituiev, Dmytro S.
Trivedi, Hari
Panahiazar, Maryam
Norgeot, Beau
Seo, Youngho
Franc, Benjamin
Harnish, Roy
Kawczynski, Michael
Hadley, Dexter
Automatic Labeling of Special Diagnostic Mammography Views from Images and DICOM Headers
title Automatic Labeling of Special Diagnostic Mammography Views from Images and DICOM Headers
title_full Automatic Labeling of Special Diagnostic Mammography Views from Images and DICOM Headers
title_fullStr Automatic Labeling of Special Diagnostic Mammography Views from Images and DICOM Headers
title_full_unstemmed Automatic Labeling of Special Diagnostic Mammography Views from Images and DICOM Headers
title_short Automatic Labeling of Special Diagnostic Mammography Views from Images and DICOM Headers
title_sort automatic labeling of special diagnostic mammography views from images and dicom headers
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6456464/
https://www.ncbi.nlm.nih.gov/pubmed/30465142
http://dx.doi.org/10.1007/s10278-018-0154-z
work_keys_str_mv AT lituievdmytros automaticlabelingofspecialdiagnosticmammographyviewsfromimagesanddicomheaders
AT trivedihari automaticlabelingofspecialdiagnosticmammographyviewsfromimagesanddicomheaders
AT panahiazarmaryam automaticlabelingofspecialdiagnosticmammographyviewsfromimagesanddicomheaders
AT norgeotbeau automaticlabelingofspecialdiagnosticmammographyviewsfromimagesanddicomheaders
AT seoyoungho automaticlabelingofspecialdiagnosticmammographyviewsfromimagesanddicomheaders
AT francbenjamin automaticlabelingofspecialdiagnosticmammographyviewsfromimagesanddicomheaders
AT harnishroy automaticlabelingofspecialdiagnosticmammographyviewsfromimagesanddicomheaders
AT kawczynskimichael automaticlabelingofspecialdiagnosticmammographyviewsfromimagesanddicomheaders
AT hadleydexter automaticlabelingofspecialdiagnosticmammographyviewsfromimagesanddicomheaders