Cargando…

Label Cleaning Multiple Instance Learning: Refining Coarse Annotations on Single Whole-Slide Images

Annotating cancerous regions in whole-slide images (WSIs) of pathology samples plays a critical role in clinical diagnosis, biomedical research, and machine learning algorithms development. However, generating exhaustive and accurate annotations is labor-intensive, challenging, and costly. Drawing o...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Zhenzhen, Saoud, Carla, Wangsiricharoen, Sintawat, James, Aaron W., Popel, Aleksander S., Sulam, Jeremias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825360/
https://www.ncbi.nlm.nih.gov/pubmed/36037454
http://dx.doi.org/10.1109/TMI.2022.3202759
_version_ 1784866614393438208
author Wang, Zhenzhen
Saoud, Carla
Wangsiricharoen, Sintawat
James, Aaron W.
Popel, Aleksander S.
Sulam, Jeremias
author_facet Wang, Zhenzhen
Saoud, Carla
Wangsiricharoen, Sintawat
James, Aaron W.
Popel, Aleksander S.
Sulam, Jeremias
author_sort Wang, Zhenzhen
collection PubMed
description Annotating cancerous regions in whole-slide images (WSIs) of pathology samples plays a critical role in clinical diagnosis, biomedical research, and machine learning algorithms development. However, generating exhaustive and accurate annotations is labor-intensive, challenging, and costly. Drawing only coarse and approximate annotations is a much easier task, less costly, and it alleviates pathologists’ workload. In this paper, we study the problem of refining these approximate annotations in digital pathology to obtain more accurate ones. Some previous works have explored obtaining machine learning models from these inaccurate annotations, but few of them tackle the refinement problem where the mislabeled regions should be explicitly identified and corrected, and all of them require a – often very large – number of training samples. We present a method, named Label Cleaning Multiple Instance Learning (LC-MIL), to refine coarse annotations on a single WSI without the need for external training data. Patches cropped from a WSI with inaccurate labels are processed jointly within a multiple instance learning framework, mitigating their impact on the predictive model and refining the segmentation. Our experiments on a heterogeneous WSI set with breast cancer lymph node metastasis, liver cancer, and colorectal cancer samples show that LC-MIL significantly refines the coarse annotations, out-performing state-of-the-art alternatives, even while learning from a single slide. Moreover, we demonstrate how real annotations drawn by pathologists can be efficiently refined and improved by the proposed approach. All these results demonstrate that LC-MIL is a promising, lightweight tool to provide fine-grained annotations from coarsely annotated pathology sets.
format Online
Article
Text
id pubmed-9825360
institution National Center for Biotechnology Information
language English
publishDate 2022
record_format MEDLINE/PubMed
spelling pubmed-98253602023-01-08 Label Cleaning Multiple Instance Learning: Refining Coarse Annotations on Single Whole-Slide Images Wang, Zhenzhen Saoud, Carla Wangsiricharoen, Sintawat James, Aaron W. Popel, Aleksander S. Sulam, Jeremias IEEE Trans Med Imaging Article Annotating cancerous regions in whole-slide images (WSIs) of pathology samples plays a critical role in clinical diagnosis, biomedical research, and machine learning algorithms development. However, generating exhaustive and accurate annotations is labor-intensive, challenging, and costly. Drawing only coarse and approximate annotations is a much easier task, less costly, and it alleviates pathologists’ workload. In this paper, we study the problem of refining these approximate annotations in digital pathology to obtain more accurate ones. Some previous works have explored obtaining machine learning models from these inaccurate annotations, but few of them tackle the refinement problem where the mislabeled regions should be explicitly identified and corrected, and all of them require a – often very large – number of training samples. We present a method, named Label Cleaning Multiple Instance Learning (LC-MIL), to refine coarse annotations on a single WSI without the need for external training data. Patches cropped from a WSI with inaccurate labels are processed jointly within a multiple instance learning framework, mitigating their impact on the predictive model and refining the segmentation. Our experiments on a heterogeneous WSI set with breast cancer lymph node metastasis, liver cancer, and colorectal cancer samples show that LC-MIL significantly refines the coarse annotations, out-performing state-of-the-art alternatives, even while learning from a single slide. Moreover, we demonstrate how real annotations drawn by pathologists can be efficiently refined and improved by the proposed approach. All these results demonstrate that LC-MIL is a promising, lightweight tool to provide fine-grained annotations from coarsely annotated pathology sets. 2022-12 2022-12-02 /pmc/articles/PMC9825360/ /pubmed/36037454 http://dx.doi.org/10.1109/TMI.2022.3202759 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Wang, Zhenzhen
Saoud, Carla
Wangsiricharoen, Sintawat
James, Aaron W.
Popel, Aleksander S.
Sulam, Jeremias
Label Cleaning Multiple Instance Learning: Refining Coarse Annotations on Single Whole-Slide Images
title Label Cleaning Multiple Instance Learning: Refining Coarse Annotations on Single Whole-Slide Images
title_full Label Cleaning Multiple Instance Learning: Refining Coarse Annotations on Single Whole-Slide Images
title_fullStr Label Cleaning Multiple Instance Learning: Refining Coarse Annotations on Single Whole-Slide Images
title_full_unstemmed Label Cleaning Multiple Instance Learning: Refining Coarse Annotations on Single Whole-Slide Images
title_short Label Cleaning Multiple Instance Learning: Refining Coarse Annotations on Single Whole-Slide Images
title_sort label cleaning multiple instance learning: refining coarse annotations on single whole-slide images
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825360/
https://www.ncbi.nlm.nih.gov/pubmed/36037454
http://dx.doi.org/10.1109/TMI.2022.3202759
work_keys_str_mv AT wangzhenzhen labelcleaningmultipleinstancelearningrefiningcoarseannotationsonsinglewholeslideimages
AT saoudcarla labelcleaningmultipleinstancelearningrefiningcoarseannotationsonsinglewholeslideimages
AT wangsiricharoensintawat labelcleaningmultipleinstancelearningrefiningcoarseannotationsonsinglewholeslideimages
AT jamesaaronw labelcleaningmultipleinstancelearningrefiningcoarseannotationsonsinglewholeslideimages
AT popelaleksanders labelcleaningmultipleinstancelearningrefiningcoarseannotationsonsinglewholeslideimages
AT sulamjeremias labelcleaningmultipleinstancelearningrefiningcoarseannotationsonsinglewholeslideimages