Cargando…

A completely annotated whole slide image dataset of canine breast cancer to aid human breast cancer research

Canine mammary carcinoma (CMC) has been used as a model to investigate the pathogenesis of human breast cancer and the same grading scheme is commonly used to assess tumor malignancy in both. One key component of this grading scheme is the density of mitotic figures (MF). Current publicly available...

Descripción completa

Detalles Bibliográficos
Autores principales: Aubreville, Marc, Bertram, Christof A., Donovan, Taryn A., Marzahl, Christian, Maier, Andreas, Klopfleisch, Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7699627/
https://www.ncbi.nlm.nih.gov/pubmed/33247116
http://dx.doi.org/10.1038/s41597-020-00756-z
_version_ 1783616092521889792
author Aubreville, Marc
Bertram, Christof A.
Donovan, Taryn A.
Marzahl, Christian
Maier, Andreas
Klopfleisch, Robert
author_facet Aubreville, Marc
Bertram, Christof A.
Donovan, Taryn A.
Marzahl, Christian
Maier, Andreas
Klopfleisch, Robert
author_sort Aubreville, Marc
collection PubMed
description Canine mammary carcinoma (CMC) has been used as a model to investigate the pathogenesis of human breast cancer and the same grading scheme is commonly used to assess tumor malignancy in both. One key component of this grading scheme is the density of mitotic figures (MF). Current publicly available datasets on human breast cancer only provide annotations for small subsets of whole slide images (WSIs). We present a novel dataset of 21 WSIs of CMC completely annotated for MF. For this, a pathologist screened all WSIs for potential MF and structures with a similar appearance. A second expert blindly assigned labels, and for non-matching labels, a third expert assigned the final labels. Additionally, we used machine learning to identify previously undetected MF. Finally, we performed representation learning and two-dimensional projection to further increase the consistency of the annotations. Our dataset consists of 13,907 MF and 36,379 hard negatives. We achieved a mean F1-score of 0.791 on the test set and of up to 0.696 on a human breast cancer dataset.
format Online
Article
Text
id pubmed-7699627
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-76996272020-12-03 A completely annotated whole slide image dataset of canine breast cancer to aid human breast cancer research Aubreville, Marc Bertram, Christof A. Donovan, Taryn A. Marzahl, Christian Maier, Andreas Klopfleisch, Robert Sci Data Data Descriptor Canine mammary carcinoma (CMC) has been used as a model to investigate the pathogenesis of human breast cancer and the same grading scheme is commonly used to assess tumor malignancy in both. One key component of this grading scheme is the density of mitotic figures (MF). Current publicly available datasets on human breast cancer only provide annotations for small subsets of whole slide images (WSIs). We present a novel dataset of 21 WSIs of CMC completely annotated for MF. For this, a pathologist screened all WSIs for potential MF and structures with a similar appearance. A second expert blindly assigned labels, and for non-matching labels, a third expert assigned the final labels. Additionally, we used machine learning to identify previously undetected MF. Finally, we performed representation learning and two-dimensional projection to further increase the consistency of the annotations. Our dataset consists of 13,907 MF and 36,379 hard negatives. We achieved a mean F1-score of 0.791 on the test set and of up to 0.696 on a human breast cancer dataset. Nature Publishing Group UK 2020-11-27 /pmc/articles/PMC7699627/ /pubmed/33247116 http://dx.doi.org/10.1038/s41597-020-00756-z Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.
spellingShingle Data Descriptor
Aubreville, Marc
Bertram, Christof A.
Donovan, Taryn A.
Marzahl, Christian
Maier, Andreas
Klopfleisch, Robert
A completely annotated whole slide image dataset of canine breast cancer to aid human breast cancer research
title A completely annotated whole slide image dataset of canine breast cancer to aid human breast cancer research
title_full A completely annotated whole slide image dataset of canine breast cancer to aid human breast cancer research
title_fullStr A completely annotated whole slide image dataset of canine breast cancer to aid human breast cancer research
title_full_unstemmed A completely annotated whole slide image dataset of canine breast cancer to aid human breast cancer research
title_short A completely annotated whole slide image dataset of canine breast cancer to aid human breast cancer research
title_sort completely annotated whole slide image dataset of canine breast cancer to aid human breast cancer research
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7699627/
https://www.ncbi.nlm.nih.gov/pubmed/33247116
http://dx.doi.org/10.1038/s41597-020-00756-z
work_keys_str_mv AT aubrevillemarc acompletelyannotatedwholeslideimagedatasetofcaninebreastcancertoaidhumanbreastcancerresearch
AT bertramchristofa acompletelyannotatedwholeslideimagedatasetofcaninebreastcancertoaidhumanbreastcancerresearch
AT donovantaryna acompletelyannotatedwholeslideimagedatasetofcaninebreastcancertoaidhumanbreastcancerresearch
AT marzahlchristian acompletelyannotatedwholeslideimagedatasetofcaninebreastcancertoaidhumanbreastcancerresearch
AT maierandreas acompletelyannotatedwholeslideimagedatasetofcaninebreastcancertoaidhumanbreastcancerresearch
AT klopfleischrobert acompletelyannotatedwholeslideimagedatasetofcaninebreastcancertoaidhumanbreastcancerresearch
AT aubrevillemarc completelyannotatedwholeslideimagedatasetofcaninebreastcancertoaidhumanbreastcancerresearch
AT bertramchristofa completelyannotatedwholeslideimagedatasetofcaninebreastcancertoaidhumanbreastcancerresearch
AT donovantaryna completelyannotatedwholeslideimagedatasetofcaninebreastcancertoaidhumanbreastcancerresearch
AT marzahlchristian completelyannotatedwholeslideimagedatasetofcaninebreastcancertoaidhumanbreastcancerresearch
AT maierandreas completelyannotatedwholeslideimagedatasetofcaninebreastcancertoaidhumanbreastcancerresearch
AT klopfleischrobert completelyannotatedwholeslideimagedatasetofcaninebreastcancertoaidhumanbreastcancerresearch