Cargando…

Non-melanoma skin cancer segmentation for histopathology dataset

Densely labelled segmentation data for digital pathology images is costly to produce but is invaluable to training effective machine learning models. We make available 290 hand-annotated histopathology tissue sections of the 3 most common skin cancers; basal cell carcinoma (BCC), squamous cell carci...

Descripción completa

Detalles Bibliográficos
Autores principales: Thomas, Simon M., Lefevre, James G., Baxter, Glenn, Hamilton, Nicholas A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8627989/
https://www.ncbi.nlm.nih.gov/pubmed/34877372
http://dx.doi.org/10.1016/j.dib.2021.107587
_version_ 1784606928083615744
author Thomas, Simon M.
Lefevre, James G.
Baxter, Glenn
Hamilton, Nicholas A.
author_facet Thomas, Simon M.
Lefevre, James G.
Baxter, Glenn
Hamilton, Nicholas A.
author_sort Thomas, Simon M.
collection PubMed
description Densely labelled segmentation data for digital pathology images is costly to produce but is invaluable to training effective machine learning models. We make available 290 hand-annotated histopathology tissue sections of the 3 most common skin cancers; basal cell carcinoma (BCC), squamous cell carcinoma (SCC) and intraepidermal carcinoma (IEC). These non-melanoma skin cancers constitute over 90% of all skin cancer diagnoses and hence this dataset gives an opportunity to the scientific community to benchmark analytic methodologies on a significant portion of the dermatopathology workflow. The data represents typical cases of the three cancer types (not requiring a differential diagnosis) across shave, punch and excision biopsy contexts. Each image is accompanied with a segmentation mask which characterizes the section into 12 tissue types, specifically: keratin, epidermis, papillary dermis, reticular dermis, hypodermis, inflammation, glands, hair follicles and background, as well as BCC, SCC and IEC. Included also are cancer margin measurements to work towards automated assessment of surgical margin clearance and tumour invasion. This leaves open many opportunities for researchers to utilize or extend the dataset, building upon recent work on image analysis problems in skin cancer (Thomas et al., 2021).
format Online
Article
Text
id pubmed-8627989
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-86279892021-12-06 Non-melanoma skin cancer segmentation for histopathology dataset Thomas, Simon M. Lefevre, James G. Baxter, Glenn Hamilton, Nicholas A. Data Brief Data Article Densely labelled segmentation data for digital pathology images is costly to produce but is invaluable to training effective machine learning models. We make available 290 hand-annotated histopathology tissue sections of the 3 most common skin cancers; basal cell carcinoma (BCC), squamous cell carcinoma (SCC) and intraepidermal carcinoma (IEC). These non-melanoma skin cancers constitute over 90% of all skin cancer diagnoses and hence this dataset gives an opportunity to the scientific community to benchmark analytic methodologies on a significant portion of the dermatopathology workflow. The data represents typical cases of the three cancer types (not requiring a differential diagnosis) across shave, punch and excision biopsy contexts. Each image is accompanied with a segmentation mask which characterizes the section into 12 tissue types, specifically: keratin, epidermis, papillary dermis, reticular dermis, hypodermis, inflammation, glands, hair follicles and background, as well as BCC, SCC and IEC. Included also are cancer margin measurements to work towards automated assessment of surgical margin clearance and tumour invasion. This leaves open many opportunities for researchers to utilize or extend the dataset, building upon recent work on image analysis problems in skin cancer (Thomas et al., 2021). Elsevier 2021-11-19 /pmc/articles/PMC8627989/ /pubmed/34877372 http://dx.doi.org/10.1016/j.dib.2021.107587 Text en Crown Copyright © 2021 Published by Elsevier Inc. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Data Article
Thomas, Simon M.
Lefevre, James G.
Baxter, Glenn
Hamilton, Nicholas A.
Non-melanoma skin cancer segmentation for histopathology dataset
title Non-melanoma skin cancer segmentation for histopathology dataset
title_full Non-melanoma skin cancer segmentation for histopathology dataset
title_fullStr Non-melanoma skin cancer segmentation for histopathology dataset
title_full_unstemmed Non-melanoma skin cancer segmentation for histopathology dataset
title_short Non-melanoma skin cancer segmentation for histopathology dataset
title_sort non-melanoma skin cancer segmentation for histopathology dataset
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8627989/
https://www.ncbi.nlm.nih.gov/pubmed/34877372
http://dx.doi.org/10.1016/j.dib.2021.107587
work_keys_str_mv AT thomassimonm nonmelanomaskincancersegmentationforhistopathologydataset
AT lefevrejamesg nonmelanomaskincancersegmentationforhistopathologydataset
AT baxterglenn nonmelanomaskincancersegmentationforhistopathologydataset
AT hamiltonnicholasa nonmelanomaskincancersegmentationforhistopathologydataset