Cargando…

Liquid based-cytology Pap smear dataset for automated multi-class diagnosis of pre-cancerous and cervical cancer lesions

While a publicly available benchmark dataset provides a base for the development of new algorithms and comparison of results, hospital-based data collected from the real-world clinical setup is also very important in AI-based medical research for automated disease diagnosis, prediction or classifica...

Descripción completa

Detalles Bibliográficos
Autores principales: Hussain, Elima, Mahanta, Lipi B., Borah, Himakshi, Das, Chandana Ray
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7186519/
https://www.ncbi.nlm.nih.gov/pubmed/32368601
http://dx.doi.org/10.1016/j.dib.2020.105589
_version_ 1783526967133339648
author Hussain, Elima
Mahanta, Lipi B.
Borah, Himakshi
Das, Chandana Ray
author_facet Hussain, Elima
Mahanta, Lipi B.
Borah, Himakshi
Das, Chandana Ray
author_sort Hussain, Elima
collection PubMed
description While a publicly available benchmark dataset provides a base for the development of new algorithms and comparison of results, hospital-based data collected from the real-world clinical setup is also very important in AI-based medical research for automated disease diagnosis, prediction or classifications as per standard protocol. Primary data must be constantly updated so that the developed algorithms achieve as much accuracy as possible in the regional context. This dataset would support research work related to image segmentation and final classification for a complete decision support system (https://doi.org/10.1016/j.tice.2020.101347) [1]. Liquid-based cytology (LBC) is one of the cervical screening tests. The repository consists of a total of 963 LBC images sub-divided into four sets representing the four classes: NILM, LSIL, HSIL, and SCC. It comprises pre-cancerous and cancerous lesions related to cervical cancer as per standards under The Bethesda System (TBS). The images were captured in 40x magnification using Leica ICC50 HD microscope collected with due consent from 460 patients visiting the O&G department of the public hospital with various gynaecological problems. The images were then viewed and categorized by experts of the pathology department.
format Online
Article
Text
id pubmed-7186519
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-71865192020-05-04 Liquid based-cytology Pap smear dataset for automated multi-class diagnosis of pre-cancerous and cervical cancer lesions Hussain, Elima Mahanta, Lipi B. Borah, Himakshi Das, Chandana Ray Data Brief Computer Science While a publicly available benchmark dataset provides a base for the development of new algorithms and comparison of results, hospital-based data collected from the real-world clinical setup is also very important in AI-based medical research for automated disease diagnosis, prediction or classifications as per standard protocol. Primary data must be constantly updated so that the developed algorithms achieve as much accuracy as possible in the regional context. This dataset would support research work related to image segmentation and final classification for a complete decision support system (https://doi.org/10.1016/j.tice.2020.101347) [1]. Liquid-based cytology (LBC) is one of the cervical screening tests. The repository consists of a total of 963 LBC images sub-divided into four sets representing the four classes: NILM, LSIL, HSIL, and SCC. It comprises pre-cancerous and cancerous lesions related to cervical cancer as per standards under The Bethesda System (TBS). The images were captured in 40x magnification using Leica ICC50 HD microscope collected with due consent from 460 patients visiting the O&G department of the public hospital with various gynaecological problems. The images were then viewed and categorized by experts of the pathology department. Elsevier 2020-04-22 /pmc/articles/PMC7186519/ /pubmed/32368601 http://dx.doi.org/10.1016/j.dib.2020.105589 Text en © 2020 Published by Elsevier Inc. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Computer Science
Hussain, Elima
Mahanta, Lipi B.
Borah, Himakshi
Das, Chandana Ray
Liquid based-cytology Pap smear dataset for automated multi-class diagnosis of pre-cancerous and cervical cancer lesions
title Liquid based-cytology Pap smear dataset for automated multi-class diagnosis of pre-cancerous and cervical cancer lesions
title_full Liquid based-cytology Pap smear dataset for automated multi-class diagnosis of pre-cancerous and cervical cancer lesions
title_fullStr Liquid based-cytology Pap smear dataset for automated multi-class diagnosis of pre-cancerous and cervical cancer lesions
title_full_unstemmed Liquid based-cytology Pap smear dataset for automated multi-class diagnosis of pre-cancerous and cervical cancer lesions
title_short Liquid based-cytology Pap smear dataset for automated multi-class diagnosis of pre-cancerous and cervical cancer lesions
title_sort liquid based-cytology pap smear dataset for automated multi-class diagnosis of pre-cancerous and cervical cancer lesions
topic Computer Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7186519/
https://www.ncbi.nlm.nih.gov/pubmed/32368601
http://dx.doi.org/10.1016/j.dib.2020.105589
work_keys_str_mv AT hussainelima liquidbasedcytologypapsmeardatasetforautomatedmulticlassdiagnosisofprecancerousandcervicalcancerlesions
AT mahantalipib liquidbasedcytologypapsmeardatasetforautomatedmulticlassdiagnosisofprecancerousandcervicalcancerlesions
AT borahhimakshi liquidbasedcytologypapsmeardatasetforautomatedmulticlassdiagnosisofprecancerousandcervicalcancerlesions
AT daschandanaray liquidbasedcytologypapsmeardatasetforautomatedmulticlassdiagnosisofprecancerousandcervicalcancerlesions