Cargando…

Analysis: Flawed Datasets of Monkeypox Skin Images

The self-proclaimed first publicly available dataset of Monkeypox skin images consists of medically irrelevant images extracted from Google and photography repositories through a process denominated web-scrapping. Yet, this did not stop other researchers from employing it to build Machine Learning (...

Descripción completa

Detalles Bibliográficos
Autores principales: Vega, Carlos, Schneider, Reinhard, Satagopam, Venkata
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10024024/
https://www.ncbi.nlm.nih.gov/pubmed/36933065
http://dx.doi.org/10.1007/s10916-023-01928-1
_version_ 1784909018087555072
author Vega, Carlos
Schneider, Reinhard
Satagopam, Venkata
author_facet Vega, Carlos
Schneider, Reinhard
Satagopam, Venkata
author_sort Vega, Carlos
collection PubMed
description The self-proclaimed first publicly available dataset of Monkeypox skin images consists of medically irrelevant images extracted from Google and photography repositories through a process denominated web-scrapping. Yet, this did not stop other researchers from employing it to build Machine Learning (ML) solutions aimed at computer-aided diagnosis of Monkeypox and other viral infections presenting skin lesions. Neither did it stop the reviewers or editors from publishing these subsequent works in peer-reviewed journals. Several of these works claimed extraordinary performance in the classification of Monkeypox, Chickenpox and Measles, employing ML and the aforementioned dataset. In this work, we analyse the initiator work that has catalysed the development of several ML solutions, and whose popularity is continuing to grow. Further, we provide a rebuttal experiment that showcases the risks of such methodologies, proving that the ML solutions do not necessarily obtain their performance from the features relevant to the diseases at issue.
format Online
Article
Text
id pubmed-10024024
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-100240242023-03-20 Analysis: Flawed Datasets of Monkeypox Skin Images Vega, Carlos Schneider, Reinhard Satagopam, Venkata J Med Syst Original Paper The self-proclaimed first publicly available dataset of Monkeypox skin images consists of medically irrelevant images extracted from Google and photography repositories through a process denominated web-scrapping. Yet, this did not stop other researchers from employing it to build Machine Learning (ML) solutions aimed at computer-aided diagnosis of Monkeypox and other viral infections presenting skin lesions. Neither did it stop the reviewers or editors from publishing these subsequent works in peer-reviewed journals. Several of these works claimed extraordinary performance in the classification of Monkeypox, Chickenpox and Measles, employing ML and the aforementioned dataset. In this work, we analyse the initiator work that has catalysed the development of several ML solutions, and whose popularity is continuing to grow. Further, we provide a rebuttal experiment that showcases the risks of such methodologies, proving that the ML solutions do not necessarily obtain their performance from the features relevant to the diseases at issue. Springer US 2023-03-18 2023 /pmc/articles/PMC10024024/ /pubmed/36933065 http://dx.doi.org/10.1007/s10916-023-01928-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Paper
Vega, Carlos
Schneider, Reinhard
Satagopam, Venkata
Analysis: Flawed Datasets of Monkeypox Skin Images
title Analysis: Flawed Datasets of Monkeypox Skin Images
title_full Analysis: Flawed Datasets of Monkeypox Skin Images
title_fullStr Analysis: Flawed Datasets of Monkeypox Skin Images
title_full_unstemmed Analysis: Flawed Datasets of Monkeypox Skin Images
title_short Analysis: Flawed Datasets of Monkeypox Skin Images
title_sort analysis: flawed datasets of monkeypox skin images
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10024024/
https://www.ncbi.nlm.nih.gov/pubmed/36933065
http://dx.doi.org/10.1007/s10916-023-01928-1
work_keys_str_mv AT vegacarlos analysisflaweddatasetsofmonkeypoxskinimages
AT schneiderreinhard analysisflaweddatasetsofmonkeypoxskinimages
AT satagopamvenkata analysisflaweddatasetsofmonkeypoxskinimages