Cargando…

Differences between human and machine perception in medical diagnosis

Deep neural networks (DNNs) show promise in image-based medical diagnosis, but cannot be fully trusted since they can fail for reasons unrelated to underlying pathology. Humans are less likely to make such superficial mistakes, since they use features that are grounded on medical science. It is ther...

Descripción completa

Detalles Bibliográficos
Autores principales: Makino, Taro, Jastrzębski, Stanisław, Oleszkiewicz, Witold, Chacko, Celin, Ehrenpreis, Robin, Samreen, Naziya, Chhor, Chloe, Kim, Eric, Lee, Jiyon, Pysarenko, Kristine, Reig, Beatriu, Toth, Hildegard, Awal, Divya, Du, Linda, Kim, Alice, Park, James, Sodickson, Daniel K., Heacock, Laura, Moy, Linda, Cho, Kyunghyun, Geras, Krzysztof J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9046399/
https://www.ncbi.nlm.nih.gov/pubmed/35477730
http://dx.doi.org/10.1038/s41598-022-10526-z
_version_ 1784695515518074880
author Makino, Taro
Jastrzębski, Stanisław
Oleszkiewicz, Witold
Chacko, Celin
Ehrenpreis, Robin
Samreen, Naziya
Chhor, Chloe
Kim, Eric
Lee, Jiyon
Pysarenko, Kristine
Reig, Beatriu
Toth, Hildegard
Awal, Divya
Du, Linda
Kim, Alice
Park, James
Sodickson, Daniel K.
Heacock, Laura
Moy, Linda
Cho, Kyunghyun
Geras, Krzysztof J.
author_facet Makino, Taro
Jastrzębski, Stanisław
Oleszkiewicz, Witold
Chacko, Celin
Ehrenpreis, Robin
Samreen, Naziya
Chhor, Chloe
Kim, Eric
Lee, Jiyon
Pysarenko, Kristine
Reig, Beatriu
Toth, Hildegard
Awal, Divya
Du, Linda
Kim, Alice
Park, James
Sodickson, Daniel K.
Heacock, Laura
Moy, Linda
Cho, Kyunghyun
Geras, Krzysztof J.
author_sort Makino, Taro
collection PubMed
description Deep neural networks (DNNs) show promise in image-based medical diagnosis, but cannot be fully trusted since they can fail for reasons unrelated to underlying pathology. Humans are less likely to make such superficial mistakes, since they use features that are grounded on medical science. It is therefore important to know whether DNNs use different features than humans. Towards this end, we propose a framework for comparing human and machine perception in medical diagnosis. We frame the comparison in terms of perturbation robustness, and mitigate Simpson’s paradox by performing a subgroup analysis. The framework is demonstrated with a case study in breast cancer screening, where we separately analyze microcalcifications and soft tissue lesions. While it is inconclusive whether humans and DNNs use different features to detect microcalcifications, we find that for soft tissue lesions, DNNs rely on high frequency components ignored by radiologists. Moreover, these features are located outside of the region of the images found most suspicious by radiologists. This difference between humans and machines was only visible through subgroup analysis, which highlights the importance of incorporating medical domain knowledge into the comparison.
format Online
Article
Text
id pubmed-9046399
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-90463992022-04-29 Differences between human and machine perception in medical diagnosis Makino, Taro Jastrzębski, Stanisław Oleszkiewicz, Witold Chacko, Celin Ehrenpreis, Robin Samreen, Naziya Chhor, Chloe Kim, Eric Lee, Jiyon Pysarenko, Kristine Reig, Beatriu Toth, Hildegard Awal, Divya Du, Linda Kim, Alice Park, James Sodickson, Daniel K. Heacock, Laura Moy, Linda Cho, Kyunghyun Geras, Krzysztof J. Sci Rep Article Deep neural networks (DNNs) show promise in image-based medical diagnosis, but cannot be fully trusted since they can fail for reasons unrelated to underlying pathology. Humans are less likely to make such superficial mistakes, since they use features that are grounded on medical science. It is therefore important to know whether DNNs use different features than humans. Towards this end, we propose a framework for comparing human and machine perception in medical diagnosis. We frame the comparison in terms of perturbation robustness, and mitigate Simpson’s paradox by performing a subgroup analysis. The framework is demonstrated with a case study in breast cancer screening, where we separately analyze microcalcifications and soft tissue lesions. While it is inconclusive whether humans and DNNs use different features to detect microcalcifications, we find that for soft tissue lesions, DNNs rely on high frequency components ignored by radiologists. Moreover, these features are located outside of the region of the images found most suspicious by radiologists. This difference between humans and machines was only visible through subgroup analysis, which highlights the importance of incorporating medical domain knowledge into the comparison. Nature Publishing Group UK 2022-04-27 /pmc/articles/PMC9046399/ /pubmed/35477730 http://dx.doi.org/10.1038/s41598-022-10526-z Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Makino, Taro
Jastrzębski, Stanisław
Oleszkiewicz, Witold
Chacko, Celin
Ehrenpreis, Robin
Samreen, Naziya
Chhor, Chloe
Kim, Eric
Lee, Jiyon
Pysarenko, Kristine
Reig, Beatriu
Toth, Hildegard
Awal, Divya
Du, Linda
Kim, Alice
Park, James
Sodickson, Daniel K.
Heacock, Laura
Moy, Linda
Cho, Kyunghyun
Geras, Krzysztof J.
Differences between human and machine perception in medical diagnosis
title Differences between human and machine perception in medical diagnosis
title_full Differences between human and machine perception in medical diagnosis
title_fullStr Differences between human and machine perception in medical diagnosis
title_full_unstemmed Differences between human and machine perception in medical diagnosis
title_short Differences between human and machine perception in medical diagnosis
title_sort differences between human and machine perception in medical diagnosis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9046399/
https://www.ncbi.nlm.nih.gov/pubmed/35477730
http://dx.doi.org/10.1038/s41598-022-10526-z
work_keys_str_mv AT makinotaro differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT jastrzebskistanisław differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT oleszkiewiczwitold differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT chackocelin differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT ehrenpreisrobin differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT samreennaziya differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT chhorchloe differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT kimeric differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT leejiyon differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT pysarenkokristine differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT reigbeatriu differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT tothhildegard differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT awaldivya differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT dulinda differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT kimalice differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT parkjames differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT sodicksondanielk differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT heacocklaura differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT moylinda differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT chokyunghyun differencesbetweenhumanandmachineperceptioninmedicaldiagnosis
AT geraskrzysztofj differencesbetweenhumanandmachineperceptioninmedicaldiagnosis