Cargando…
Differences between human and machine perception in medical diagnosis
Deep neural networks (DNNs) show promise in image-based medical diagnosis, but cannot be fully trusted since they can fail for reasons unrelated to underlying pathology. Humans are less likely to make such superficial mistakes, since they use features that are grounded on medical science. It is ther...
Autores principales: | , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9046399/ https://www.ncbi.nlm.nih.gov/pubmed/35477730 http://dx.doi.org/10.1038/s41598-022-10526-z |
_version_ | 1784695515518074880 |
---|---|
author | Makino, Taro Jastrzębski, Stanisław Oleszkiewicz, Witold Chacko, Celin Ehrenpreis, Robin Samreen, Naziya Chhor, Chloe Kim, Eric Lee, Jiyon Pysarenko, Kristine Reig, Beatriu Toth, Hildegard Awal, Divya Du, Linda Kim, Alice Park, James Sodickson, Daniel K. Heacock, Laura Moy, Linda Cho, Kyunghyun Geras, Krzysztof J. |
author_facet | Makino, Taro Jastrzębski, Stanisław Oleszkiewicz, Witold Chacko, Celin Ehrenpreis, Robin Samreen, Naziya Chhor, Chloe Kim, Eric Lee, Jiyon Pysarenko, Kristine Reig, Beatriu Toth, Hildegard Awal, Divya Du, Linda Kim, Alice Park, James Sodickson, Daniel K. Heacock, Laura Moy, Linda Cho, Kyunghyun Geras, Krzysztof J. |
author_sort | Makino, Taro |
collection | PubMed |
description | Deep neural networks (DNNs) show promise in image-based medical diagnosis, but cannot be fully trusted since they can fail for reasons unrelated to underlying pathology. Humans are less likely to make such superficial mistakes, since they use features that are grounded on medical science. It is therefore important to know whether DNNs use different features than humans. Towards this end, we propose a framework for comparing human and machine perception in medical diagnosis. We frame the comparison in terms of perturbation robustness, and mitigate Simpson’s paradox by performing a subgroup analysis. The framework is demonstrated with a case study in breast cancer screening, where we separately analyze microcalcifications and soft tissue lesions. While it is inconclusive whether humans and DNNs use different features to detect microcalcifications, we find that for soft tissue lesions, DNNs rely on high frequency components ignored by radiologists. Moreover, these features are located outside of the region of the images found most suspicious by radiologists. This difference between humans and machines was only visible through subgroup analysis, which highlights the importance of incorporating medical domain knowledge into the comparison. |
format | Online Article Text |
id | pubmed-9046399 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-90463992022-04-29 Differences between human and machine perception in medical diagnosis Makino, Taro Jastrzębski, Stanisław Oleszkiewicz, Witold Chacko, Celin Ehrenpreis, Robin Samreen, Naziya Chhor, Chloe Kim, Eric Lee, Jiyon Pysarenko, Kristine Reig, Beatriu Toth, Hildegard Awal, Divya Du, Linda Kim, Alice Park, James Sodickson, Daniel K. Heacock, Laura Moy, Linda Cho, Kyunghyun Geras, Krzysztof J. Sci Rep Article Deep neural networks (DNNs) show promise in image-based medical diagnosis, but cannot be fully trusted since they can fail for reasons unrelated to underlying pathology. Humans are less likely to make such superficial mistakes, since they use features that are grounded on medical science. It is therefore important to know whether DNNs use different features than humans. Towards this end, we propose a framework for comparing human and machine perception in medical diagnosis. We frame the comparison in terms of perturbation robustness, and mitigate Simpson’s paradox by performing a subgroup analysis. The framework is demonstrated with a case study in breast cancer screening, where we separately analyze microcalcifications and soft tissue lesions. While it is inconclusive whether humans and DNNs use different features to detect microcalcifications, we find that for soft tissue lesions, DNNs rely on high frequency components ignored by radiologists. Moreover, these features are located outside of the region of the images found most suspicious by radiologists. This difference between humans and machines was only visible through subgroup analysis, which highlights the importance of incorporating medical domain knowledge into the comparison. Nature Publishing Group UK 2022-04-27 /pmc/articles/PMC9046399/ /pubmed/35477730 http://dx.doi.org/10.1038/s41598-022-10526-z Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Makino, Taro Jastrzębski, Stanisław Oleszkiewicz, Witold Chacko, Celin Ehrenpreis, Robin Samreen, Naziya Chhor, Chloe Kim, Eric Lee, Jiyon Pysarenko, Kristine Reig, Beatriu Toth, Hildegard Awal, Divya Du, Linda Kim, Alice Park, James Sodickson, Daniel K. Heacock, Laura Moy, Linda Cho, Kyunghyun Geras, Krzysztof J. Differences between human and machine perception in medical diagnosis |
title | Differences between human and machine perception in medical diagnosis |
title_full | Differences between human and machine perception in medical diagnosis |
title_fullStr | Differences between human and machine perception in medical diagnosis |
title_full_unstemmed | Differences between human and machine perception in medical diagnosis |
title_short | Differences between human and machine perception in medical diagnosis |
title_sort | differences between human and machine perception in medical diagnosis |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9046399/ https://www.ncbi.nlm.nih.gov/pubmed/35477730 http://dx.doi.org/10.1038/s41598-022-10526-z |
work_keys_str_mv | AT makinotaro differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT jastrzebskistanisław differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT oleszkiewiczwitold differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT chackocelin differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT ehrenpreisrobin differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT samreennaziya differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT chhorchloe differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT kimeric differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT leejiyon differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT pysarenkokristine differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT reigbeatriu differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT tothhildegard differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT awaldivya differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT dulinda differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT kimalice differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT parkjames differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT sodicksondanielk differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT heacocklaura differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT moylinda differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT chokyunghyun differencesbetweenhumanandmachineperceptioninmedicaldiagnosis AT geraskrzysztofj differencesbetweenhumanandmachineperceptioninmedicaldiagnosis |