Cargando…
A structural characterization of shortcut features for prediction
With the rising use of machine learning for healthcare applications, practitioners are increasingly confronted with the limitations of prediction models that are trained in one setting but meant to be deployed in several others. One recently identified limitation is so-called shortcut learning, wher...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Netherlands
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9256901/ https://www.ncbi.nlm.nih.gov/pubmed/35792990 http://dx.doi.org/10.1007/s10654-022-00892-3 |
_version_ | 1784741214535286784 |
---|---|
author | Bellamy, David Hernán, Miguel A. Beam, Andrew |
author_facet | Bellamy, David Hernán, Miguel A. Beam, Andrew |
author_sort | Bellamy, David |
collection | PubMed |
description | With the rising use of machine learning for healthcare applications, practitioners are increasingly confronted with the limitations of prediction models that are trained in one setting but meant to be deployed in several others. One recently identified limitation is so-called shortcut learning, whereby a model learns to associate features with the prediction target that do not maintain their relationship across settings. Famously, the watermark on chest x-rays has been demonstrated to be an instance of a shortcut feature. In this viewpoint, we attempt to give a structural characterization of shortcut features in terms of causal DAGs. This is the first attempt at defining shortcut features in terms of their causal relationship with a model’s prediction target. |
format | Online Article Text |
id | pubmed-9256901 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer Netherlands |
record_format | MEDLINE/PubMed |
spelling | pubmed-92569012022-07-06 A structural characterization of shortcut features for prediction Bellamy, David Hernán, Miguel A. Beam, Andrew Eur J Epidemiol Essay With the rising use of machine learning for healthcare applications, practitioners are increasingly confronted with the limitations of prediction models that are trained in one setting but meant to be deployed in several others. One recently identified limitation is so-called shortcut learning, whereby a model learns to associate features with the prediction target that do not maintain their relationship across settings. Famously, the watermark on chest x-rays has been demonstrated to be an instance of a shortcut feature. In this viewpoint, we attempt to give a structural characterization of shortcut features in terms of causal DAGs. This is the first attempt at defining shortcut features in terms of their causal relationship with a model’s prediction target. Springer Netherlands 2022-07-06 2022 /pmc/articles/PMC9256901/ /pubmed/35792990 http://dx.doi.org/10.1007/s10654-022-00892-3 Text en © Springer Nature B.V. 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Essay Bellamy, David Hernán, Miguel A. Beam, Andrew A structural characterization of shortcut features for prediction |
title | A structural characterization of shortcut features for prediction |
title_full | A structural characterization of shortcut features for prediction |
title_fullStr | A structural characterization of shortcut features for prediction |
title_full_unstemmed | A structural characterization of shortcut features for prediction |
title_short | A structural characterization of shortcut features for prediction |
title_sort | structural characterization of shortcut features for prediction |
topic | Essay |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9256901/ https://www.ncbi.nlm.nih.gov/pubmed/35792990 http://dx.doi.org/10.1007/s10654-022-00892-3 |
work_keys_str_mv | AT bellamydavid astructuralcharacterizationofshortcutfeaturesforprediction AT hernanmiguela astructuralcharacterizationofshortcutfeaturesforprediction AT beamandrew astructuralcharacterizationofshortcutfeaturesforprediction AT bellamydavid structuralcharacterizationofshortcutfeaturesforprediction AT hernanmiguela structuralcharacterizationofshortcutfeaturesforprediction AT beamandrew structuralcharacterizationofshortcutfeaturesforprediction |