Cargando…

The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data

Electronic phenotyping is the task of ascertaining whether an individual has a medical condition of interest by analyzing their medical record and is foundational in clinical informatics. Increasingly, electronic phenotyping is performed via supervised learning. We investigate the effectiveness of m...

Descripción completa

Detalles Bibliográficos
Autores principales: Ding, Daisy Yi, Simpson, Chloé, Pfohl, Stephen, Kale, Dave C., Jung, Kenneth, Shah, Nigam H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6662921/
https://www.ncbi.nlm.nih.gov/pubmed/30864307
_version_ 1783439738965852160
author Ding, Daisy Yi
Simpson, Chloé
Pfohl, Stephen
Kale, Dave C.
Jung, Kenneth
Shah, Nigam H.
author_facet Ding, Daisy Yi
Simpson, Chloé
Pfohl, Stephen
Kale, Dave C.
Jung, Kenneth
Shah, Nigam H.
author_sort Ding, Daisy Yi
collection PubMed
description Electronic phenotyping is the task of ascertaining whether an individual has a medical condition of interest by analyzing their medical record and is foundational in clinical informatics. Increasingly, electronic phenotyping is performed via supervised learning. We investigate the effectiveness of multitask learning for phenotyping using electronic health records (EHR) data. Multitask learning aims to improve model performance on a target task by jointly learning additional auxiliary tasks and has been used in disparate areas of machine learning. However, its utility when applied to EHR data has not been established, and prior work suggests that its benefits are inconsistent. We present experiments that elucidate when multitask learning with neural nets improves performance for phenotyping using EHR data relative to neural nets trained for a single phenotype and to well-tuned baselines. We find that multitask neural nets consistently outperform single-task neural nets for rare phenotypes but underperform for relatively more common phenotypes. The effect size increases as more auxiliary tasks are added. Moreover, multitask learning reduces the sensitivity of neural nets to hyperparameter settings for rare phenotypes. Last, we quantify phenotype complexity and find that neural nets trained with or without multitask learning do not improve on simple baselines unless the phenotypes are sufficiently complex.
format Online
Article
Text
id pubmed-6662921
institution National Center for Biotechnology Information
language English
publishDate 2019
record_format MEDLINE/PubMed
spelling pubmed-66629212019-07-29 The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data Ding, Daisy Yi Simpson, Chloé Pfohl, Stephen Kale, Dave C. Jung, Kenneth Shah, Nigam H. Pac Symp Biocomput Article Electronic phenotyping is the task of ascertaining whether an individual has a medical condition of interest by analyzing their medical record and is foundational in clinical informatics. Increasingly, electronic phenotyping is performed via supervised learning. We investigate the effectiveness of multitask learning for phenotyping using electronic health records (EHR) data. Multitask learning aims to improve model performance on a target task by jointly learning additional auxiliary tasks and has been used in disparate areas of machine learning. However, its utility when applied to EHR data has not been established, and prior work suggests that its benefits are inconsistent. We present experiments that elucidate when multitask learning with neural nets improves performance for phenotyping using EHR data relative to neural nets trained for a single phenotype and to well-tuned baselines. We find that multitask neural nets consistently outperform single-task neural nets for rare phenotypes but underperform for relatively more common phenotypes. The effect size increases as more auxiliary tasks are added. Moreover, multitask learning reduces the sensitivity of neural nets to hyperparameter settings for rare phenotypes. Last, we quantify phenotype complexity and find that neural nets trained with or without multitask learning do not improve on simple baselines unless the phenotypes are sufficiently complex. 2019 /pmc/articles/PMC6662921/ /pubmed/30864307 Text en http://creativecommons.org/licenses/by/4.0/ Open Access chapter published by World Scientific Publishing Company and distributed under the terms of the Creative Commons Attribution Non-Commercial (CC BY-NC) 4.0 License.
spellingShingle Article
Ding, Daisy Yi
Simpson, Chloé
Pfohl, Stephen
Kale, Dave C.
Jung, Kenneth
Shah, Nigam H.
The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data
title The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data
title_full The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data
title_fullStr The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data
title_full_unstemmed The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data
title_short The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data
title_sort effectiveness of multitask learning for phenotyping with electronic health records data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6662921/
https://www.ncbi.nlm.nih.gov/pubmed/30864307
work_keys_str_mv AT dingdaisyyi theeffectivenessofmultitasklearningforphenotypingwithelectronichealthrecordsdata
AT simpsonchloe theeffectivenessofmultitasklearningforphenotypingwithelectronichealthrecordsdata
AT pfohlstephen theeffectivenessofmultitasklearningforphenotypingwithelectronichealthrecordsdata
AT kaledavec theeffectivenessofmultitasklearningforphenotypingwithelectronichealthrecordsdata
AT jungkenneth theeffectivenessofmultitasklearningforphenotypingwithelectronichealthrecordsdata
AT shahnigamh theeffectivenessofmultitasklearningforphenotypingwithelectronichealthrecordsdata
AT dingdaisyyi effectivenessofmultitasklearningforphenotypingwithelectronichealthrecordsdata
AT simpsonchloe effectivenessofmultitasklearningforphenotypingwithelectronichealthrecordsdata
AT pfohlstephen effectivenessofmultitasklearningforphenotypingwithelectronichealthrecordsdata
AT kaledavec effectivenessofmultitasklearningforphenotypingwithelectronichealthrecordsdata
AT jungkenneth effectivenessofmultitasklearningforphenotypingwithelectronichealthrecordsdata
AT shahnigamh effectivenessofmultitasklearningforphenotypingwithelectronichealthrecordsdata