Cargando…

Learning Low-Dimensional Representations of Medical Concepts

We show how to learn low-dimensional representations (embeddings) of a wide range of concepts in medicine, including diseases (e.g., ICD9 codes), medications, procedures, and laboratory tests. We expect that these embeddings will be useful across medical informatics for tasks such as cohort selectio...

Descripción completa

Detalles Bibliográficos
Autores principales: Choi, Youngduck, Chiu, Chill Yi-I, Sontag, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Medical Informatics Association 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5001761/
https://www.ncbi.nlm.nih.gov/pubmed/27570647
_version_ 1782450478998618112
author Choi, Youngduck
Chiu, Chill Yi-I
Sontag, David
author_facet Choi, Youngduck
Chiu, Chill Yi-I
Sontag, David
author_sort Choi, Youngduck
collection PubMed
description We show how to learn low-dimensional representations (embeddings) of a wide range of concepts in medicine, including diseases (e.g., ICD9 codes), medications, procedures, and laboratory tests. We expect that these embeddings will be useful across medical informatics for tasks such as cohort selection and patient summarization. These embeddings are learned using a technique called neural language modeling from the natural language processing community. However, rather than learning the embeddings solely from text, we show how to learn the embeddings from claims data, which is widely available both to providers and to payers. We also show that with a simple algorithmic adjustment, it is possible to learn medical concept embeddings in a privacy preserving manner from co-occurrence counts derived from clinical narratives. Finally, we establish a methodological framework, arising from standard medical ontologies such as UMLS, NDF-RT, and CCS, to further investigate the embeddings and precisely characterize their quantitative properties.
format Online
Article
Text
id pubmed-5001761
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher American Medical Informatics Association
record_format MEDLINE/PubMed
spelling pubmed-50017612016-08-26 Learning Low-Dimensional Representations of Medical Concepts Choi, Youngduck Chiu, Chill Yi-I Sontag, David AMIA Jt Summits Transl Sci Proc Articles We show how to learn low-dimensional representations (embeddings) of a wide range of concepts in medicine, including diseases (e.g., ICD9 codes), medications, procedures, and laboratory tests. We expect that these embeddings will be useful across medical informatics for tasks such as cohort selection and patient summarization. These embeddings are learned using a technique called neural language modeling from the natural language processing community. However, rather than learning the embeddings solely from text, we show how to learn the embeddings from claims data, which is widely available both to providers and to payers. We also show that with a simple algorithmic adjustment, it is possible to learn medical concept embeddings in a privacy preserving manner from co-occurrence counts derived from clinical narratives. Finally, we establish a methodological framework, arising from standard medical ontologies such as UMLS, NDF-RT, and CCS, to further investigate the embeddings and precisely characterize their quantitative properties. American Medical Informatics Association 2016-07-20 /pmc/articles/PMC5001761/ /pubmed/27570647 Text en ©2016 AMIA - All rights reserved. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose
spellingShingle Articles
Choi, Youngduck
Chiu, Chill Yi-I
Sontag, David
Learning Low-Dimensional Representations of Medical Concepts
title Learning Low-Dimensional Representations of Medical Concepts
title_full Learning Low-Dimensional Representations of Medical Concepts
title_fullStr Learning Low-Dimensional Representations of Medical Concepts
title_full_unstemmed Learning Low-Dimensional Representations of Medical Concepts
title_short Learning Low-Dimensional Representations of Medical Concepts
title_sort learning low-dimensional representations of medical concepts
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5001761/
https://www.ncbi.nlm.nih.gov/pubmed/27570647
work_keys_str_mv AT choiyoungduck learninglowdimensionalrepresentationsofmedicalconcepts
AT chiuchillyii learninglowdimensionalrepresentationsofmedicalconcepts
AT sontagdavid learninglowdimensionalrepresentationsofmedicalconcepts