Cargando…

Learning Contextual Hierarchical Structure of Medical Concepts with Poincairé Embeddings to Clarify Phenotypes

Biomedical association studies are increasingly done using clinical concepts, and in particular diagnostic codes from clinical data repositories as phenotypes. Clinical concepts can be represented in a meaningful, vector space using word embedding models. These embeddings allow for comparison betwee...

Descripción completa

Detalles Bibliográficos
Autores principales: Beaulieu-Jones, Brett K., Kohane, Isaac S., Beam, Andrew L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6417814/
https://www.ncbi.nlm.nih.gov/pubmed/30864306
_version_ 1783403627233148928
author Beaulieu-Jones, Brett K.
Kohane, Isaac S.
Beam, Andrew L.
author_facet Beaulieu-Jones, Brett K.
Kohane, Isaac S.
Beam, Andrew L.
author_sort Beaulieu-Jones, Brett K.
collection PubMed
description Biomedical association studies are increasingly done using clinical concepts, and in particular diagnostic codes from clinical data repositories as phenotypes. Clinical concepts can be represented in a meaningful, vector space using word embedding models. These embeddings allow for comparison between clinical concepts or for straightforward input to machine learning models. Using traditional approaches, good representations require high dimensionality, making downstream tasks such as visualization more difficult. We applied Poincaré embeddings in a 2-dimensional hyperbolic space to a large-scale administrative claims database and show performance comparable to 100-dimensional embeddings in a euclidean space. We then examine disease relationships under different disease contexts to better understand potential phenotypes.
format Online
Article
Text
id pubmed-6417814
institution National Center for Biotechnology Information
language English
publishDate 2019
record_format MEDLINE/PubMed
spelling pubmed-64178142019-03-14 Learning Contextual Hierarchical Structure of Medical Concepts with Poincairé Embeddings to Clarify Phenotypes Beaulieu-Jones, Brett K. Kohane, Isaac S. Beam, Andrew L. Pac Symp Biocomput Article Biomedical association studies are increasingly done using clinical concepts, and in particular diagnostic codes from clinical data repositories as phenotypes. Clinical concepts can be represented in a meaningful, vector space using word embedding models. These embeddings allow for comparison between clinical concepts or for straightforward input to machine learning models. Using traditional approaches, good representations require high dimensionality, making downstream tasks such as visualization more difficult. We applied Poincaré embeddings in a 2-dimensional hyperbolic space to a large-scale administrative claims database and show performance comparable to 100-dimensional embeddings in a euclidean space. We then examine disease relationships under different disease contexts to better understand potential phenotypes. 2019 /pmc/articles/PMC6417814/ /pubmed/30864306 Text en Open Access chapter published by World Scientific Publishing Company and distributed under the terms of the Creative Commons Attribution Non-Commercial (CC BY-NC) 4.0 License (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Beaulieu-Jones, Brett K.
Kohane, Isaac S.
Beam, Andrew L.
Learning Contextual Hierarchical Structure of Medical Concepts with Poincairé Embeddings to Clarify Phenotypes
title Learning Contextual Hierarchical Structure of Medical Concepts with Poincairé Embeddings to Clarify Phenotypes
title_full Learning Contextual Hierarchical Structure of Medical Concepts with Poincairé Embeddings to Clarify Phenotypes
title_fullStr Learning Contextual Hierarchical Structure of Medical Concepts with Poincairé Embeddings to Clarify Phenotypes
title_full_unstemmed Learning Contextual Hierarchical Structure of Medical Concepts with Poincairé Embeddings to Clarify Phenotypes
title_short Learning Contextual Hierarchical Structure of Medical Concepts with Poincairé Embeddings to Clarify Phenotypes
title_sort learning contextual hierarchical structure of medical concepts with poincairé embeddings to clarify phenotypes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6417814/
https://www.ncbi.nlm.nih.gov/pubmed/30864306
work_keys_str_mv AT beaulieujonesbrettk learningcontextualhierarchicalstructureofmedicalconceptswithpoincaireembeddingstoclarifyphenotypes
AT kohaneisaacs learningcontextualhierarchicalstructureofmedicalconceptswithpoincaireembeddingstoclarifyphenotypes
AT beamandrewl learningcontextualhierarchicalstructureofmedicalconceptswithpoincaireembeddingstoclarifyphenotypes