Cargando…

Combining entity co-occurrence with specialized word embeddings to measure entity relation in Alzheimer’s disease

BACKGROUND: Extracting useful information from biomedical literature plays an important role in the development of modern medicine. In natural language processing, there have been rigorous attempts to find meaningful relationships between entities automatically by co-occurrence-based methods. It has...

Descripción completa

Detalles Bibliográficos
Autores principales: Heo, Go Eun, Xie, Qing, Song, Min, Lee, Jeong-Hoon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6894106/
https://www.ncbi.nlm.nih.gov/pubmed/31801521
http://dx.doi.org/10.1186/s12911-019-0934-5
_version_ 1783476324022616064
author Heo, Go Eun
Xie, Qing
Song, Min
Lee, Jeong-Hoon
author_facet Heo, Go Eun
Xie, Qing
Song, Min
Lee, Jeong-Hoon
author_sort Heo, Go Eun
collection PubMed
description BACKGROUND: Extracting useful information from biomedical literature plays an important role in the development of modern medicine. In natural language processing, there have been rigorous attempts to find meaningful relationships between entities automatically by co-occurrence-based methods. It has been increasingly important to understand whether relationships exist, and if so how strong, between any two entities extracted from a large number of texts. One of the defining methods is to measure semantic similarity and relatedness between two entities. METHODS: We propose a hybrid ranking method that combines a co-occurrence approach considering both direct and indirect entity pair relationship with specialized word embeddings for measuring the relatedness of two entities. RESULTS: We evaluate the proposed ranking method comparatively with other well-known methods such as co-occurrence, Word2Vec, COALS (Correlated Occurrence Analog to Lexical Semantics), and random indexing by calculating top-ranked entities related to Alzheimer’s disease. In addition, we analyze gene, pathway, and gene–phenotype relationships. Overall, the proposed method tends to find more hidden relationships than the other methods. CONCLUSION: Our proposed method is able to select more useful related entities that not only highly co-occur but also have more indirect relations for the target entity. In pathway analysis, our proposed method shows superior performance at identifying (functional) cross clustering and higher-level pathways. Our proposed method, resulting from phenotype analysis, has an advantage in identifying the common genotype relating to phenotypes from biological literature.
format Online
Article
Text
id pubmed-6894106
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68941062019-12-11 Combining entity co-occurrence with specialized word embeddings to measure entity relation in Alzheimer’s disease Heo, Go Eun Xie, Qing Song, Min Lee, Jeong-Hoon BMC Med Inform Decis Mak Research BACKGROUND: Extracting useful information from biomedical literature plays an important role in the development of modern medicine. In natural language processing, there have been rigorous attempts to find meaningful relationships between entities automatically by co-occurrence-based methods. It has been increasingly important to understand whether relationships exist, and if so how strong, between any two entities extracted from a large number of texts. One of the defining methods is to measure semantic similarity and relatedness between two entities. METHODS: We propose a hybrid ranking method that combines a co-occurrence approach considering both direct and indirect entity pair relationship with specialized word embeddings for measuring the relatedness of two entities. RESULTS: We evaluate the proposed ranking method comparatively with other well-known methods such as co-occurrence, Word2Vec, COALS (Correlated Occurrence Analog to Lexical Semantics), and random indexing by calculating top-ranked entities related to Alzheimer’s disease. In addition, we analyze gene, pathway, and gene–phenotype relationships. Overall, the proposed method tends to find more hidden relationships than the other methods. CONCLUSION: Our proposed method is able to select more useful related entities that not only highly co-occur but also have more indirect relations for the target entity. In pathway analysis, our proposed method shows superior performance at identifying (functional) cross clustering and higher-level pathways. Our proposed method, resulting from phenotype analysis, has an advantage in identifying the common genotype relating to phenotypes from biological literature. BioMed Central 2019-12-05 /pmc/articles/PMC6894106/ /pubmed/31801521 http://dx.doi.org/10.1186/s12911-019-0934-5 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Heo, Go Eun
Xie, Qing
Song, Min
Lee, Jeong-Hoon
Combining entity co-occurrence with specialized word embeddings to measure entity relation in Alzheimer’s disease
title Combining entity co-occurrence with specialized word embeddings to measure entity relation in Alzheimer’s disease
title_full Combining entity co-occurrence with specialized word embeddings to measure entity relation in Alzheimer’s disease
title_fullStr Combining entity co-occurrence with specialized word embeddings to measure entity relation in Alzheimer’s disease
title_full_unstemmed Combining entity co-occurrence with specialized word embeddings to measure entity relation in Alzheimer’s disease
title_short Combining entity co-occurrence with specialized word embeddings to measure entity relation in Alzheimer’s disease
title_sort combining entity co-occurrence with specialized word embeddings to measure entity relation in alzheimer’s disease
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6894106/
https://www.ncbi.nlm.nih.gov/pubmed/31801521
http://dx.doi.org/10.1186/s12911-019-0934-5
work_keys_str_mv AT heogoeun combiningentitycooccurrencewithspecializedwordembeddingstomeasureentityrelationinalzheimersdisease
AT xieqing combiningentitycooccurrencewithspecializedwordembeddingstomeasureentityrelationinalzheimersdisease
AT songmin combiningentitycooccurrencewithspecializedwordembeddingstomeasureentityrelationinalzheimersdisease
AT leejeonghoon combiningentitycooccurrencewithspecializedwordembeddingstomeasureentityrelationinalzheimersdisease