Cargando…

Inferring new relations between medical entities using literature curated term co-occurrences

OBJECTIVES: Identifying new relations between medical entities, such as drugs, diseases, and side effects, is typically a resource-intensive task, involving experimentation and clinical trials. The increased availability of related data and curated knowledge enables a computational approach to this...

Descripción completa

Detalles Bibliográficos
Autores principales: Spiro, Adam, Fernández García, Jonatan, Yanover, Chen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6951958/
https://www.ncbi.nlm.nih.gov/pubmed/31984370
http://dx.doi.org/10.1093/jamiaopen/ooz022
_version_ 1783486366223433728
author Spiro, Adam
Fernández García, Jonatan
Yanover, Chen
author_facet Spiro, Adam
Fernández García, Jonatan
Yanover, Chen
author_sort Spiro, Adam
collection PubMed
description OBJECTIVES: Identifying new relations between medical entities, such as drugs, diseases, and side effects, is typically a resource-intensive task, involving experimentation and clinical trials. The increased availability of related data and curated knowledge enables a computational approach to this task, notably by training models to predict likely relations. Such models rely on meaningful representations of the medical entities being studied. We propose a generic features vector representation that leverages co-occurrences of medical terms, linked with PubMed citations. MATERIALS AND METHODS: We demonstrate the usefulness of the proposed representation by inferring two types of relations: a drug causes a side effect and a drug treats an indication. To predict these relations and assess their effectiveness, we applied 2 modeling approaches: multi-task modeling using neural networks and single-task modeling based on gradient boosting machines and logistic regression. RESULTS: These trained models, which predict either side effects or indications, obtained significantly better results than baseline models that use a single direct co-occurrence feature. The results demonstrate the advantage of a comprehensive representation. DISCUSSION: Selecting the appropriate representation has an immense impact on the predictive performance of machine learning models. Our proposed representation is powerful, as it spans multiple medical domains and can be used to predict a wide range of relation types. CONCLUSION: The discovery of new relations between various medical entities can be translated into meaningful insights, for example, related to drug development or disease understanding. Our representation of medical entities can be used to train models that predict such relations, thus accelerating healthcare-related discoveries.
format Online
Article
Text
id pubmed-6951958
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-69519582020-01-24 Inferring new relations between medical entities using literature curated term co-occurrences Spiro, Adam Fernández García, Jonatan Yanover, Chen JAMIA Open Research and Applications OBJECTIVES: Identifying new relations between medical entities, such as drugs, diseases, and side effects, is typically a resource-intensive task, involving experimentation and clinical trials. The increased availability of related data and curated knowledge enables a computational approach to this task, notably by training models to predict likely relations. Such models rely on meaningful representations of the medical entities being studied. We propose a generic features vector representation that leverages co-occurrences of medical terms, linked with PubMed citations. MATERIALS AND METHODS: We demonstrate the usefulness of the proposed representation by inferring two types of relations: a drug causes a side effect and a drug treats an indication. To predict these relations and assess their effectiveness, we applied 2 modeling approaches: multi-task modeling using neural networks and single-task modeling based on gradient boosting machines and logistic regression. RESULTS: These trained models, which predict either side effects or indications, obtained significantly better results than baseline models that use a single direct co-occurrence feature. The results demonstrate the advantage of a comprehensive representation. DISCUSSION: Selecting the appropriate representation has an immense impact on the predictive performance of machine learning models. Our proposed representation is powerful, as it spans multiple medical domains and can be used to predict a wide range of relation types. CONCLUSION: The discovery of new relations between various medical entities can be translated into meaningful insights, for example, related to drug development or disease understanding. Our representation of medical entities can be used to train models that predict such relations, thus accelerating healthcare-related discoveries. Oxford University Press 2019-07-01 /pmc/articles/PMC6951958/ /pubmed/31984370 http://dx.doi.org/10.1093/jamiaopen/ooz022 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of the American Medical Informatics Association. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Research and Applications
Spiro, Adam
Fernández García, Jonatan
Yanover, Chen
Inferring new relations between medical entities using literature curated term co-occurrences
title Inferring new relations between medical entities using literature curated term co-occurrences
title_full Inferring new relations between medical entities using literature curated term co-occurrences
title_fullStr Inferring new relations between medical entities using literature curated term co-occurrences
title_full_unstemmed Inferring new relations between medical entities using literature curated term co-occurrences
title_short Inferring new relations between medical entities using literature curated term co-occurrences
title_sort inferring new relations between medical entities using literature curated term co-occurrences
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6951958/
https://www.ncbi.nlm.nih.gov/pubmed/31984370
http://dx.doi.org/10.1093/jamiaopen/ooz022
work_keys_str_mv AT spiroadam inferringnewrelationsbetweenmedicalentitiesusingliteraturecuratedtermcooccurrences
AT fernandezgarciajonatan inferringnewrelationsbetweenmedicalentitiesusingliteraturecuratedtermcooccurrences
AT yanoverchen inferringnewrelationsbetweenmedicalentitiesusingliteraturecuratedtermcooccurrences