Cargando…

Literature mining for context-specific molecular relations using multimodal representations (COMMODAR)

ABSTRACT: Biological contextual information helps understand various phenomena occurring in the biological systems consisting of complex molecular relations. The construction of context-specific relational resources vastly relies on laborious manual extraction from unstructured literature. In this p...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lee, Jaehyun, Lee, Doheon, Lee, Kwang Hyung
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2020
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7586695/ https://www.ncbi.nlm.nih.gov/pubmed/33106154 http://dx.doi.org/10.1186/s12859-020-3396-y

_version_	1783600052380368896
author	Lee, Jaehyun Lee, Doheon Lee, Kwang Hyung
author_facet	Lee, Jaehyun Lee, Doheon Lee, Kwang Hyung
author_sort	Lee, Jaehyun
collection	PubMed
description	ABSTRACT: Biological contextual information helps understand various phenomena occurring in the biological systems consisting of complex molecular relations. The construction of context-specific relational resources vastly relies on laborious manual extraction from unstructured literature. In this paper, we propose COMMODAR, a machine learning-based literature mining framework for context-specific molecular relations using multimodal representations. The main idea of COMMODAR is the feature augmentation by the cooperation of multimodal representations for relation extraction. We leveraged biomedical domain knowledge as well as canonical linguistic information for more comprehensive representations of textual sources. The models based on multiple modalities outperformed those solely based on the linguistic modality. We applied COMMODAR to the 14 million PubMed abstracts and extracted 9214 context-specific molecular relations. All corpora, extracted data, evaluation results, and the implementation code are downloadable at https://github.com/jae-hyun-lee/commodar. CCS CONCEPTS: • Computing methodologies~Information extraction • Computing methodologies~Neural networks • Applied computing~Biological networks.
format	Online Article Text
id	pubmed-7586695
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-75866952020-10-27 Literature mining for context-specific molecular relations using multimodal representations (COMMODAR) Lee, Jaehyun Lee, Doheon Lee, Kwang Hyung BMC Bioinformatics Research ABSTRACT: Biological contextual information helps understand various phenomena occurring in the biological systems consisting of complex molecular relations. The construction of context-specific relational resources vastly relies on laborious manual extraction from unstructured literature. In this paper, we propose COMMODAR, a machine learning-based literature mining framework for context-specific molecular relations using multimodal representations. The main idea of COMMODAR is the feature augmentation by the cooperation of multimodal representations for relation extraction. We leveraged biomedical domain knowledge as well as canonical linguistic information for more comprehensive representations of textual sources. The models based on multiple modalities outperformed those solely based on the linguistic modality. We applied COMMODAR to the 14 million PubMed abstracts and extracted 9214 context-specific molecular relations. All corpora, extracted data, evaluation results, and the implementation code are downloadable at https://github.com/jae-hyun-lee/commodar. CCS CONCEPTS: • Computing methodologies~Information extraction • Computing methodologies~Neural networks • Applied computing~Biological networks. BioMed Central 2020-10-26 /pmc/articles/PMC7586695/ /pubmed/33106154 http://dx.doi.org/10.1186/s12859-020-3396-y Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Lee, Jaehyun Lee, Doheon Lee, Kwang Hyung Literature mining for context-specific molecular relations using multimodal representations (COMMODAR)
title	Literature mining for context-specific molecular relations using multimodal representations (COMMODAR)
title_full	Literature mining for context-specific molecular relations using multimodal representations (COMMODAR)
title_fullStr	Literature mining for context-specific molecular relations using multimodal representations (COMMODAR)
title_full_unstemmed	Literature mining for context-specific molecular relations using multimodal representations (COMMODAR)
title_short	Literature mining for context-specific molecular relations using multimodal representations (COMMODAR)
title_sort	literature mining for context-specific molecular relations using multimodal representations (commodar)
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7586695/ https://www.ncbi.nlm.nih.gov/pubmed/33106154 http://dx.doi.org/10.1186/s12859-020-3396-y
work_keys_str_mv	AT leejaehyun literatureminingforcontextspecificmolecularrelationsusingmultimodalrepresentationscommodar AT leedoheon literatureminingforcontextspecificmolecularrelationsusingmultimodalrepresentationscommodar AT leekwanghyung literatureminingforcontextspecificmolecularrelationsusingmultimodalrepresentationscommodar

Literature mining for context-specific molecular relations using multimodal representations (COMMODAR)

Ejemplares similares