Cargando…

Hierarchical sequence labeling for extracting BEL statements from biomedical literature

BACKGROUND: Extracting relations between bio-entities from biomedical literature is often a challenging task and also an essential step towards biomedical knowledge expansion. The BioCreative community has organized a shared task to evaluate the robustness of the causal relationship extraction algor...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Suwen, Shao, Yifan, Qian, Longhua, Zhou, Guodong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454591/
https://www.ncbi.nlm.nih.gov/pubmed/30961584
http://dx.doi.org/10.1186/s12911-019-0758-3
_version_ 1783409566240735232
author Liu, Suwen
Shao, Yifan
Qian, Longhua
Zhou, Guodong
author_facet Liu, Suwen
Shao, Yifan
Qian, Longhua
Zhou, Guodong
author_sort Liu, Suwen
collection PubMed
description BACKGROUND: Extracting relations between bio-entities from biomedical literature is often a challenging task and also an essential step towards biomedical knowledge expansion. The BioCreative community has organized a shared task to evaluate the robustness of the causal relationship extraction algorithms in Biological Expression Language (BEL) from biomedical literature. METHOD: We first map the sentence-level BEL statements in the BC-V training corpus to the corresponding text segments, thus generating hierarchically tagged training instances. A hierarchical sequence labeling model was afterwards induced from these training instances and applied to the test sentences in order to construct the BEL statements. RESULTS: The experimental results on extracting BEL statements from BioCreative V Track 4 test corpus show that our method achieves promising performance with an overall F-measure of 31.6%. Furthermore, it has the potential to be enhanced by adopting more advanced machine learning approaches. CONCLUSION: We propose a framework for hierarchical relation extraction using hierarchical sequence labeling on the instance-level training corpus derived from the original sentence-level corpus via word alignment. Its main advantage is that we can make full use of the original training corpus to induce the sequence labelers and then apply them to the test corpus.
format Online
Article
Text
id pubmed-6454591
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64545912019-04-19 Hierarchical sequence labeling for extracting BEL statements from biomedical literature Liu, Suwen Shao, Yifan Qian, Longhua Zhou, Guodong BMC Med Inform Decis Mak Research BACKGROUND: Extracting relations between bio-entities from biomedical literature is often a challenging task and also an essential step towards biomedical knowledge expansion. The BioCreative community has organized a shared task to evaluate the robustness of the causal relationship extraction algorithms in Biological Expression Language (BEL) from biomedical literature. METHOD: We first map the sentence-level BEL statements in the BC-V training corpus to the corresponding text segments, thus generating hierarchically tagged training instances. A hierarchical sequence labeling model was afterwards induced from these training instances and applied to the test sentences in order to construct the BEL statements. RESULTS: The experimental results on extracting BEL statements from BioCreative V Track 4 test corpus show that our method achieves promising performance with an overall F-measure of 31.6%. Furthermore, it has the potential to be enhanced by adopting more advanced machine learning approaches. CONCLUSION: We propose a framework for hierarchical relation extraction using hierarchical sequence labeling on the instance-level training corpus derived from the original sentence-level corpus via word alignment. Its main advantage is that we can make full use of the original training corpus to induce the sequence labelers and then apply them to the test corpus. BioMed Central 2019-04-09 /pmc/articles/PMC6454591/ /pubmed/30961584 http://dx.doi.org/10.1186/s12911-019-0758-3 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Liu, Suwen
Shao, Yifan
Qian, Longhua
Zhou, Guodong
Hierarchical sequence labeling for extracting BEL statements from biomedical literature
title Hierarchical sequence labeling for extracting BEL statements from biomedical literature
title_full Hierarchical sequence labeling for extracting BEL statements from biomedical literature
title_fullStr Hierarchical sequence labeling for extracting BEL statements from biomedical literature
title_full_unstemmed Hierarchical sequence labeling for extracting BEL statements from biomedical literature
title_short Hierarchical sequence labeling for extracting BEL statements from biomedical literature
title_sort hierarchical sequence labeling for extracting bel statements from biomedical literature
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454591/
https://www.ncbi.nlm.nih.gov/pubmed/30961584
http://dx.doi.org/10.1186/s12911-019-0758-3
work_keys_str_mv AT liusuwen hierarchicalsequencelabelingforextractingbelstatementsfrombiomedicalliterature
AT shaoyifan hierarchicalsequencelabelingforextractingbelstatementsfrombiomedicalliterature
AT qianlonghua hierarchicalsequencelabelingforextractingbelstatementsfrombiomedicalliterature
AT zhouguodong hierarchicalsequencelabelingforextractingbelstatementsfrombiomedicalliterature