Cargando…
Hierarchical sequence labeling for extracting BEL statements from biomedical literature
BACKGROUND: Extracting relations between bio-entities from biomedical literature is often a challenging task and also an essential step towards biomedical knowledge expansion. The BioCreative community has organized a shared task to evaluate the robustness of the causal relationship extraction algor...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454591/ https://www.ncbi.nlm.nih.gov/pubmed/30961584 http://dx.doi.org/10.1186/s12911-019-0758-3 |
_version_ | 1783409566240735232 |
---|---|
author | Liu, Suwen Shao, Yifan Qian, Longhua Zhou, Guodong |
author_facet | Liu, Suwen Shao, Yifan Qian, Longhua Zhou, Guodong |
author_sort | Liu, Suwen |
collection | PubMed |
description | BACKGROUND: Extracting relations between bio-entities from biomedical literature is often a challenging task and also an essential step towards biomedical knowledge expansion. The BioCreative community has organized a shared task to evaluate the robustness of the causal relationship extraction algorithms in Biological Expression Language (BEL) from biomedical literature. METHOD: We first map the sentence-level BEL statements in the BC-V training corpus to the corresponding text segments, thus generating hierarchically tagged training instances. A hierarchical sequence labeling model was afterwards induced from these training instances and applied to the test sentences in order to construct the BEL statements. RESULTS: The experimental results on extracting BEL statements from BioCreative V Track 4 test corpus show that our method achieves promising performance with an overall F-measure of 31.6%. Furthermore, it has the potential to be enhanced by adopting more advanced machine learning approaches. CONCLUSION: We propose a framework for hierarchical relation extraction using hierarchical sequence labeling on the instance-level training corpus derived from the original sentence-level corpus via word alignment. Its main advantage is that we can make full use of the original training corpus to induce the sequence labelers and then apply them to the test corpus. |
format | Online Article Text |
id | pubmed-6454591 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-64545912019-04-19 Hierarchical sequence labeling for extracting BEL statements from biomedical literature Liu, Suwen Shao, Yifan Qian, Longhua Zhou, Guodong BMC Med Inform Decis Mak Research BACKGROUND: Extracting relations between bio-entities from biomedical literature is often a challenging task and also an essential step towards biomedical knowledge expansion. The BioCreative community has organized a shared task to evaluate the robustness of the causal relationship extraction algorithms in Biological Expression Language (BEL) from biomedical literature. METHOD: We first map the sentence-level BEL statements in the BC-V training corpus to the corresponding text segments, thus generating hierarchically tagged training instances. A hierarchical sequence labeling model was afterwards induced from these training instances and applied to the test sentences in order to construct the BEL statements. RESULTS: The experimental results on extracting BEL statements from BioCreative V Track 4 test corpus show that our method achieves promising performance with an overall F-measure of 31.6%. Furthermore, it has the potential to be enhanced by adopting more advanced machine learning approaches. CONCLUSION: We propose a framework for hierarchical relation extraction using hierarchical sequence labeling on the instance-level training corpus derived from the original sentence-level corpus via word alignment. Its main advantage is that we can make full use of the original training corpus to induce the sequence labelers and then apply them to the test corpus. BioMed Central 2019-04-09 /pmc/articles/PMC6454591/ /pubmed/30961584 http://dx.doi.org/10.1186/s12911-019-0758-3 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Liu, Suwen Shao, Yifan Qian, Longhua Zhou, Guodong Hierarchical sequence labeling for extracting BEL statements from biomedical literature |
title | Hierarchical sequence labeling for extracting BEL statements from biomedical literature |
title_full | Hierarchical sequence labeling for extracting BEL statements from biomedical literature |
title_fullStr | Hierarchical sequence labeling for extracting BEL statements from biomedical literature |
title_full_unstemmed | Hierarchical sequence labeling for extracting BEL statements from biomedical literature |
title_short | Hierarchical sequence labeling for extracting BEL statements from biomedical literature |
title_sort | hierarchical sequence labeling for extracting bel statements from biomedical literature |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454591/ https://www.ncbi.nlm.nih.gov/pubmed/30961584 http://dx.doi.org/10.1186/s12911-019-0758-3 |
work_keys_str_mv | AT liusuwen hierarchicalsequencelabelingforextractingbelstatementsfrombiomedicalliterature AT shaoyifan hierarchicalsequencelabelingforextractingbelstatementsfrombiomedicalliterature AT qianlonghua hierarchicalsequencelabelingforextractingbelstatementsfrombiomedicalliterature AT zhouguodong hierarchicalsequencelabelingforextractingbelstatementsfrombiomedicalliterature |