Cargando…

Exploiting sequence labeling framework to extract document-level relations from biomedical texts

BACKGROUND: Both intra- and inter-sentential semantic relations in biomedical texts provide valuable information for biomedical research. However, most existing methods either focus on extracting intra-sentential relations and ignore inter-sentential ones or fail to extract inter-sentential relation...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Zhiheng, Yang, Zhihao, Xiang, Yang, Luo, Ling, Sun, Yuanyuan, Lin, Hongfei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7099809/
https://www.ncbi.nlm.nih.gov/pubmed/32216746
http://dx.doi.org/10.1186/s12859-020-3457-2
_version_ 1783511375045197824
author Li, Zhiheng
Yang, Zhihao
Xiang, Yang
Luo, Ling
Sun, Yuanyuan
Lin, Hongfei
author_facet Li, Zhiheng
Yang, Zhihao
Xiang, Yang
Luo, Ling
Sun, Yuanyuan
Lin, Hongfei
author_sort Li, Zhiheng
collection PubMed
description BACKGROUND: Both intra- and inter-sentential semantic relations in biomedical texts provide valuable information for biomedical research. However, most existing methods either focus on extracting intra-sentential relations and ignore inter-sentential ones or fail to extract inter-sentential relations accurately and regard the instances containing entity relations as being independent, which neglects the interactions between relations. We propose a novel sequence labeling-based biomedical relation extraction method named Bio-Seq. In the method, sequence labeling framework is extended by multiple specified feature extractors so as to facilitate the feature extractions at different levels, especially at the inter-sentential level. Besides, the sequence labeling framework enables Bio-Seq to take advantage of the interactions between relations, and thus, further improves the precision of document-level relation extraction. RESULTS: Our proposed method obtained an F1-score of 63.5% on BioCreative V chemical disease relation corpus, and an F1-score of 54.4% on inter-sentential relations, which was 10.5% better than the document-level classification baseline. Also, our method achieved an F1-score of 85.1% on n2c2-ADE sub-dataset. CONCLUSION: Sequence labeling method can be successfully used to extract document-level relations, especially for boosting the performance on inter-sentential relation extraction. Our work can facilitate the research on document-level biomedical text mining.
format Online
Article
Text
id pubmed-7099809
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-70998092020-03-30 Exploiting sequence labeling framework to extract document-level relations from biomedical texts Li, Zhiheng Yang, Zhihao Xiang, Yang Luo, Ling Sun, Yuanyuan Lin, Hongfei BMC Bioinformatics Methodology Article BACKGROUND: Both intra- and inter-sentential semantic relations in biomedical texts provide valuable information for biomedical research. However, most existing methods either focus on extracting intra-sentential relations and ignore inter-sentential ones or fail to extract inter-sentential relations accurately and regard the instances containing entity relations as being independent, which neglects the interactions between relations. We propose a novel sequence labeling-based biomedical relation extraction method named Bio-Seq. In the method, sequence labeling framework is extended by multiple specified feature extractors so as to facilitate the feature extractions at different levels, especially at the inter-sentential level. Besides, the sequence labeling framework enables Bio-Seq to take advantage of the interactions between relations, and thus, further improves the precision of document-level relation extraction. RESULTS: Our proposed method obtained an F1-score of 63.5% on BioCreative V chemical disease relation corpus, and an F1-score of 54.4% on inter-sentential relations, which was 10.5% better than the document-level classification baseline. Also, our method achieved an F1-score of 85.1% on n2c2-ADE sub-dataset. CONCLUSION: Sequence labeling method can be successfully used to extract document-level relations, especially for boosting the performance on inter-sentential relation extraction. Our work can facilitate the research on document-level biomedical text mining. BioMed Central 2020-03-27 /pmc/articles/PMC7099809/ /pubmed/32216746 http://dx.doi.org/10.1186/s12859-020-3457-2 Text en © The Author(s). 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Li, Zhiheng
Yang, Zhihao
Xiang, Yang
Luo, Ling
Sun, Yuanyuan
Lin, Hongfei
Exploiting sequence labeling framework to extract document-level relations from biomedical texts
title Exploiting sequence labeling framework to extract document-level relations from biomedical texts
title_full Exploiting sequence labeling framework to extract document-level relations from biomedical texts
title_fullStr Exploiting sequence labeling framework to extract document-level relations from biomedical texts
title_full_unstemmed Exploiting sequence labeling framework to extract document-level relations from biomedical texts
title_short Exploiting sequence labeling framework to extract document-level relations from biomedical texts
title_sort exploiting sequence labeling framework to extract document-level relations from biomedical texts
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7099809/
https://www.ncbi.nlm.nih.gov/pubmed/32216746
http://dx.doi.org/10.1186/s12859-020-3457-2
work_keys_str_mv AT lizhiheng exploitingsequencelabelingframeworktoextractdocumentlevelrelationsfrombiomedicaltexts
AT yangzhihao exploitingsequencelabelingframeworktoextractdocumentlevelrelationsfrombiomedicaltexts
AT xiangyang exploitingsequencelabelingframeworktoextractdocumentlevelrelationsfrombiomedicaltexts
AT luoling exploitingsequencelabelingframeworktoextractdocumentlevelrelationsfrombiomedicaltexts
AT sunyuanyuan exploitingsequencelabelingframeworktoextractdocumentlevelrelationsfrombiomedicaltexts
AT linhongfei exploitingsequencelabelingframeworktoextractdocumentlevelrelationsfrombiomedicaltexts