Cargando…
Exploiting sequence labeling framework to extract document-level relations from biomedical texts
BACKGROUND: Both intra- and inter-sentential semantic relations in biomedical texts provide valuable information for biomedical research. However, most existing methods either focus on extracting intra-sentential relations and ignore inter-sentential ones or fail to extract inter-sentential relation...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7099809/ https://www.ncbi.nlm.nih.gov/pubmed/32216746 http://dx.doi.org/10.1186/s12859-020-3457-2 |
_version_ | 1783511375045197824 |
---|---|
author | Li, Zhiheng Yang, Zhihao Xiang, Yang Luo, Ling Sun, Yuanyuan Lin, Hongfei |
author_facet | Li, Zhiheng Yang, Zhihao Xiang, Yang Luo, Ling Sun, Yuanyuan Lin, Hongfei |
author_sort | Li, Zhiheng |
collection | PubMed |
description | BACKGROUND: Both intra- and inter-sentential semantic relations in biomedical texts provide valuable information for biomedical research. However, most existing methods either focus on extracting intra-sentential relations and ignore inter-sentential ones or fail to extract inter-sentential relations accurately and regard the instances containing entity relations as being independent, which neglects the interactions between relations. We propose a novel sequence labeling-based biomedical relation extraction method named Bio-Seq. In the method, sequence labeling framework is extended by multiple specified feature extractors so as to facilitate the feature extractions at different levels, especially at the inter-sentential level. Besides, the sequence labeling framework enables Bio-Seq to take advantage of the interactions between relations, and thus, further improves the precision of document-level relation extraction. RESULTS: Our proposed method obtained an F1-score of 63.5% on BioCreative V chemical disease relation corpus, and an F1-score of 54.4% on inter-sentential relations, which was 10.5% better than the document-level classification baseline. Also, our method achieved an F1-score of 85.1% on n2c2-ADE sub-dataset. CONCLUSION: Sequence labeling method can be successfully used to extract document-level relations, especially for boosting the performance on inter-sentential relation extraction. Our work can facilitate the research on document-level biomedical text mining. |
format | Online Article Text |
id | pubmed-7099809 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-70998092020-03-30 Exploiting sequence labeling framework to extract document-level relations from biomedical texts Li, Zhiheng Yang, Zhihao Xiang, Yang Luo, Ling Sun, Yuanyuan Lin, Hongfei BMC Bioinformatics Methodology Article BACKGROUND: Both intra- and inter-sentential semantic relations in biomedical texts provide valuable information for biomedical research. However, most existing methods either focus on extracting intra-sentential relations and ignore inter-sentential ones or fail to extract inter-sentential relations accurately and regard the instances containing entity relations as being independent, which neglects the interactions between relations. We propose a novel sequence labeling-based biomedical relation extraction method named Bio-Seq. In the method, sequence labeling framework is extended by multiple specified feature extractors so as to facilitate the feature extractions at different levels, especially at the inter-sentential level. Besides, the sequence labeling framework enables Bio-Seq to take advantage of the interactions between relations, and thus, further improves the precision of document-level relation extraction. RESULTS: Our proposed method obtained an F1-score of 63.5% on BioCreative V chemical disease relation corpus, and an F1-score of 54.4% on inter-sentential relations, which was 10.5% better than the document-level classification baseline. Also, our method achieved an F1-score of 85.1% on n2c2-ADE sub-dataset. CONCLUSION: Sequence labeling method can be successfully used to extract document-level relations, especially for boosting the performance on inter-sentential relation extraction. Our work can facilitate the research on document-level biomedical text mining. BioMed Central 2020-03-27 /pmc/articles/PMC7099809/ /pubmed/32216746 http://dx.doi.org/10.1186/s12859-020-3457-2 Text en © The Author(s). 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Article Li, Zhiheng Yang, Zhihao Xiang, Yang Luo, Ling Sun, Yuanyuan Lin, Hongfei Exploiting sequence labeling framework to extract document-level relations from biomedical texts |
title | Exploiting sequence labeling framework to extract document-level relations from biomedical texts |
title_full | Exploiting sequence labeling framework to extract document-level relations from biomedical texts |
title_fullStr | Exploiting sequence labeling framework to extract document-level relations from biomedical texts |
title_full_unstemmed | Exploiting sequence labeling framework to extract document-level relations from biomedical texts |
title_short | Exploiting sequence labeling framework to extract document-level relations from biomedical texts |
title_sort | exploiting sequence labeling framework to extract document-level relations from biomedical texts |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7099809/ https://www.ncbi.nlm.nih.gov/pubmed/32216746 http://dx.doi.org/10.1186/s12859-020-3457-2 |
work_keys_str_mv | AT lizhiheng exploitingsequencelabelingframeworktoextractdocumentlevelrelationsfrombiomedicaltexts AT yangzhihao exploitingsequencelabelingframeworktoextractdocumentlevelrelationsfrombiomedicaltexts AT xiangyang exploitingsequencelabelingframeworktoextractdocumentlevelrelationsfrombiomedicaltexts AT luoling exploitingsequencelabelingframeworktoextractdocumentlevelrelationsfrombiomedicaltexts AT sunyuanyuan exploitingsequencelabelingframeworktoextractdocumentlevelrelationsfrombiomedicaltexts AT linhongfei exploitingsequencelabelingframeworktoextractdocumentlevelrelationsfrombiomedicaltexts |