Cargando…

K-RET: knowledgeable biomedical relation extraction system

MOTIVATION: Relation extraction (RE) is a crucial process to deal with the amount of text published daily, e.g. to find missing associations in a database. RE is a text mining task for which the state-of-the-art approaches use bidirectional encoders, namely, BERT. However, state-of-the-art performan...

Descripción completa

Detalles Bibliográficos
Autores principales: Sousa, Diana F, Couto, Francisco M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10112952/
https://www.ncbi.nlm.nih.gov/pubmed/37018156
http://dx.doi.org/10.1093/bioinformatics/btad174
_version_ 1785027723786190848
author Sousa, Diana F
Couto, Francisco M
author_facet Sousa, Diana F
Couto, Francisco M
author_sort Sousa, Diana F
collection PubMed
description MOTIVATION: Relation extraction (RE) is a crucial process to deal with the amount of text published daily, e.g. to find missing associations in a database. RE is a text mining task for which the state-of-the-art approaches use bidirectional encoders, namely, BERT. However, state-of-the-art performance may be limited by the lack of efficient external knowledge injection approaches, with a larger impact in the biomedical area given the widespread usage and high quality of biomedical ontologies. This knowledge can propel these systems forward by aiding them in predicting more explainable biomedical associations. With this in mind, we developed K-RET, a novel, knowledgeable biomedical RE system that, for the first time, injects knowledge by handling different types of associations, multiple sources and where to apply it, and multi-token entities. RESULTS: We tested K-RET on three independent and open-access corpora (DDI, BC5CDR, and PGR) using four biomedical ontologies handling different entities. K-RET improved state-of-the-art results by 2.68% on average, with the DDI Corpus yielding the most significant boost in performance, from 79.30% to 87.19% in F-measure, representing a P-value of [Formula: see text]. AVAILABILITY AND IMPLEMENTATION: https://github.com/lasigeBioTM/K-RET.
format Online
Article
Text
id pubmed-10112952
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101129522023-04-19 K-RET: knowledgeable biomedical relation extraction system Sousa, Diana F Couto, Francisco M Bioinformatics Original Paper MOTIVATION: Relation extraction (RE) is a crucial process to deal with the amount of text published daily, e.g. to find missing associations in a database. RE is a text mining task for which the state-of-the-art approaches use bidirectional encoders, namely, BERT. However, state-of-the-art performance may be limited by the lack of efficient external knowledge injection approaches, with a larger impact in the biomedical area given the widespread usage and high quality of biomedical ontologies. This knowledge can propel these systems forward by aiding them in predicting more explainable biomedical associations. With this in mind, we developed K-RET, a novel, knowledgeable biomedical RE system that, for the first time, injects knowledge by handling different types of associations, multiple sources and where to apply it, and multi-token entities. RESULTS: We tested K-RET on three independent and open-access corpora (DDI, BC5CDR, and PGR) using four biomedical ontologies handling different entities. K-RET improved state-of-the-art results by 2.68% on average, with the DDI Corpus yielding the most significant boost in performance, from 79.30% to 87.19% in F-measure, representing a P-value of [Formula: see text]. AVAILABILITY AND IMPLEMENTATION: https://github.com/lasigeBioTM/K-RET. Oxford University Press 2023-04-05 /pmc/articles/PMC10112952/ /pubmed/37018156 http://dx.doi.org/10.1093/bioinformatics/btad174 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Sousa, Diana F
Couto, Francisco M
K-RET: knowledgeable biomedical relation extraction system
title K-RET: knowledgeable biomedical relation extraction system
title_full K-RET: knowledgeable biomedical relation extraction system
title_fullStr K-RET: knowledgeable biomedical relation extraction system
title_full_unstemmed K-RET: knowledgeable biomedical relation extraction system
title_short K-RET: knowledgeable biomedical relation extraction system
title_sort k-ret: knowledgeable biomedical relation extraction system
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10112952/
https://www.ncbi.nlm.nih.gov/pubmed/37018156
http://dx.doi.org/10.1093/bioinformatics/btad174
work_keys_str_mv AT sousadianaf kretknowledgeablebiomedicalrelationextractionsystem
AT coutofranciscom kretknowledgeablebiomedicalrelationextractionsystem