Cargando…

Assigning factuality values to semantic relations extracted from biomedical research literature

Biomedical knowledge claims are often expressed as hypotheses, speculations, or opinions, rather than explicit facts (propositions). Much biomedical text mining has focused on extracting propositions from biomedical literature. One such system is SemRep, which extracts propositional content in the f...

Descripción completa

Detalles Bibliográficos
Autores principales: Kilicoglu, Halil, Rosemblat, Graciela, Rindflesch, Thomas C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5497973/
https://www.ncbi.nlm.nih.gov/pubmed/28678823
http://dx.doi.org/10.1371/journal.pone.0179926
_version_ 1783248229726420992
author Kilicoglu, Halil
Rosemblat, Graciela
Rindflesch, Thomas C.
author_facet Kilicoglu, Halil
Rosemblat, Graciela
Rindflesch, Thomas C.
author_sort Kilicoglu, Halil
collection PubMed
description Biomedical knowledge claims are often expressed as hypotheses, speculations, or opinions, rather than explicit facts (propositions). Much biomedical text mining has focused on extracting propositions from biomedical literature. One such system is SemRep, which extracts propositional content in the form of subject-predicate-object triples called predications. In this study, we investigated the feasibility of assessing the factuality level of SemRep predications to provide more nuanced distinctions between predications for downstream applications. We annotated semantic predications extracted from 500 PubMed abstracts with seven factuality values (fact, probable, possible, doubtful, counterfact, uncommitted, and conditional). We extended a rule-based, compositional approach that uses lexical and syntactic information to predict factuality levels. We compared this approach to a supervised machine learning method that uses a rich feature set based on the annotated corpus. Our results indicate that the compositional approach is more effective than the machine learning method in recognizing the factuality values of predications. The annotated corpus as well as the source code and binaries for factuality assignment are publicly available. We will also incorporate the results of the better performing compositional approach into SemMedDB, a PubMed-scale repository of semantic predications extracted using SemRep.
format Online
Article
Text
id pubmed-5497973
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-54979732017-07-25 Assigning factuality values to semantic relations extracted from biomedical research literature Kilicoglu, Halil Rosemblat, Graciela Rindflesch, Thomas C. PLoS One Research Article Biomedical knowledge claims are often expressed as hypotheses, speculations, or opinions, rather than explicit facts (propositions). Much biomedical text mining has focused on extracting propositions from biomedical literature. One such system is SemRep, which extracts propositional content in the form of subject-predicate-object triples called predications. In this study, we investigated the feasibility of assessing the factuality level of SemRep predications to provide more nuanced distinctions between predications for downstream applications. We annotated semantic predications extracted from 500 PubMed abstracts with seven factuality values (fact, probable, possible, doubtful, counterfact, uncommitted, and conditional). We extended a rule-based, compositional approach that uses lexical and syntactic information to predict factuality levels. We compared this approach to a supervised machine learning method that uses a rich feature set based on the annotated corpus. Our results indicate that the compositional approach is more effective than the machine learning method in recognizing the factuality values of predications. The annotated corpus as well as the source code and binaries for factuality assignment are publicly available. We will also incorporate the results of the better performing compositional approach into SemMedDB, a PubMed-scale repository of semantic predications extracted using SemRep. Public Library of Science 2017-07-05 /pmc/articles/PMC5497973/ /pubmed/28678823 http://dx.doi.org/10.1371/journal.pone.0179926 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Kilicoglu, Halil
Rosemblat, Graciela
Rindflesch, Thomas C.
Assigning factuality values to semantic relations extracted from biomedical research literature
title Assigning factuality values to semantic relations extracted from biomedical research literature
title_full Assigning factuality values to semantic relations extracted from biomedical research literature
title_fullStr Assigning factuality values to semantic relations extracted from biomedical research literature
title_full_unstemmed Assigning factuality values to semantic relations extracted from biomedical research literature
title_short Assigning factuality values to semantic relations extracted from biomedical research literature
title_sort assigning factuality values to semantic relations extracted from biomedical research literature
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5497973/
https://www.ncbi.nlm.nih.gov/pubmed/28678823
http://dx.doi.org/10.1371/journal.pone.0179926
work_keys_str_mv AT kilicogluhalil assigningfactualityvaluestosemanticrelationsextractedfrombiomedicalresearchliterature
AT rosemblatgraciela assigningfactualityvaluestosemanticrelationsextractedfrombiomedicalresearchliterature
AT rindfleschthomasc assigningfactualityvaluestosemanticrelationsextractedfrombiomedicalresearchliterature