Cargando…
Assigning factuality values to semantic relations extracted from biomedical research literature
Biomedical knowledge claims are often expressed as hypotheses, speculations, or opinions, rather than explicit facts (propositions). Much biomedical text mining has focused on extracting propositions from biomedical literature. One such system is SemRep, which extracts propositional content in the f...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5497973/ https://www.ncbi.nlm.nih.gov/pubmed/28678823 http://dx.doi.org/10.1371/journal.pone.0179926 |
_version_ | 1783248229726420992 |
---|---|
author | Kilicoglu, Halil Rosemblat, Graciela Rindflesch, Thomas C. |
author_facet | Kilicoglu, Halil Rosemblat, Graciela Rindflesch, Thomas C. |
author_sort | Kilicoglu, Halil |
collection | PubMed |
description | Biomedical knowledge claims are often expressed as hypotheses, speculations, or opinions, rather than explicit facts (propositions). Much biomedical text mining has focused on extracting propositions from biomedical literature. One such system is SemRep, which extracts propositional content in the form of subject-predicate-object triples called predications. In this study, we investigated the feasibility of assessing the factuality level of SemRep predications to provide more nuanced distinctions between predications for downstream applications. We annotated semantic predications extracted from 500 PubMed abstracts with seven factuality values (fact, probable, possible, doubtful, counterfact, uncommitted, and conditional). We extended a rule-based, compositional approach that uses lexical and syntactic information to predict factuality levels. We compared this approach to a supervised machine learning method that uses a rich feature set based on the annotated corpus. Our results indicate that the compositional approach is more effective than the machine learning method in recognizing the factuality values of predications. The annotated corpus as well as the source code and binaries for factuality assignment are publicly available. We will also incorporate the results of the better performing compositional approach into SemMedDB, a PubMed-scale repository of semantic predications extracted using SemRep. |
format | Online Article Text |
id | pubmed-5497973 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-54979732017-07-25 Assigning factuality values to semantic relations extracted from biomedical research literature Kilicoglu, Halil Rosemblat, Graciela Rindflesch, Thomas C. PLoS One Research Article Biomedical knowledge claims are often expressed as hypotheses, speculations, or opinions, rather than explicit facts (propositions). Much biomedical text mining has focused on extracting propositions from biomedical literature. One such system is SemRep, which extracts propositional content in the form of subject-predicate-object triples called predications. In this study, we investigated the feasibility of assessing the factuality level of SemRep predications to provide more nuanced distinctions between predications for downstream applications. We annotated semantic predications extracted from 500 PubMed abstracts with seven factuality values (fact, probable, possible, doubtful, counterfact, uncommitted, and conditional). We extended a rule-based, compositional approach that uses lexical and syntactic information to predict factuality levels. We compared this approach to a supervised machine learning method that uses a rich feature set based on the annotated corpus. Our results indicate that the compositional approach is more effective than the machine learning method in recognizing the factuality values of predications. The annotated corpus as well as the source code and binaries for factuality assignment are publicly available. We will also incorporate the results of the better performing compositional approach into SemMedDB, a PubMed-scale repository of semantic predications extracted using SemRep. Public Library of Science 2017-07-05 /pmc/articles/PMC5497973/ /pubmed/28678823 http://dx.doi.org/10.1371/journal.pone.0179926 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication. |
spellingShingle | Research Article Kilicoglu, Halil Rosemblat, Graciela Rindflesch, Thomas C. Assigning factuality values to semantic relations extracted from biomedical research literature |
title | Assigning factuality values to semantic relations extracted from biomedical research literature |
title_full | Assigning factuality values to semantic relations extracted from biomedical research literature |
title_fullStr | Assigning factuality values to semantic relations extracted from biomedical research literature |
title_full_unstemmed | Assigning factuality values to semantic relations extracted from biomedical research literature |
title_short | Assigning factuality values to semantic relations extracted from biomedical research literature |
title_sort | assigning factuality values to semantic relations extracted from biomedical research literature |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5497973/ https://www.ncbi.nlm.nih.gov/pubmed/28678823 http://dx.doi.org/10.1371/journal.pone.0179926 |
work_keys_str_mv | AT kilicogluhalil assigningfactualityvaluestosemanticrelationsextractedfrombiomedicalresearchliterature AT rosemblatgraciela assigningfactualityvaluestosemanticrelationsextractedfrombiomedicalresearchliterature AT rindfleschthomasc assigningfactualityvaluestosemanticrelationsextractedfrombiomedicalresearchliterature |