Quantifying and filtering knowledge generated by literature based discovery
BACKGROUND: Literature based discovery (LBD) automatically infers missed connections between concepts in literature. It is often assumed that LBD generates more information than can be reasonably examined. METHODS: We present a detailed analysis of the quantity of hidden knowledge produced by an LBD...
Main Authors: | Preiss, Judita, Stevenson, Mark |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2017 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5471938/ https://www.ncbi.nlm.nih.gov/pubmed/28617217 http://dx.doi.org/10.1186/s12859-017-1641-9 |
_version_ | 1783244049153523712 |
---|---|
author | Preiss, Judita Stevenson, Mark |
author_facet | Preiss, Judita Stevenson, Mark |
author_sort | Preiss, Judita |
collection | PubMed |
description | BACKGROUND: Literature based discovery (LBD) automatically infers missed connections between concepts in literature. It is often assumed that LBD generates more information than can be reasonably examined. METHODS: We present a detailed analysis of the quantity of hidden knowledge produced by an LBD system and the effect of various filtering approaches upon this. The investigation of filtering combined with single or multi-step linking term chains is carried out on all articles in PubMed. RESULTS: The evaluation is carried out using both replication of existing discoveries, which provides justification for multi-step linking chain knowledge in specific cases, and using timeslicing, which gives a large scale measure of performance. CONCLUSIONS: While the quantity of hidden knowledge generated by LBD can be vast, we demonstrate that (a) intelligent filtering can greatly reduce the number of hidden knowledge pairs generated, (b) for a specific term, the number of single step connections can be manageable, and (c) in the absence of single step hidden links, considering multiple steps can provide valid links. |
format | Online Article Text |
id | pubmed-5471938 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-5471938 2017-06-19 Quantifying and filtering knowledge generated by literature based discovery Preiss, Judita Stevenson, Mark BMC Bioinformatics Research BACKGROUND: Literature based discovery (LBD) automatically infers missed connections between concepts in literature. It is often assumed that LBD generates more information than can be reasonably examined. METHODS: We present a detailed analysis of the quantity of hidden knowledge produced by an LBD system and the effect of various filtering approaches upon this. The investigation of filtering combined with single or multi-step linking term chains is carried out on all articles in PubMed. RESULTS: The evaluation is carried out using both replication of existing discoveries, which provides justification for multi-step linking chain knowledge in specific cases, and using timeslicing, which gives a large scale measure of performance. CONCLUSIONS: While the quantity of hidden knowledge generated by LBD can be vast, we demonstrate that (a) intelligent filtering can greatly reduce the number of hidden knowledge pairs generated, (b) for a specific term, the number of single step connections can be manageable, and (c) in the absence of single step hidden links, considering multiple steps can provide valid links. BioMed Central 2017-05-31 /pmc/articles/PMC5471938/ /pubmed/28617217 http://dx.doi.org/10.1186/s12859-017-1641-9 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Preiss, Judita Stevenson, Mark Quantifying and filtering knowledge generated by literature based discovery |
title | Quantifying and filtering knowledge generated by literature based discovery |
title_full | Quantifying and filtering knowledge generated by literature based discovery |
title_fullStr | Quantifying and filtering knowledge generated by literature based discovery |
title_full_unstemmed | Quantifying and filtering knowledge generated by literature based discovery |
title_short | Quantifying and filtering knowledge generated by literature based discovery |
title_sort | quantifying and filtering knowledge generated by literature based discovery |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5471938/ https://www.ncbi.nlm.nih.gov/pubmed/28617217 http://dx.doi.org/10.1186/s12859-017-1641-9 |
work_keys_str_mv | AT preissjudita quantifyingandfilteringknowledgegeneratedbyliteraturebaseddiscovery AT stevensonmark quantifyingandfilteringknowledgegeneratedbyliteraturebaseddiscovery |