Quantifying and filtering knowledge generated by literature based discovery

BACKGROUND: Literature based discovery (LBD) automatically infers missed connections between concepts in literature. It is often assumed that LBD generates more information than can be reasonably examined. METHODS: We present a detailed analysis of the quantity of hidden knowledge produced by an LBD...

Bibliographic Details
Main authors: Preiss, Judita; Stevenson, Mark
Format: Online Article Text
Language: English
Published: BioMed Central 2017
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5471938/
https://www.ncbi.nlm.nih.gov/pubmed/28617217
http://dx.doi.org/10.1186/s12859-017-1641-9
author Preiss, Judita
Stevenson, Mark
collection PubMed
description BACKGROUND: Literature based discovery (LBD) automatically infers missed connections between concepts in literature. It is often assumed that LBD generates more information than can be reasonably examined. METHODS: We present a detailed analysis of the quantity of hidden knowledge produced by an LBD system and the effect of various filtering approaches upon this. The investigation of filtering combined with single or multi-step linking term chains is carried out on all articles in PubMed. RESULTS: The evaluation is carried out using both replication of existing discoveries, which provides justification for multi-step linking chain knowledge in specific cases, and using timeslicing, which gives a large scale measure of performance. CONCLUSIONS: While the quantity of hidden knowledge generated by LBD can be vast, we demonstrate that (a) intelligent filtering can greatly reduce the number of hidden knowledge pairs generated, (b) for a specific term, the number of single step connections can be manageable, and (c) in the absence of single step hidden links, considering multiple steps can provide valid links.
format Online
Article
Text
id pubmed-5471938
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5471938 2017-06-19 Quantifying and filtering knowledge generated by literature based discovery Preiss, Judita Stevenson, Mark BMC Bioinformatics Research BioMed Central 2017-05-31 /pmc/articles/PMC5471938/ /pubmed/28617217 http://dx.doi.org/10.1186/s12859-017-1641-9 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title Quantifying and filtering knowledge generated by literature based discovery
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5471938/
https://www.ncbi.nlm.nih.gov/pubmed/28617217
http://dx.doi.org/10.1186/s12859-017-1641-9