Cargando…

Connecting the Dots: Hypotheses Generation by Leveraging Semantic Shifts

Literature-based Discovery (LBD) (a.k.a. Hypotheses Generation) is a systematic knowledge discovery process that elicit novel inferences about previously unknown scientific knowledge by rationally connecting complementary and non-interactive literature. Prompt identification of such novel knowledge...

Descripción completa

Detalles Bibliográficos
Autores principales: Thilakaratne, Menasha, Falkner, Katrina, Atapattu, Thushari
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206271/
http://dx.doi.org/10.1007/978-3-030-47436-2_25
_version_ 1783530382690353152
author Thilakaratne, Menasha
Falkner, Katrina
Atapattu, Thushari
author_facet Thilakaratne, Menasha
Falkner, Katrina
Atapattu, Thushari
author_sort Thilakaratne, Menasha
collection PubMed
description Literature-based Discovery (LBD) (a.k.a. Hypotheses Generation) is a systematic knowledge discovery process that elicit novel inferences about previously unknown scientific knowledge by rationally connecting complementary and non-interactive literature. Prompt identification of such novel knowledge is beneficial not only for researchers but also for various other stakeholders such as universities, funding bodies and academic publishers. Almost all the prior LBD research suffer from two major limitations. Firstly, the over-reliance of domain-dependent resources which restrict the models’ applicability to certain domains/problems. In this regard, we propose a generalisable LBD model that supports both cross-domain and cross-lingual knowledge discovery. The second persistent research deficiency is the mere focus of static snapshot of the corpus (i.e. ignoring the temporal evolution of topics) to detect the new knowledge. However, the knowledge in scientific literature changes dynamically and thus relying merely on static snapshot limits the model’s ability in capturing semantically meaningful connections. As a result, we propose a novel temporal model that captures semantic change of topics using diachronic word embeddings to unravel more accurate connections. The model was evaluated using the largest available literature repository to demonstrate the efficiency of the proposed cues towards recommending novel knowledge. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this chapter (10.1007/978-3-030-47436-2_25) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-7206271
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72062712020-05-08 Connecting the Dots: Hypotheses Generation by Leveraging Semantic Shifts Thilakaratne, Menasha Falkner, Katrina Atapattu, Thushari Advances in Knowledge Discovery and Data Mining Article Literature-based Discovery (LBD) (a.k.a. Hypotheses Generation) is a systematic knowledge discovery process that elicit novel inferences about previously unknown scientific knowledge by rationally connecting complementary and non-interactive literature. Prompt identification of such novel knowledge is beneficial not only for researchers but also for various other stakeholders such as universities, funding bodies and academic publishers. Almost all the prior LBD research suffer from two major limitations. Firstly, the over-reliance of domain-dependent resources which restrict the models’ applicability to certain domains/problems. In this regard, we propose a generalisable LBD model that supports both cross-domain and cross-lingual knowledge discovery. The second persistent research deficiency is the mere focus of static snapshot of the corpus (i.e. ignoring the temporal evolution of topics) to detect the new knowledge. However, the knowledge in scientific literature changes dynamically and thus relying merely on static snapshot limits the model’s ability in capturing semantically meaningful connections. As a result, we propose a novel temporal model that captures semantic change of topics using diachronic word embeddings to unravel more accurate connections. The model was evaluated using the largest available literature repository to demonstrate the efficiency of the proposed cues towards recommending novel knowledge. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this chapter (10.1007/978-3-030-47436-2_25) contains supplementary material, which is available to authorized users. 2020-04-17 /pmc/articles/PMC7206271/ http://dx.doi.org/10.1007/978-3-030-47436-2_25 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Thilakaratne, Menasha
Falkner, Katrina
Atapattu, Thushari
Connecting the Dots: Hypotheses Generation by Leveraging Semantic Shifts
title Connecting the Dots: Hypotheses Generation by Leveraging Semantic Shifts
title_full Connecting the Dots: Hypotheses Generation by Leveraging Semantic Shifts
title_fullStr Connecting the Dots: Hypotheses Generation by Leveraging Semantic Shifts
title_full_unstemmed Connecting the Dots: Hypotheses Generation by Leveraging Semantic Shifts
title_short Connecting the Dots: Hypotheses Generation by Leveraging Semantic Shifts
title_sort connecting the dots: hypotheses generation by leveraging semantic shifts
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206271/
http://dx.doi.org/10.1007/978-3-030-47436-2_25
work_keys_str_mv AT thilakaratnemenasha connectingthedotshypothesesgenerationbyleveragingsemanticshifts
AT falknerkatrina connectingthedotshypothesesgenerationbyleveragingsemanticshifts
AT atapattuthushari connectingthedotshypothesesgenerationbyleveragingsemanticshifts