Cargando…

Neural networks for open and closed Literature-based Discovery

Literature-based Discovery (LBD) aims to discover new knowledge automatically from large collections of literature. Scientific literature is growing at an exponential rate, making it difficult for researchers to stay current in their discipline and easy to miss knowledge necessary to advance their r...

Descripción completa

Detalles Bibliográficos
Autores principales: Crichton, Gamal, Baker, Simon, Guo, Yufan, Korhonen, Anna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7228051/
https://www.ncbi.nlm.nih.gov/pubmed/32413059
http://dx.doi.org/10.1371/journal.pone.0232891
_version_ 1783534566951092224
author Crichton, Gamal
Baker, Simon
Guo, Yufan
Korhonen, Anna
author_facet Crichton, Gamal
Baker, Simon
Guo, Yufan
Korhonen, Anna
author_sort Crichton, Gamal
collection PubMed
description Literature-based Discovery (LBD) aims to discover new knowledge automatically from large collections of literature. Scientific literature is growing at an exponential rate, making it difficult for researchers to stay current in their discipline and easy to miss knowledge necessary to advance their research. LBD can facilitate hypothesis testing and generation and thus accelerate scientific progress. Neural networks have demonstrated improved performance on LBD-related tasks but are yet to be applied to it. We propose four graph-based, neural network methods to perform open and closed LBD. We compared our methods with those used by the state-of-the-art LION LBD system on the same evaluations to replicate recently published findings in cancer biology. We also applied them to a time-sliced dataset of human-curated peer-reviewed biological interactions. These evaluations and the metrics they employ represent performance on real-world knowledge advances and are thus robust indicators of approach efficacy. In the first experiments, our best methods performed 2-4 times better than the baselines in closed discovery and 2-3 times better in open discovery. In the second, our best methods performed almost 2 times better than the baselines in open discovery. These results are strong indications that neural LBD is potentially a very effective approach for generating new scientific discoveries from existing literature. The code for our models and other information can be found at: https://github.com/cambridgeltl/nn_for_LBD.
format Online
Article
Text
id pubmed-7228051
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-72280512020-06-01 Neural networks for open and closed Literature-based Discovery Crichton, Gamal Baker, Simon Guo, Yufan Korhonen, Anna PLoS One Research Article Literature-based Discovery (LBD) aims to discover new knowledge automatically from large collections of literature. Scientific literature is growing at an exponential rate, making it difficult for researchers to stay current in their discipline and easy to miss knowledge necessary to advance their research. LBD can facilitate hypothesis testing and generation and thus accelerate scientific progress. Neural networks have demonstrated improved performance on LBD-related tasks but are yet to be applied to it. We propose four graph-based, neural network methods to perform open and closed LBD. We compared our methods with those used by the state-of-the-art LION LBD system on the same evaluations to replicate recently published findings in cancer biology. We also applied them to a time-sliced dataset of human-curated peer-reviewed biological interactions. These evaluations and the metrics they employ represent performance on real-world knowledge advances and are thus robust indicators of approach efficacy. In the first experiments, our best methods performed 2-4 times better than the baselines in closed discovery and 2-3 times better in open discovery. In the second, our best methods performed almost 2 times better than the baselines in open discovery. These results are strong indications that neural LBD is potentially a very effective approach for generating new scientific discoveries from existing literature. The code for our models and other information can be found at: https://github.com/cambridgeltl/nn_for_LBD. Public Library of Science 2020-05-15 /pmc/articles/PMC7228051/ /pubmed/32413059 http://dx.doi.org/10.1371/journal.pone.0232891 Text en © 2020 Crichton et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Crichton, Gamal
Baker, Simon
Guo, Yufan
Korhonen, Anna
Neural networks for open and closed Literature-based Discovery
title Neural networks for open and closed Literature-based Discovery
title_full Neural networks for open and closed Literature-based Discovery
title_fullStr Neural networks for open and closed Literature-based Discovery
title_full_unstemmed Neural networks for open and closed Literature-based Discovery
title_short Neural networks for open and closed Literature-based Discovery
title_sort neural networks for open and closed literature-based discovery
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7228051/
https://www.ncbi.nlm.nih.gov/pubmed/32413059
http://dx.doi.org/10.1371/journal.pone.0232891
work_keys_str_mv AT crichtongamal neuralnetworksforopenandclosedliteraturebaseddiscovery
AT bakersimon neuralnetworksforopenandclosedliteraturebaseddiscovery
AT guoyufan neuralnetworksforopenandclosedliteraturebaseddiscovery
AT korhonenanna neuralnetworksforopenandclosedliteraturebaseddiscovery