Cargando…

Extraction of chemical–protein interactions from the literature using neural networks and narrow instance representation

The scientific literature contains large amounts of information on genes, proteins, chemicals and their interactions. Extraction and integration of this information in curated knowledge bases help researchers support their experimental results, leading to new hypotheses and discoveries. This is espe...

Descripción completa

Detalles Bibliográficos
Autores principales: Antunes, Rui, Matos, Sérgio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6796919/
https://www.ncbi.nlm.nih.gov/pubmed/31622463
http://dx.doi.org/10.1093/database/baz095
_version_ 1783459715064266752
author Antunes, Rui
Matos, Sérgio
author_facet Antunes, Rui
Matos, Sérgio
author_sort Antunes, Rui
collection PubMed
description The scientific literature contains large amounts of information on genes, proteins, chemicals and their interactions. Extraction and integration of this information in curated knowledge bases help researchers support their experimental results, leading to new hypotheses and discoveries. This is especially relevant for precision medicine, which aims to understand the individual variability across patient groups in order to select the most appropriate treatments. Methods for improved retrieval and automatic relation extraction from biomedical literature are therefore required for collecting structured information from the growing number of published works. In this paper, we follow a deep learning approach for extracting mentions of chemical–protein interactions from biomedical articles, based on various enhancements over our participation in the BioCreative VI CHEMPROT task. A significant aspect of our best method is the use of a simple deep learning model together with a very narrow representation of the relation instances, using only up to 10 words from the shortest dependency path and the respective dependency edges. Bidirectional long short-term memory recurrent networks or convolutional neural networks are used to build the deep learning models. We report the results of several experiments and show that our best model is competitive with more complex sentence representations or network structures, achieving an F1-score of 0.6306 on the test set. The source code of our work, along with detailed statistics, is publicly available.
format Online
Article
Text
id pubmed-6796919
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-67969192019-10-21 Extraction of chemical–protein interactions from the literature using neural networks and narrow instance representation Antunes, Rui Matos, Sérgio Database (Oxford) Original Article The scientific literature contains large amounts of information on genes, proteins, chemicals and their interactions. Extraction and integration of this information in curated knowledge bases help researchers support their experimental results, leading to new hypotheses and discoveries. This is especially relevant for precision medicine, which aims to understand the individual variability across patient groups in order to select the most appropriate treatments. Methods for improved retrieval and automatic relation extraction from biomedical literature are therefore required for collecting structured information from the growing number of published works. In this paper, we follow a deep learning approach for extracting mentions of chemical–protein interactions from biomedical articles, based on various enhancements over our participation in the BioCreative VI CHEMPROT task. A significant aspect of our best method is the use of a simple deep learning model together with a very narrow representation of the relation instances, using only up to 10 words from the shortest dependency path and the respective dependency edges. Bidirectional long short-term memory recurrent networks or convolutional neural networks are used to build the deep learning models. We report the results of several experiments and show that our best model is competitive with more complex sentence representations or network structures, achieving an F1-score of 0.6306 on the test set. The source code of our work, along with detailed statistics, is publicly available. Oxford University Press 2019-10-17 /pmc/articles/PMC6796919/ /pubmed/31622463 http://dx.doi.org/10.1093/database/baz095 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Antunes, Rui
Matos, Sérgio
Extraction of chemical–protein interactions from the literature using neural networks and narrow instance representation
title Extraction of chemical–protein interactions from the literature using neural networks and narrow instance representation
title_full Extraction of chemical–protein interactions from the literature using neural networks and narrow instance representation
title_fullStr Extraction of chemical–protein interactions from the literature using neural networks and narrow instance representation
title_full_unstemmed Extraction of chemical–protein interactions from the literature using neural networks and narrow instance representation
title_short Extraction of chemical–protein interactions from the literature using neural networks and narrow instance representation
title_sort extraction of chemical–protein interactions from the literature using neural networks and narrow instance representation
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6796919/
https://www.ncbi.nlm.nih.gov/pubmed/31622463
http://dx.doi.org/10.1093/database/baz095
work_keys_str_mv AT antunesrui extractionofchemicalproteininteractionsfromtheliteratureusingneuralnetworksandnarrowinstancerepresentation
AT matossergio extractionofchemicalproteininteractionsfromtheliteratureusingneuralnetworksandnarrowinstancerepresentation