Cargando…

A neural classification method for supporting the creation of BioVerbNet

BACKGROUND: VerbNet, an extensive computational verb lexicon for English, has proved useful for supporting a wide range of Natural Language Processing tasks requiring information about the behaviour and meaning of verbs. Biomedical text processing and mining could benefit from a similar resource. We...

Descripción completa

Detalles Bibliográficos
Autores principales: Chiu, Billy, Majewska, Olga, Pyysalo, Sampo, Wey, Laura, Stenius, Ulla, Korhonen, Anna, Palmer, Martha
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6339329/
https://www.ncbi.nlm.nih.gov/pubmed/30658707
http://dx.doi.org/10.1186/s13326-018-0193-x
_version_ 1783388613478711296
author Chiu, Billy
Majewska, Olga
Pyysalo, Sampo
Wey, Laura
Stenius, Ulla
Korhonen, Anna
Palmer, Martha
author_facet Chiu, Billy
Majewska, Olga
Pyysalo, Sampo
Wey, Laura
Stenius, Ulla
Korhonen, Anna
Palmer, Martha
author_sort Chiu, Billy
collection PubMed
description BACKGROUND: VerbNet, an extensive computational verb lexicon for English, has proved useful for supporting a wide range of Natural Language Processing tasks requiring information about the behaviour and meaning of verbs. Biomedical text processing and mining could benefit from a similar resource. We take the first step towards the development of BioVerbNet: A VerbNet specifically aimed at describing verbs in the area of biomedicine. Because VerbNet-style classification is extremely time consuming, we start from a small manual classification of biomedical verbs and apply a state-of-the-art neural representation model, specifically developed for class-based optimization, to expand the classification with new verbs, using all the PubMed abstracts and the full articles in the PubMed Central Open Access subset as data. RESULTS: Direct evaluation of the resulting classification against BioSimVerb (verb similarity judgement data in biomedicine) shows promising results when representation learning is performed using verb class-based contexts. Human validation by linguists and biologists reveals that the automatically expanded classification is highly accurate. Including novel, valid member verbs and classes, our method can be used to facilitate cost-effective development of BioVerbNet. CONCLUSION: This work constitutes the first effort on applying a state-of-the-art architecture for neural representation learning to biomedical verb classification. While we discuss future optimization of the method, our promising results suggest that the automatic classification released with this article can be used to readily support application tasks in biomedicine. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13326-018-0193-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6339329
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63393292019-01-23 A neural classification method for supporting the creation of BioVerbNet Chiu, Billy Majewska, Olga Pyysalo, Sampo Wey, Laura Stenius, Ulla Korhonen, Anna Palmer, Martha J Biomed Semantics Database BACKGROUND: VerbNet, an extensive computational verb lexicon for English, has proved useful for supporting a wide range of Natural Language Processing tasks requiring information about the behaviour and meaning of verbs. Biomedical text processing and mining could benefit from a similar resource. We take the first step towards the development of BioVerbNet: A VerbNet specifically aimed at describing verbs in the area of biomedicine. Because VerbNet-style classification is extremely time consuming, we start from a small manual classification of biomedical verbs and apply a state-of-the-art neural representation model, specifically developed for class-based optimization, to expand the classification with new verbs, using all the PubMed abstracts and the full articles in the PubMed Central Open Access subset as data. RESULTS: Direct evaluation of the resulting classification against BioSimVerb (verb similarity judgement data in biomedicine) shows promising results when representation learning is performed using verb class-based contexts. Human validation by linguists and biologists reveals that the automatically expanded classification is highly accurate. Including novel, valid member verbs and classes, our method can be used to facilitate cost-effective development of BioVerbNet. CONCLUSION: This work constitutes the first effort on applying a state-of-the-art architecture for neural representation learning to biomedical verb classification. While we discuss future optimization of the method, our promising results suggest that the automatic classification released with this article can be used to readily support application tasks in biomedicine. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13326-018-0193-x) contains supplementary material, which is available to authorized users. BioMed Central 2019-01-18 /pmc/articles/PMC6339329/ /pubmed/30658707 http://dx.doi.org/10.1186/s13326-018-0193-x Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Database
Chiu, Billy
Majewska, Olga
Pyysalo, Sampo
Wey, Laura
Stenius, Ulla
Korhonen, Anna
Palmer, Martha
A neural classification method for supporting the creation of BioVerbNet
title A neural classification method for supporting the creation of BioVerbNet
title_full A neural classification method for supporting the creation of BioVerbNet
title_fullStr A neural classification method for supporting the creation of BioVerbNet
title_full_unstemmed A neural classification method for supporting the creation of BioVerbNet
title_short A neural classification method for supporting the creation of BioVerbNet
title_sort neural classification method for supporting the creation of bioverbnet
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6339329/
https://www.ncbi.nlm.nih.gov/pubmed/30658707
http://dx.doi.org/10.1186/s13326-018-0193-x
work_keys_str_mv AT chiubilly aneuralclassificationmethodforsupportingthecreationofbioverbnet
AT majewskaolga aneuralclassificationmethodforsupportingthecreationofbioverbnet
AT pyysalosampo aneuralclassificationmethodforsupportingthecreationofbioverbnet
AT weylaura aneuralclassificationmethodforsupportingthecreationofbioverbnet
AT steniusulla aneuralclassificationmethodforsupportingthecreationofbioverbnet
AT korhonenanna aneuralclassificationmethodforsupportingthecreationofbioverbnet
AT palmermartha aneuralclassificationmethodforsupportingthecreationofbioverbnet
AT chiubilly neuralclassificationmethodforsupportingthecreationofbioverbnet
AT majewskaolga neuralclassificationmethodforsupportingthecreationofbioverbnet
AT pyysalosampo neuralclassificationmethodforsupportingthecreationofbioverbnet
AT weylaura neuralclassificationmethodforsupportingthecreationofbioverbnet
AT steniusulla neuralclassificationmethodforsupportingthecreationofbioverbnet
AT korhonenanna neuralclassificationmethodforsupportingthecreationofbioverbnet
AT palmermartha neuralclassificationmethodforsupportingthecreationofbioverbnet