Cargando…
A neural classification method for supporting the creation of BioVerbNet
BACKGROUND: VerbNet, an extensive computational verb lexicon for English, has proved useful for supporting a wide range of Natural Language Processing tasks requiring information about the behaviour and meaning of verbs. Biomedical text processing and mining could benefit from a similar resource. We...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6339329/ https://www.ncbi.nlm.nih.gov/pubmed/30658707 http://dx.doi.org/10.1186/s13326-018-0193-x |
_version_ | 1783388613478711296 |
---|---|
author | Chiu, Billy Majewska, Olga Pyysalo, Sampo Wey, Laura Stenius, Ulla Korhonen, Anna Palmer, Martha |
author_facet | Chiu, Billy Majewska, Olga Pyysalo, Sampo Wey, Laura Stenius, Ulla Korhonen, Anna Palmer, Martha |
author_sort | Chiu, Billy |
collection | PubMed |
description | BACKGROUND: VerbNet, an extensive computational verb lexicon for English, has proved useful for supporting a wide range of Natural Language Processing tasks requiring information about the behaviour and meaning of verbs. Biomedical text processing and mining could benefit from a similar resource. We take the first step towards the development of BioVerbNet: A VerbNet specifically aimed at describing verbs in the area of biomedicine. Because VerbNet-style classification is extremely time consuming, we start from a small manual classification of biomedical verbs and apply a state-of-the-art neural representation model, specifically developed for class-based optimization, to expand the classification with new verbs, using all the PubMed abstracts and the full articles in the PubMed Central Open Access subset as data. RESULTS: Direct evaluation of the resulting classification against BioSimVerb (verb similarity judgement data in biomedicine) shows promising results when representation learning is performed using verb class-based contexts. Human validation by linguists and biologists reveals that the automatically expanded classification is highly accurate. Including novel, valid member verbs and classes, our method can be used to facilitate cost-effective development of BioVerbNet. CONCLUSION: This work constitutes the first effort on applying a state-of-the-art architecture for neural representation learning to biomedical verb classification. While we discuss future optimization of the method, our promising results suggest that the automatic classification released with this article can be used to readily support application tasks in biomedicine. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13326-018-0193-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6339329 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-63393292019-01-23 A neural classification method for supporting the creation of BioVerbNet Chiu, Billy Majewska, Olga Pyysalo, Sampo Wey, Laura Stenius, Ulla Korhonen, Anna Palmer, Martha J Biomed Semantics Database BACKGROUND: VerbNet, an extensive computational verb lexicon for English, has proved useful for supporting a wide range of Natural Language Processing tasks requiring information about the behaviour and meaning of verbs. Biomedical text processing and mining could benefit from a similar resource. We take the first step towards the development of BioVerbNet: A VerbNet specifically aimed at describing verbs in the area of biomedicine. Because VerbNet-style classification is extremely time consuming, we start from a small manual classification of biomedical verbs and apply a state-of-the-art neural representation model, specifically developed for class-based optimization, to expand the classification with new verbs, using all the PubMed abstracts and the full articles in the PubMed Central Open Access subset as data. RESULTS: Direct evaluation of the resulting classification against BioSimVerb (verb similarity judgement data in biomedicine) shows promising results when representation learning is performed using verb class-based contexts. Human validation by linguists and biologists reveals that the automatically expanded classification is highly accurate. Including novel, valid member verbs and classes, our method can be used to facilitate cost-effective development of BioVerbNet. CONCLUSION: This work constitutes the first effort on applying a state-of-the-art architecture for neural representation learning to biomedical verb classification. While we discuss future optimization of the method, our promising results suggest that the automatic classification released with this article can be used to readily support application tasks in biomedicine. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13326-018-0193-x) contains supplementary material, which is available to authorized users. BioMed Central 2019-01-18 /pmc/articles/PMC6339329/ /pubmed/30658707 http://dx.doi.org/10.1186/s13326-018-0193-x Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Database Chiu, Billy Majewska, Olga Pyysalo, Sampo Wey, Laura Stenius, Ulla Korhonen, Anna Palmer, Martha A neural classification method for supporting the creation of BioVerbNet |
title | A neural classification method for supporting the creation of BioVerbNet |
title_full | A neural classification method for supporting the creation of BioVerbNet |
title_fullStr | A neural classification method for supporting the creation of BioVerbNet |
title_full_unstemmed | A neural classification method for supporting the creation of BioVerbNet |
title_short | A neural classification method for supporting the creation of BioVerbNet |
title_sort | neural classification method for supporting the creation of bioverbnet |
topic | Database |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6339329/ https://www.ncbi.nlm.nih.gov/pubmed/30658707 http://dx.doi.org/10.1186/s13326-018-0193-x |
work_keys_str_mv | AT chiubilly aneuralclassificationmethodforsupportingthecreationofbioverbnet AT majewskaolga aneuralclassificationmethodforsupportingthecreationofbioverbnet AT pyysalosampo aneuralclassificationmethodforsupportingthecreationofbioverbnet AT weylaura aneuralclassificationmethodforsupportingthecreationofbioverbnet AT steniusulla aneuralclassificationmethodforsupportingthecreationofbioverbnet AT korhonenanna aneuralclassificationmethodforsupportingthecreationofbioverbnet AT palmermartha aneuralclassificationmethodforsupportingthecreationofbioverbnet AT chiubilly neuralclassificationmethodforsupportingthecreationofbioverbnet AT majewskaolga neuralclassificationmethodforsupportingthecreationofbioverbnet AT pyysalosampo neuralclassificationmethodforsupportingthecreationofbioverbnet AT weylaura neuralclassificationmethodforsupportingthecreationofbioverbnet AT steniusulla neuralclassificationmethodforsupportingthecreationofbioverbnet AT korhonenanna neuralclassificationmethodforsupportingthecreationofbioverbnet AT palmermartha neuralclassificationmethodforsupportingthecreationofbioverbnet |