Cargando…

Biomedical event trigger detection by dependency-based word embedding

BACKGROUND: In biomedical research, events revealing complex relations between entities play an important role. Biomedical event trigger identification has become a research hotspot since its important role in biomedical event extraction. Traditional machine learning methods, such as support vector...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jian, Zhang, Jianhai, An, Yuan, Lin, Hongfei, Yang, Zhihao, Zhang, Yijia, Sun, Yuanyuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4980775/
https://www.ncbi.nlm.nih.gov/pubmed/27510445
http://dx.doi.org/10.1186/s12920-016-0203-8
_version_ 1782447513113985024
author Wang, Jian
Zhang, Jianhai
An, Yuan
Lin, Hongfei
Yang, Zhihao
Zhang, Yijia
Sun, Yuanyuan
author_facet Wang, Jian
Zhang, Jianhai
An, Yuan
Lin, Hongfei
Yang, Zhihao
Zhang, Yijia
Sun, Yuanyuan
author_sort Wang, Jian
collection PubMed
description BACKGROUND: In biomedical research, events revealing complex relations between entities play an important role. Biomedical event trigger identification has become a research hotspot since its important role in biomedical event extraction. Traditional machine learning methods, such as support vector machines (SVM) and maxent classifiers, which aim to manually design powerful features fed to the classifiers, depend on the understanding of the specific task and cannot generalize to the new domain or new examples. METHODS: In this paper, we propose an approach which utilizes neural network model based on dependency-based word embedding to automatically learn significant features from raw input for trigger classification. First, we employ Word2vecf, the modified version of Word2vec, to learn word embedding with rich semantic and functional information based on dependency relation tree. Then neural network architecture is used to learn more significant feature representation based on raw dependency-based word embedding. Meanwhile, we dynamically adjust the embedding while training for adapting to the trigger classification task. Finally, softmax classifier labels the examples by specific trigger class using the features learned by the model. RESULTS: The experimental results show that our approach achieves a micro-averaging F1 score of 78.27 and a macro-averaging F1 score of 76.94 % in significant trigger classes, and performs better than baseline methods. In addition, we can achieve the semantic distributed representation of every trigger word.
format Online
Article
Text
id pubmed-4980775
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49807752016-08-19 Biomedical event trigger detection by dependency-based word embedding Wang, Jian Zhang, Jianhai An, Yuan Lin, Hongfei Yang, Zhihao Zhang, Yijia Sun, Yuanyuan BMC Med Genomics Research BACKGROUND: In biomedical research, events revealing complex relations between entities play an important role. Biomedical event trigger identification has become a research hotspot since its important role in biomedical event extraction. Traditional machine learning methods, such as support vector machines (SVM) and maxent classifiers, which aim to manually design powerful features fed to the classifiers, depend on the understanding of the specific task and cannot generalize to the new domain or new examples. METHODS: In this paper, we propose an approach which utilizes neural network model based on dependency-based word embedding to automatically learn significant features from raw input for trigger classification. First, we employ Word2vecf, the modified version of Word2vec, to learn word embedding with rich semantic and functional information based on dependency relation tree. Then neural network architecture is used to learn more significant feature representation based on raw dependency-based word embedding. Meanwhile, we dynamically adjust the embedding while training for adapting to the trigger classification task. Finally, softmax classifier labels the examples by specific trigger class using the features learned by the model. RESULTS: The experimental results show that our approach achieves a micro-averaging F1 score of 78.27 and a macro-averaging F1 score of 76.94 % in significant trigger classes, and performs better than baseline methods. In addition, we can achieve the semantic distributed representation of every trigger word. BioMed Central 2016-08-10 /pmc/articles/PMC4980775/ /pubmed/27510445 http://dx.doi.org/10.1186/s12920-016-0203-8 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Wang, Jian
Zhang, Jianhai
An, Yuan
Lin, Hongfei
Yang, Zhihao
Zhang, Yijia
Sun, Yuanyuan
Biomedical event trigger detection by dependency-based word embedding
title Biomedical event trigger detection by dependency-based word embedding
title_full Biomedical event trigger detection by dependency-based word embedding
title_fullStr Biomedical event trigger detection by dependency-based word embedding
title_full_unstemmed Biomedical event trigger detection by dependency-based word embedding
title_short Biomedical event trigger detection by dependency-based word embedding
title_sort biomedical event trigger detection by dependency-based word embedding
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4980775/
https://www.ncbi.nlm.nih.gov/pubmed/27510445
http://dx.doi.org/10.1186/s12920-016-0203-8
work_keys_str_mv AT wangjian biomedicaleventtriggerdetectionbydependencybasedwordembedding
AT zhangjianhai biomedicaleventtriggerdetectionbydependencybasedwordembedding
AT anyuan biomedicaleventtriggerdetectionbydependencybasedwordembedding
AT linhongfei biomedicaleventtriggerdetectionbydependencybasedwordembedding
AT yangzhihao biomedicaleventtriggerdetectionbydependencybasedwordembedding
AT zhangyijia biomedicaleventtriggerdetectionbydependencybasedwordembedding
AT sunyuanyuan biomedicaleventtriggerdetectionbydependencybasedwordembedding