Cargando…
Biomedical event trigger detection by dependency-based word embedding
BACKGROUND: In biomedical research, events revealing complex relations between entities play an important role. Biomedical event trigger identification has become a research hotspot since its important role in biomedical event extraction. Traditional machine learning methods, such as support vector...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4980775/ https://www.ncbi.nlm.nih.gov/pubmed/27510445 http://dx.doi.org/10.1186/s12920-016-0203-8 |
_version_ | 1782447513113985024 |
---|---|
author | Wang, Jian Zhang, Jianhai An, Yuan Lin, Hongfei Yang, Zhihao Zhang, Yijia Sun, Yuanyuan |
author_facet | Wang, Jian Zhang, Jianhai An, Yuan Lin, Hongfei Yang, Zhihao Zhang, Yijia Sun, Yuanyuan |
author_sort | Wang, Jian |
collection | PubMed |
description | BACKGROUND: In biomedical research, events revealing complex relations between entities play an important role. Biomedical event trigger identification has become a research hotspot since its important role in biomedical event extraction. Traditional machine learning methods, such as support vector machines (SVM) and maxent classifiers, which aim to manually design powerful features fed to the classifiers, depend on the understanding of the specific task and cannot generalize to the new domain or new examples. METHODS: In this paper, we propose an approach which utilizes neural network model based on dependency-based word embedding to automatically learn significant features from raw input for trigger classification. First, we employ Word2vecf, the modified version of Word2vec, to learn word embedding with rich semantic and functional information based on dependency relation tree. Then neural network architecture is used to learn more significant feature representation based on raw dependency-based word embedding. Meanwhile, we dynamically adjust the embedding while training for adapting to the trigger classification task. Finally, softmax classifier labels the examples by specific trigger class using the features learned by the model. RESULTS: The experimental results show that our approach achieves a micro-averaging F1 score of 78.27 and a macro-averaging F1 score of 76.94 % in significant trigger classes, and performs better than baseline methods. In addition, we can achieve the semantic distributed representation of every trigger word. |
format | Online Article Text |
id | pubmed-4980775 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-49807752016-08-19 Biomedical event trigger detection by dependency-based word embedding Wang, Jian Zhang, Jianhai An, Yuan Lin, Hongfei Yang, Zhihao Zhang, Yijia Sun, Yuanyuan BMC Med Genomics Research BACKGROUND: In biomedical research, events revealing complex relations between entities play an important role. Biomedical event trigger identification has become a research hotspot since its important role in biomedical event extraction. Traditional machine learning methods, such as support vector machines (SVM) and maxent classifiers, which aim to manually design powerful features fed to the classifiers, depend on the understanding of the specific task and cannot generalize to the new domain or new examples. METHODS: In this paper, we propose an approach which utilizes neural network model based on dependency-based word embedding to automatically learn significant features from raw input for trigger classification. First, we employ Word2vecf, the modified version of Word2vec, to learn word embedding with rich semantic and functional information based on dependency relation tree. Then neural network architecture is used to learn more significant feature representation based on raw dependency-based word embedding. Meanwhile, we dynamically adjust the embedding while training for adapting to the trigger classification task. Finally, softmax classifier labels the examples by specific trigger class using the features learned by the model. RESULTS: The experimental results show that our approach achieves a micro-averaging F1 score of 78.27 and a macro-averaging F1 score of 76.94 % in significant trigger classes, and performs better than baseline methods. In addition, we can achieve the semantic distributed representation of every trigger word. BioMed Central 2016-08-10 /pmc/articles/PMC4980775/ /pubmed/27510445 http://dx.doi.org/10.1186/s12920-016-0203-8 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Wang, Jian Zhang, Jianhai An, Yuan Lin, Hongfei Yang, Zhihao Zhang, Yijia Sun, Yuanyuan Biomedical event trigger detection by dependency-based word embedding |
title | Biomedical event trigger detection by dependency-based word embedding |
title_full | Biomedical event trigger detection by dependency-based word embedding |
title_fullStr | Biomedical event trigger detection by dependency-based word embedding |
title_full_unstemmed | Biomedical event trigger detection by dependency-based word embedding |
title_short | Biomedical event trigger detection by dependency-based word embedding |
title_sort | biomedical event trigger detection by dependency-based word embedding |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4980775/ https://www.ncbi.nlm.nih.gov/pubmed/27510445 http://dx.doi.org/10.1186/s12920-016-0203-8 |
work_keys_str_mv | AT wangjian biomedicaleventtriggerdetectionbydependencybasedwordembedding AT zhangjianhai biomedicaleventtriggerdetectionbydependencybasedwordembedding AT anyuan biomedicaleventtriggerdetectionbydependencybasedwordembedding AT linhongfei biomedicaleventtriggerdetectionbydependencybasedwordembedding AT yangzhihao biomedicaleventtriggerdetectionbydependencybasedwordembedding AT zhangyijia biomedicaleventtriggerdetectionbydependencybasedwordembedding AT sunyuanyuan biomedicaleventtriggerdetectionbydependencybasedwordembedding |