Cargando…

Lexicon and attention-based named entity recognition for kiwifruit diseases and pests: A Deep learning approach

Named Entity Recognition (NER) is a crucial step in mining information from massive agricultural texts, which is required in the construction of many knowledge-based agricultural support systems, such as agricultural technology question answering systems. The vital domain characteristics of Chinese...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Lilin, Nie, Xiaolin, Zhang, Mingmei, Gu, Mingyang, Geissen, Violette, Ritsema, Coen J., Niu, Dangdang, Zhang, Hongming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9714304/
https://www.ncbi.nlm.nih.gov/pubmed/36466267
http://dx.doi.org/10.3389/fpls.2022.1053449
_version_ 1784842194221268992
author Zhang, Lilin
Nie, Xiaolin
Zhang, Mingmei
Gu, Mingyang
Geissen, Violette
Ritsema, Coen J.
Niu, Dangdang
Zhang, Hongming
author_facet Zhang, Lilin
Nie, Xiaolin
Zhang, Mingmei
Gu, Mingyang
Geissen, Violette
Ritsema, Coen J.
Niu, Dangdang
Zhang, Hongming
author_sort Zhang, Lilin
collection PubMed
description Named Entity Recognition (NER) is a crucial step in mining information from massive agricultural texts, which is required in the construction of many knowledge-based agricultural support systems, such as agricultural technology question answering systems. The vital domain characteristics of Chinese agricultural text cause the Chinese NER (CNER) in kiwifruit diseases and pests to suffer from the insensitivity of common word segmentation tools to kiwifruit-related texts and the feature extraction capability of the sequence encoding layer being challenged. In order to alleviate the above problems, effectively mine information from kiwifruit-related texts to provide support for agricultural support systems such as agricultural question answering systems, this study constructed a novel Chinese agricultural NER (CANER) model KIWINER by statistics-based new word detection and two novel modules, AttSoftlexicon (Criss-cross attention-based Softlexicon) and PCAT (Parallel connection criss-cross attention), proposed in this paper. Specifically, new words were detected to improve the adaptability of word segmentation tools to kiwifruit-related texts, thereby constructing a kiwifruit lexicon. The AttSoftlexicon integrates word information into the model and makes full use of the word information with the help of Criss-cross attention network (CCNet). And the PCAT improves the feature extraction ability of sequence encoding layer through CCNet and parallel connection structure. The performance of KIWINER was evaluated on four datasets, namely KIWID (Self-annotated), Boson, ClueNER, and People’s Daily, which achieved optimal F(1)-scores of 88.94%, 85.13%, 80.52%, and 92.82%, respectively. Experimental results in many aspects illustrated that methods proposed in this paper can effectively improve the recognition effect of kiwifruit diseases and pests named entities, especially for diseases and pests with strong domain characteristics
format Online
Article
Text
id pubmed-9714304
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-97143042022-12-02 Lexicon and attention-based named entity recognition for kiwifruit diseases and pests: A Deep learning approach Zhang, Lilin Nie, Xiaolin Zhang, Mingmei Gu, Mingyang Geissen, Violette Ritsema, Coen J. Niu, Dangdang Zhang, Hongming Front Plant Sci Plant Science Named Entity Recognition (NER) is a crucial step in mining information from massive agricultural texts, which is required in the construction of many knowledge-based agricultural support systems, such as agricultural technology question answering systems. The vital domain characteristics of Chinese agricultural text cause the Chinese NER (CNER) in kiwifruit diseases and pests to suffer from the insensitivity of common word segmentation tools to kiwifruit-related texts and the feature extraction capability of the sequence encoding layer being challenged. In order to alleviate the above problems, effectively mine information from kiwifruit-related texts to provide support for agricultural support systems such as agricultural question answering systems, this study constructed a novel Chinese agricultural NER (CANER) model KIWINER by statistics-based new word detection and two novel modules, AttSoftlexicon (Criss-cross attention-based Softlexicon) and PCAT (Parallel connection criss-cross attention), proposed in this paper. Specifically, new words were detected to improve the adaptability of word segmentation tools to kiwifruit-related texts, thereby constructing a kiwifruit lexicon. The AttSoftlexicon integrates word information into the model and makes full use of the word information with the help of Criss-cross attention network (CCNet). And the PCAT improves the feature extraction ability of sequence encoding layer through CCNet and parallel connection structure. The performance of KIWINER was evaluated on four datasets, namely KIWID (Self-annotated), Boson, ClueNER, and People’s Daily, which achieved optimal F(1)-scores of 88.94%, 85.13%, 80.52%, and 92.82%, respectively. Experimental results in many aspects illustrated that methods proposed in this paper can effectively improve the recognition effect of kiwifruit diseases and pests named entities, especially for diseases and pests with strong domain characteristics Frontiers Media S.A. 2022-11-17 /pmc/articles/PMC9714304/ /pubmed/36466267 http://dx.doi.org/10.3389/fpls.2022.1053449 Text en Copyright © 2022 Zhang, Nie, Zhang, Gu, Geissen, Ritsema, Niu and Zhang https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Zhang, Lilin
Nie, Xiaolin
Zhang, Mingmei
Gu, Mingyang
Geissen, Violette
Ritsema, Coen J.
Niu, Dangdang
Zhang, Hongming
Lexicon and attention-based named entity recognition for kiwifruit diseases and pests: A Deep learning approach
title Lexicon and attention-based named entity recognition for kiwifruit diseases and pests: A Deep learning approach
title_full Lexicon and attention-based named entity recognition for kiwifruit diseases and pests: A Deep learning approach
title_fullStr Lexicon and attention-based named entity recognition for kiwifruit diseases and pests: A Deep learning approach
title_full_unstemmed Lexicon and attention-based named entity recognition for kiwifruit diseases and pests: A Deep learning approach
title_short Lexicon and attention-based named entity recognition for kiwifruit diseases and pests: A Deep learning approach
title_sort lexicon and attention-based named entity recognition for kiwifruit diseases and pests: a deep learning approach
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9714304/
https://www.ncbi.nlm.nih.gov/pubmed/36466267
http://dx.doi.org/10.3389/fpls.2022.1053449
work_keys_str_mv AT zhanglilin lexiconandattentionbasednamedentityrecognitionforkiwifruitdiseasesandpestsadeeplearningapproach
AT niexiaolin lexiconandattentionbasednamedentityrecognitionforkiwifruitdiseasesandpestsadeeplearningapproach
AT zhangmingmei lexiconandattentionbasednamedentityrecognitionforkiwifruitdiseasesandpestsadeeplearningapproach
AT gumingyang lexiconandattentionbasednamedentityrecognitionforkiwifruitdiseasesandpestsadeeplearningapproach
AT geissenviolette lexiconandattentionbasednamedentityrecognitionforkiwifruitdiseasesandpestsadeeplearningapproach
AT ritsemacoenj lexiconandattentionbasednamedentityrecognitionforkiwifruitdiseasesandpestsadeeplearningapproach
AT niudangdang lexiconandattentionbasednamedentityrecognitionforkiwifruitdiseasesandpestsadeeplearningapproach
AT zhanghongming lexiconandattentionbasednamedentityrecognitionforkiwifruitdiseasesandpestsadeeplearningapproach