Cargando…

Research on named entity recognition method of marine natural products based on attention mechanism

Marine natural product (MNP) entity property information is the basis of marine drug development, and this entity property information can be obtained from the original literature. However, the traditional methods require several manual annotations, the accuracy of the model is low and slow, and the...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Xiaodong, Yu, Rilei, Gao, Chunxiao, Wei, Zhiqiang, Xia, Yimin, Wang, Xiaowei, Liu, Hao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9944735/
https://www.ncbi.nlm.nih.gov/pubmed/36846857
http://dx.doi.org/10.3389/fchem.2023.958002
_version_ 1784891980422053888
author Ma, Xiaodong
Yu, Rilei
Gao, Chunxiao
Wei, Zhiqiang
Xia, Yimin
Wang, Xiaowei
Liu, Hao
author_facet Ma, Xiaodong
Yu, Rilei
Gao, Chunxiao
Wei, Zhiqiang
Xia, Yimin
Wang, Xiaowei
Liu, Hao
author_sort Ma, Xiaodong
collection PubMed
description Marine natural product (MNP) entity property information is the basis of marine drug development, and this entity property information can be obtained from the original literature. However, the traditional methods require several manual annotations, the accuracy of the model is low and slow, and the problem of inconsistent lexical contexts cannot be solved well. In order to solve the aforementioned problems, this study proposes a named entity recognition method based on the attention mechanism, inflated convolutional neural network (IDCNN), and conditional random field (CRF), combining the attention mechanism that can use the lexicality of words to make attention-weighted mentions of the extracted features, the ability of the inflated convolutional neural network to parallelize operations and long- and short-term memory, and the excellent learning ability. A named entity recognition algorithm model is developed for the automatic recognition of entity information in the MNP domain literature. Experiments demonstrate that the proposed model can properly identify entity information from the unstructured chapter-level literature and outperform the control model in several metrics. In addition, we construct an unstructured text dataset related to MNPs from an open-source dataset, which can be used for the research and development of resource scarcity scenarios.
format Online
Article
Text
id pubmed-9944735
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-99447352023-02-23 Research on named entity recognition method of marine natural products based on attention mechanism Ma, Xiaodong Yu, Rilei Gao, Chunxiao Wei, Zhiqiang Xia, Yimin Wang, Xiaowei Liu, Hao Front Chem Chemistry Marine natural product (MNP) entity property information is the basis of marine drug development, and this entity property information can be obtained from the original literature. However, the traditional methods require several manual annotations, the accuracy of the model is low and slow, and the problem of inconsistent lexical contexts cannot be solved well. In order to solve the aforementioned problems, this study proposes a named entity recognition method based on the attention mechanism, inflated convolutional neural network (IDCNN), and conditional random field (CRF), combining the attention mechanism that can use the lexicality of words to make attention-weighted mentions of the extracted features, the ability of the inflated convolutional neural network to parallelize operations and long- and short-term memory, and the excellent learning ability. A named entity recognition algorithm model is developed for the automatic recognition of entity information in the MNP domain literature. Experiments demonstrate that the proposed model can properly identify entity information from the unstructured chapter-level literature and outperform the control model in several metrics. In addition, we construct an unstructured text dataset related to MNPs from an open-source dataset, which can be used for the research and development of resource scarcity scenarios. Frontiers Media S.A. 2023-02-08 /pmc/articles/PMC9944735/ /pubmed/36846857 http://dx.doi.org/10.3389/fchem.2023.958002 Text en Copyright © 2023 Ma, Yu, Gao, Wei, Xia, Wang and Liu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Chemistry
Ma, Xiaodong
Yu, Rilei
Gao, Chunxiao
Wei, Zhiqiang
Xia, Yimin
Wang, Xiaowei
Liu, Hao
Research on named entity recognition method of marine natural products based on attention mechanism
title Research on named entity recognition method of marine natural products based on attention mechanism
title_full Research on named entity recognition method of marine natural products based on attention mechanism
title_fullStr Research on named entity recognition method of marine natural products based on attention mechanism
title_full_unstemmed Research on named entity recognition method of marine natural products based on attention mechanism
title_short Research on named entity recognition method of marine natural products based on attention mechanism
title_sort research on named entity recognition method of marine natural products based on attention mechanism
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9944735/
https://www.ncbi.nlm.nih.gov/pubmed/36846857
http://dx.doi.org/10.3389/fchem.2023.958002
work_keys_str_mv AT maxiaodong researchonnamedentityrecognitionmethodofmarinenaturalproductsbasedonattentionmechanism
AT yurilei researchonnamedentityrecognitionmethodofmarinenaturalproductsbasedonattentionmechanism
AT gaochunxiao researchonnamedentityrecognitionmethodofmarinenaturalproductsbasedonattentionmechanism
AT weizhiqiang researchonnamedentityrecognitionmethodofmarinenaturalproductsbasedonattentionmechanism
AT xiayimin researchonnamedentityrecognitionmethodofmarinenaturalproductsbasedonattentionmechanism
AT wangxiaowei researchonnamedentityrecognitionmethodofmarinenaturalproductsbasedonattentionmechanism
AT liuhao researchonnamedentityrecognitionmethodofmarinenaturalproductsbasedonattentionmechanism