Cargando…

Legal Text Recognition Using LSTM-CRF Deep Learning Model

In legal texts, named entity recognition (NER) is researched using deep learning models. First, the bidirectional (Bi)-long short-term memory (LSTM)-conditional random field (CRF) model for studying NER in legal texts is established. Second, different annotation methods are used to compare and analy...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Hesheng, Hu, Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8947905/
https://www.ncbi.nlm.nih.gov/pubmed/35341203
http://dx.doi.org/10.1155/2022/9933929
_version_ 1784674547716325376
author Xu, Hesheng
Hu, Bin
author_facet Xu, Hesheng
Hu, Bin
author_sort Xu, Hesheng
collection PubMed
description In legal texts, named entity recognition (NER) is researched using deep learning models. First, the bidirectional (Bi)-long short-term memory (LSTM)-conditional random field (CRF) model for studying NER in legal texts is established. Second, different annotation methods are used to compare and analyze the entity recognition effect of the Bi-LSTM-CRF model. Finally, other objective loss functions are set to compare and analyze the entity recognition effect of the Bi-LSTM-CRF model. The research results show that the F1 value of the model trained on the word sequence labeling corpus on the named entity is 88.13%, higher than that of the word sequence labeling corpus. For the two types of entities, place names and organization names, the F1 values obtained by the Bi-LSTM-CRF model using word segmentation are 67.60% and 89.45%, respectively, higher than the F1 values obtained by the model using character segmentation. Therefore, the Bi-LSTM-CRF model using word segmentation is more suitable for recognizing extended entities. The parameter learning result using log-likelihood is better than that using the maximum interval criterion, and it is ideal for the Bi-LSTM-CRF model. This method provides ideas for the research of legal text recognition and has a particular value.
format Online
Article
Text
id pubmed-8947905
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-89479052022-03-25 Legal Text Recognition Using LSTM-CRF Deep Learning Model Xu, Hesheng Hu, Bin Comput Intell Neurosci Research Article In legal texts, named entity recognition (NER) is researched using deep learning models. First, the bidirectional (Bi)-long short-term memory (LSTM)-conditional random field (CRF) model for studying NER in legal texts is established. Second, different annotation methods are used to compare and analyze the entity recognition effect of the Bi-LSTM-CRF model. Finally, other objective loss functions are set to compare and analyze the entity recognition effect of the Bi-LSTM-CRF model. The research results show that the F1 value of the model trained on the word sequence labeling corpus on the named entity is 88.13%, higher than that of the word sequence labeling corpus. For the two types of entities, place names and organization names, the F1 values obtained by the Bi-LSTM-CRF model using word segmentation are 67.60% and 89.45%, respectively, higher than the F1 values obtained by the model using character segmentation. Therefore, the Bi-LSTM-CRF model using word segmentation is more suitable for recognizing extended entities. The parameter learning result using log-likelihood is better than that using the maximum interval criterion, and it is ideal for the Bi-LSTM-CRF model. This method provides ideas for the research of legal text recognition and has a particular value. Hindawi 2022-03-17 /pmc/articles/PMC8947905/ /pubmed/35341203 http://dx.doi.org/10.1155/2022/9933929 Text en Copyright © 2022 Hesheng Xu and Bin Hu. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Xu, Hesheng
Hu, Bin
Legal Text Recognition Using LSTM-CRF Deep Learning Model
title Legal Text Recognition Using LSTM-CRF Deep Learning Model
title_full Legal Text Recognition Using LSTM-CRF Deep Learning Model
title_fullStr Legal Text Recognition Using LSTM-CRF Deep Learning Model
title_full_unstemmed Legal Text Recognition Using LSTM-CRF Deep Learning Model
title_short Legal Text Recognition Using LSTM-CRF Deep Learning Model
title_sort legal text recognition using lstm-crf deep learning model
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8947905/
https://www.ncbi.nlm.nih.gov/pubmed/35341203
http://dx.doi.org/10.1155/2022/9933929
work_keys_str_mv AT xuhesheng legaltextrecognitionusinglstmcrfdeeplearningmodel
AT hubin legaltextrecognitionusinglstmcrfdeeplearningmodel