Cargando…

Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching

Similar judicial case matching aims to enable an accurate selection of a judicial document that is most similar to the target document from multiple candidates. The core of similar judicial case matching is to calculate the similarity between two fact case documents. Owing to similar judicial case m...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Jiamin, Liu, Xingbo, Nie, Xiushan, Ma, Lele, Li, Peng, Zhang, Kai, Yin, Yilong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8064799/
https://www.ncbi.nlm.nih.gov/pubmed/33953738
http://dx.doi.org/10.1155/2021/6650962
_version_ 1783682211493445632
author Li, Jiamin
Liu, Xingbo
Nie, Xiushan
Ma, Lele
Li, Peng
Zhang, Kai
Yin, Yilong
author_facet Li, Jiamin
Liu, Xingbo
Nie, Xiushan
Ma, Lele
Li, Peng
Zhang, Kai
Yin, Yilong
author_sort Li, Jiamin
collection PubMed
description Similar judicial case matching aims to enable an accurate selection of a judicial document that is most similar to the target document from multiple candidates. The core of similar judicial case matching is to calculate the similarity between two fact case documents. Owing to similar judicial case matching techniques, legal professionals can promptly find and judge similar cases in a candidate set. These techniques can also benefit the development of judicial systems. However, the document of judicial cases not only is long in length but also has a certain degree of structural complexity. Meanwhile, a variety of judicial cases are also increasing rapidly; thus, it is difficult to find the document most similar to the target document in a large corpus. In this study, we present a novel similar judicial case matching model, which obtains the weight of judicial feature attributes based on hash learning and realizes fast similar matching by using a binary code. The proposed model extracts the judicial feature attributes vector using the bidirectional encoder representations from transformers (BERT) model and subsequently obtains the weighted judicial feature attributes through learning the hash function. We further impose triplet constraints to ensure that the similarity of judicial case data is well preserved when projected into the Hamming space. Comprehensive experimental results on public datasets show that the proposed method is superior in the task of similar judicial case matching and is suitable for large-scale similar judicial case matching.
format Online
Article
Text
id pubmed-8064799
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-80647992021-05-04 Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching Li, Jiamin Liu, Xingbo Nie, Xiushan Ma, Lele Li, Peng Zhang, Kai Yin, Yilong Comput Intell Neurosci Research Article Similar judicial case matching aims to enable an accurate selection of a judicial document that is most similar to the target document from multiple candidates. The core of similar judicial case matching is to calculate the similarity between two fact case documents. Owing to similar judicial case matching techniques, legal professionals can promptly find and judge similar cases in a candidate set. These techniques can also benefit the development of judicial systems. However, the document of judicial cases not only is long in length but also has a certain degree of structural complexity. Meanwhile, a variety of judicial cases are also increasing rapidly; thus, it is difficult to find the document most similar to the target document in a large corpus. In this study, we present a novel similar judicial case matching model, which obtains the weight of judicial feature attributes based on hash learning and realizes fast similar matching by using a binary code. The proposed model extracts the judicial feature attributes vector using the bidirectional encoder representations from transformers (BERT) model and subsequently obtains the weighted judicial feature attributes through learning the hash function. We further impose triplet constraints to ensure that the similarity of judicial case data is well preserved when projected into the Hamming space. Comprehensive experimental results on public datasets show that the proposed method is superior in the task of similar judicial case matching and is suitable for large-scale similar judicial case matching. Hindawi 2021-04-16 /pmc/articles/PMC8064799/ /pubmed/33953738 http://dx.doi.org/10.1155/2021/6650962 Text en Copyright © 2021 Jiamin Li et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Li, Jiamin
Liu, Xingbo
Nie, Xiushan
Ma, Lele
Li, Peng
Zhang, Kai
Yin, Yilong
Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching
title Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching
title_full Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching
title_fullStr Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching
title_full_unstemmed Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching
title_short Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching
title_sort weighted-attribute triplet hashing for large-scale similar judicial case matching
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8064799/
https://www.ncbi.nlm.nih.gov/pubmed/33953738
http://dx.doi.org/10.1155/2021/6650962
work_keys_str_mv AT lijiamin weightedattributetriplethashingforlargescalesimilarjudicialcasematching
AT liuxingbo weightedattributetriplethashingforlargescalesimilarjudicialcasematching
AT niexiushan weightedattributetriplethashingforlargescalesimilarjudicialcasematching
AT malele weightedattributetriplethashingforlargescalesimilarjudicialcasematching
AT lipeng weightedattributetriplethashingforlargescalesimilarjudicialcasematching
AT zhangkai weightedattributetriplethashingforlargescalesimilarjudicialcasematching
AT yinyilong weightedattributetriplethashingforlargescalesimilarjudicialcasematching