Cargando…
Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching
Similar judicial case matching aims to enable an accurate selection of a judicial document that is most similar to the target document from multiple candidates. The core of similar judicial case matching is to calculate the similarity between two fact case documents. Owing to similar judicial case m...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8064799/ https://www.ncbi.nlm.nih.gov/pubmed/33953738 http://dx.doi.org/10.1155/2021/6650962 |
_version_ | 1783682211493445632 |
---|---|
author | Li, Jiamin Liu, Xingbo Nie, Xiushan Ma, Lele Li, Peng Zhang, Kai Yin, Yilong |
author_facet | Li, Jiamin Liu, Xingbo Nie, Xiushan Ma, Lele Li, Peng Zhang, Kai Yin, Yilong |
author_sort | Li, Jiamin |
collection | PubMed |
description | Similar judicial case matching aims to enable an accurate selection of a judicial document that is most similar to the target document from multiple candidates. The core of similar judicial case matching is to calculate the similarity between two fact case documents. Owing to similar judicial case matching techniques, legal professionals can promptly find and judge similar cases in a candidate set. These techniques can also benefit the development of judicial systems. However, the document of judicial cases not only is long in length but also has a certain degree of structural complexity. Meanwhile, a variety of judicial cases are also increasing rapidly; thus, it is difficult to find the document most similar to the target document in a large corpus. In this study, we present a novel similar judicial case matching model, which obtains the weight of judicial feature attributes based on hash learning and realizes fast similar matching by using a binary code. The proposed model extracts the judicial feature attributes vector using the bidirectional encoder representations from transformers (BERT) model and subsequently obtains the weighted judicial feature attributes through learning the hash function. We further impose triplet constraints to ensure that the similarity of judicial case data is well preserved when projected into the Hamming space. Comprehensive experimental results on public datasets show that the proposed method is superior in the task of similar judicial case matching and is suitable for large-scale similar judicial case matching. |
format | Online Article Text |
id | pubmed-8064799 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-80647992021-05-04 Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching Li, Jiamin Liu, Xingbo Nie, Xiushan Ma, Lele Li, Peng Zhang, Kai Yin, Yilong Comput Intell Neurosci Research Article Similar judicial case matching aims to enable an accurate selection of a judicial document that is most similar to the target document from multiple candidates. The core of similar judicial case matching is to calculate the similarity between two fact case documents. Owing to similar judicial case matching techniques, legal professionals can promptly find and judge similar cases in a candidate set. These techniques can also benefit the development of judicial systems. However, the document of judicial cases not only is long in length but also has a certain degree of structural complexity. Meanwhile, a variety of judicial cases are also increasing rapidly; thus, it is difficult to find the document most similar to the target document in a large corpus. In this study, we present a novel similar judicial case matching model, which obtains the weight of judicial feature attributes based on hash learning and realizes fast similar matching by using a binary code. The proposed model extracts the judicial feature attributes vector using the bidirectional encoder representations from transformers (BERT) model and subsequently obtains the weighted judicial feature attributes through learning the hash function. We further impose triplet constraints to ensure that the similarity of judicial case data is well preserved when projected into the Hamming space. Comprehensive experimental results on public datasets show that the proposed method is superior in the task of similar judicial case matching and is suitable for large-scale similar judicial case matching. Hindawi 2021-04-16 /pmc/articles/PMC8064799/ /pubmed/33953738 http://dx.doi.org/10.1155/2021/6650962 Text en Copyright © 2021 Jiamin Li et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Li, Jiamin Liu, Xingbo Nie, Xiushan Ma, Lele Li, Peng Zhang, Kai Yin, Yilong Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching |
title | Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching |
title_full | Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching |
title_fullStr | Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching |
title_full_unstemmed | Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching |
title_short | Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching |
title_sort | weighted-attribute triplet hashing for large-scale similar judicial case matching |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8064799/ https://www.ncbi.nlm.nih.gov/pubmed/33953738 http://dx.doi.org/10.1155/2021/6650962 |
work_keys_str_mv | AT lijiamin weightedattributetriplethashingforlargescalesimilarjudicialcasematching AT liuxingbo weightedattributetriplethashingforlargescalesimilarjudicialcasematching AT niexiushan weightedattributetriplethashingforlargescalesimilarjudicialcasematching AT malele weightedattributetriplethashingforlargescalesimilarjudicialcasematching AT lipeng weightedattributetriplethashingforlargescalesimilarjudicialcasematching AT zhangkai weightedattributetriplethashingforlargescalesimilarjudicialcasematching AT yinyilong weightedattributetriplethashingforlargescalesimilarjudicialcasematching |