Cargando…

IPC prediction of patent documents using neural network with attention for hierarchical structure

International patent classifications (IPCs) are assigned to patent documents; however, since the procedure for assigning classifications is manually done by the patent examiner, it takes a lot of time and effort to select some IPCs from about 70,000 IPCs. Hence, some research has been conducted on p...

Descripción completa

Detalles Bibliográficos
Autores principales: Hoshino, Yuki, Utsumi, Yoshimasa, Matsuda, Yoshiro, Tanaka, Yoshitoshi, Nakata, Kazuhide
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9980776/
https://www.ncbi.nlm.nih.gov/pubmed/36862711
http://dx.doi.org/10.1371/journal.pone.0282361
_version_ 1784899961614237696
author Hoshino, Yuki
Utsumi, Yoshimasa
Matsuda, Yoshiro
Tanaka, Yoshitoshi
Nakata, Kazuhide
author_facet Hoshino, Yuki
Utsumi, Yoshimasa
Matsuda, Yoshiro
Tanaka, Yoshitoshi
Nakata, Kazuhide
author_sort Hoshino, Yuki
collection PubMed
description International patent classifications (IPCs) are assigned to patent documents; however, since the procedure for assigning classifications is manually done by the patent examiner, it takes a lot of time and effort to select some IPCs from about 70,000 IPCs. Hence, some research has been conducted on patent classification with machine learning. However, patent documents are very voluminous, and learning with all the claims (the part describing the content of the patent) as input would run out of the necessary memory, even if the batch size is set to a very small size. Therefore, most of the existing methods learn by excluding some information, such as using only the first claim as input. In this study, we propose a model that considers the contents of all claims by extracting important information for input. In addition, we focus on the hierarchical structure of the IPC, and propose a new decoder architecture to consider it. Finally, we conducted an experiment using actual patent data to verify the accuracy of the prediction. The results showed a significant improvement in accuracy compared to existing methods, and the actual applicability of the method was also discussed.
format Online
Article
Text
id pubmed-9980776
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-99807762023-03-03 IPC prediction of patent documents using neural network with attention for hierarchical structure Hoshino, Yuki Utsumi, Yoshimasa Matsuda, Yoshiro Tanaka, Yoshitoshi Nakata, Kazuhide PLoS One Research Article International patent classifications (IPCs) are assigned to patent documents; however, since the procedure for assigning classifications is manually done by the patent examiner, it takes a lot of time and effort to select some IPCs from about 70,000 IPCs. Hence, some research has been conducted on patent classification with machine learning. However, patent documents are very voluminous, and learning with all the claims (the part describing the content of the patent) as input would run out of the necessary memory, even if the batch size is set to a very small size. Therefore, most of the existing methods learn by excluding some information, such as using only the first claim as input. In this study, we propose a model that considers the contents of all claims by extracting important information for input. In addition, we focus on the hierarchical structure of the IPC, and propose a new decoder architecture to consider it. Finally, we conducted an experiment using actual patent data to verify the accuracy of the prediction. The results showed a significant improvement in accuracy compared to existing methods, and the actual applicability of the method was also discussed. Public Library of Science 2023-03-02 /pmc/articles/PMC9980776/ /pubmed/36862711 http://dx.doi.org/10.1371/journal.pone.0282361 Text en © 2023 Hoshino et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Hoshino, Yuki
Utsumi, Yoshimasa
Matsuda, Yoshiro
Tanaka, Yoshitoshi
Nakata, Kazuhide
IPC prediction of patent documents using neural network with attention for hierarchical structure
title IPC prediction of patent documents using neural network with attention for hierarchical structure
title_full IPC prediction of patent documents using neural network with attention for hierarchical structure
title_fullStr IPC prediction of patent documents using neural network with attention for hierarchical structure
title_full_unstemmed IPC prediction of patent documents using neural network with attention for hierarchical structure
title_short IPC prediction of patent documents using neural network with attention for hierarchical structure
title_sort ipc prediction of patent documents using neural network with attention for hierarchical structure
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9980776/
https://www.ncbi.nlm.nih.gov/pubmed/36862711
http://dx.doi.org/10.1371/journal.pone.0282361
work_keys_str_mv AT hoshinoyuki ipcpredictionofpatentdocumentsusingneuralnetworkwithattentionforhierarchicalstructure
AT utsumiyoshimasa ipcpredictionofpatentdocumentsusingneuralnetworkwithattentionforhierarchicalstructure
AT matsudayoshiro ipcpredictionofpatentdocumentsusingneuralnetworkwithattentionforhierarchicalstructure
AT tanakayoshitoshi ipcpredictionofpatentdocumentsusingneuralnetworkwithattentionforhierarchicalstructure
AT nakatakazuhide ipcpredictionofpatentdocumentsusingneuralnetworkwithattentionforhierarchicalstructure