Cargando…

Automating the search for a patent’s prior art with a full text similarity search

More than ever, technical inventions are the symbol of our society’s advance. Patents guarantee their creators protection against infringement. For an invention being patentable, its novelty and inventiveness have to be assessed. Therefore, a search for published work that describes similar inventio...

Descripción completa

Detalles Bibliográficos
Autores principales: Helmers, Lea, Horn, Franziska, Biegler, Franziska, Oppermann, Tim, Müller, Klaus-Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6398827/
https://www.ncbi.nlm.nih.gov/pubmed/30830911
http://dx.doi.org/10.1371/journal.pone.0212103
_version_ 1783399648301416448
author Helmers, Lea
Horn, Franziska
Biegler, Franziska
Oppermann, Tim
Müller, Klaus-Robert
author_facet Helmers, Lea
Horn, Franziska
Biegler, Franziska
Oppermann, Tim
Müller, Klaus-Robert
author_sort Helmers, Lea
collection PubMed
description More than ever, technical inventions are the symbol of our society’s advance. Patents guarantee their creators protection against infringement. For an invention being patentable, its novelty and inventiveness have to be assessed. Therefore, a search for published work that describes similar inventions to a given patent application needs to be performed. Currently, this so-called search for prior art is executed with semi-automatically composed keyword queries, which is not only time consuming, but also prone to errors. In particular, errors may systematically arise by the fact that different keywords for the same technical concepts may exist across disciplines. In this paper, a novel approach is proposed, where the full text of a given patent application is compared to existing patents using machine learning and natural language processing techniques to automatically detect inventions that are similar to the one described in the submitted document. Various state-of-the-art approaches for feature extraction and document comparison are evaluated. In addition to that, the quality of the current search process is assessed based on ratings of a domain expert. The evaluation results show that our automated approach, besides accelerating the search process, also improves the search results for prior art with respect to their quality.
format Online
Article
Text
id pubmed-6398827
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-63988272019-03-08 Automating the search for a patent’s prior art with a full text similarity search Helmers, Lea Horn, Franziska Biegler, Franziska Oppermann, Tim Müller, Klaus-Robert PLoS One Research Article More than ever, technical inventions are the symbol of our society’s advance. Patents guarantee their creators protection against infringement. For an invention being patentable, its novelty and inventiveness have to be assessed. Therefore, a search for published work that describes similar inventions to a given patent application needs to be performed. Currently, this so-called search for prior art is executed with semi-automatically composed keyword queries, which is not only time consuming, but also prone to errors. In particular, errors may systematically arise by the fact that different keywords for the same technical concepts may exist across disciplines. In this paper, a novel approach is proposed, where the full text of a given patent application is compared to existing patents using machine learning and natural language processing techniques to automatically detect inventions that are similar to the one described in the submitted document. Various state-of-the-art approaches for feature extraction and document comparison are evaluated. In addition to that, the quality of the current search process is assessed based on ratings of a domain expert. The evaluation results show that our automated approach, besides accelerating the search process, also improves the search results for prior art with respect to their quality. Public Library of Science 2019-03-04 /pmc/articles/PMC6398827/ /pubmed/30830911 http://dx.doi.org/10.1371/journal.pone.0212103 Text en © 2019 Helmers et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Helmers, Lea
Horn, Franziska
Biegler, Franziska
Oppermann, Tim
Müller, Klaus-Robert
Automating the search for a patent’s prior art with a full text similarity search
title Automating the search for a patent’s prior art with a full text similarity search
title_full Automating the search for a patent’s prior art with a full text similarity search
title_fullStr Automating the search for a patent’s prior art with a full text similarity search
title_full_unstemmed Automating the search for a patent’s prior art with a full text similarity search
title_short Automating the search for a patent’s prior art with a full text similarity search
title_sort automating the search for a patent’s prior art with a full text similarity search
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6398827/
https://www.ncbi.nlm.nih.gov/pubmed/30830911
http://dx.doi.org/10.1371/journal.pone.0212103
work_keys_str_mv AT helmerslea automatingthesearchforapatentspriorartwithafulltextsimilaritysearch
AT hornfranziska automatingthesearchforapatentspriorartwithafulltextsimilaritysearch
AT bieglerfranziska automatingthesearchforapatentspriorartwithafulltextsimilaritysearch
AT oppermanntim automatingthesearchforapatentspriorartwithafulltextsimilaritysearch
AT mullerklausrobert automatingthesearchforapatentspriorartwithafulltextsimilaritysearch