Cargando…

The proximity of ideas: An analysis of patent text using machine learning

This paper introduces a measure of the proximity in ideas using unsupervised machine learning. Knowledge transfers are considered a key driving force of innovation and regional economic growth. I explore knowledge relationships by deriving vector space representations of a patent’s abstract text usi...

Descripción completa

Detalles Bibliográficos
Autor principal: Feng, Sijie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7347140/
https://www.ncbi.nlm.nih.gov/pubmed/32645050
http://dx.doi.org/10.1371/journal.pone.0234880
_version_ 1783556536030724096
author Feng, Sijie
author_facet Feng, Sijie
author_sort Feng, Sijie
collection PubMed
description This paper introduces a measure of the proximity in ideas using unsupervised machine learning. Knowledge transfers are considered a key driving force of innovation and regional economic growth. I explore knowledge relationships by deriving vector space representations of a patent’s abstract text using Document Vectors (Doc2Vec), and using cosine similarity to measure their proximity in ideas space. I illustrate the potential uses of this method with an application to geographic localization in knowledge spillovers. For patents in the same technology field, their normalized text similarity is 0.02-0.05 S.D.s higher if they are located within the same city, compared to patents from other cities. This effect is much smaller than when knowledge transfers are measured using normalized patent citations: local patents receive about 0.23-0.30 S.D.s more local citations than compared to non-local control patents. These findings suggest that the effect of geography on knowledge transfers may be much smaller than the previous literature using citations suggests.
format Online
Article
Text
id pubmed-7347140
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-73471402020-07-20 The proximity of ideas: An analysis of patent text using machine learning Feng, Sijie PLoS One Research Article This paper introduces a measure of the proximity in ideas using unsupervised machine learning. Knowledge transfers are considered a key driving force of innovation and regional economic growth. I explore knowledge relationships by deriving vector space representations of a patent’s abstract text using Document Vectors (Doc2Vec), and using cosine similarity to measure their proximity in ideas space. I illustrate the potential uses of this method with an application to geographic localization in knowledge spillovers. For patents in the same technology field, their normalized text similarity is 0.02-0.05 S.D.s higher if they are located within the same city, compared to patents from other cities. This effect is much smaller than when knowledge transfers are measured using normalized patent citations: local patents receive about 0.23-0.30 S.D.s more local citations than compared to non-local control patents. These findings suggest that the effect of geography on knowledge transfers may be much smaller than the previous literature using citations suggests. Public Library of Science 2020-07-09 /pmc/articles/PMC7347140/ /pubmed/32645050 http://dx.doi.org/10.1371/journal.pone.0234880 Text en © 2020 Sijie Feng http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Feng, Sijie
The proximity of ideas: An analysis of patent text using machine learning
title The proximity of ideas: An analysis of patent text using machine learning
title_full The proximity of ideas: An analysis of patent text using machine learning
title_fullStr The proximity of ideas: An analysis of patent text using machine learning
title_full_unstemmed The proximity of ideas: An analysis of patent text using machine learning
title_short The proximity of ideas: An analysis of patent text using machine learning
title_sort proximity of ideas: an analysis of patent text using machine learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7347140/
https://www.ncbi.nlm.nih.gov/pubmed/32645050
http://dx.doi.org/10.1371/journal.pone.0234880
work_keys_str_mv AT fengsijie theproximityofideasananalysisofpatenttextusingmachinelearning
AT fengsijie proximityofideasananalysisofpatenttextusingmachinelearning