Cargando…
The proximity of ideas: An analysis of patent text using machine learning
This paper introduces a measure of the proximity in ideas using unsupervised machine learning. Knowledge transfers are considered a key driving force of innovation and regional economic growth. I explore knowledge relationships by deriving vector space representations of a patent’s abstract text usi...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7347140/ https://www.ncbi.nlm.nih.gov/pubmed/32645050 http://dx.doi.org/10.1371/journal.pone.0234880 |
_version_ | 1783556536030724096 |
---|---|
author | Feng, Sijie |
author_facet | Feng, Sijie |
author_sort | Feng, Sijie |
collection | PubMed |
description | This paper introduces a measure of the proximity in ideas using unsupervised machine learning. Knowledge transfers are considered a key driving force of innovation and regional economic growth. I explore knowledge relationships by deriving vector space representations of a patent’s abstract text using Document Vectors (Doc2Vec), and using cosine similarity to measure their proximity in ideas space. I illustrate the potential uses of this method with an application to geographic localization in knowledge spillovers. For patents in the same technology field, their normalized text similarity is 0.02-0.05 S.D.s higher if they are located within the same city, compared to patents from other cities. This effect is much smaller than when knowledge transfers are measured using normalized patent citations: local patents receive about 0.23-0.30 S.D.s more local citations than compared to non-local control patents. These findings suggest that the effect of geography on knowledge transfers may be much smaller than the previous literature using citations suggests. |
format | Online Article Text |
id | pubmed-7347140 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-73471402020-07-20 The proximity of ideas: An analysis of patent text using machine learning Feng, Sijie PLoS One Research Article This paper introduces a measure of the proximity in ideas using unsupervised machine learning. Knowledge transfers are considered a key driving force of innovation and regional economic growth. I explore knowledge relationships by deriving vector space representations of a patent’s abstract text using Document Vectors (Doc2Vec), and using cosine similarity to measure their proximity in ideas space. I illustrate the potential uses of this method with an application to geographic localization in knowledge spillovers. For patents in the same technology field, their normalized text similarity is 0.02-0.05 S.D.s higher if they are located within the same city, compared to patents from other cities. This effect is much smaller than when knowledge transfers are measured using normalized patent citations: local patents receive about 0.23-0.30 S.D.s more local citations than compared to non-local control patents. These findings suggest that the effect of geography on knowledge transfers may be much smaller than the previous literature using citations suggests. Public Library of Science 2020-07-09 /pmc/articles/PMC7347140/ /pubmed/32645050 http://dx.doi.org/10.1371/journal.pone.0234880 Text en © 2020 Sijie Feng http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Feng, Sijie The proximity of ideas: An analysis of patent text using machine learning |
title | The proximity of ideas: An analysis of patent text using machine learning |
title_full | The proximity of ideas: An analysis of patent text using machine learning |
title_fullStr | The proximity of ideas: An analysis of patent text using machine learning |
title_full_unstemmed | The proximity of ideas: An analysis of patent text using machine learning |
title_short | The proximity of ideas: An analysis of patent text using machine learning |
title_sort | proximity of ideas: an analysis of patent text using machine learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7347140/ https://www.ncbi.nlm.nih.gov/pubmed/32645050 http://dx.doi.org/10.1371/journal.pone.0234880 |
work_keys_str_mv | AT fengsijie theproximityofideasananalysisofpatenttextusingmachinelearning AT fengsijie proximityofideasananalysisofpatenttextusingmachinelearning |