Cargando…
Measuring the diffusion of innovations with paragraph vector topic models
Measuring the diffusion of innovations from textual data sources besides patent data has not been studied extensively. However, early and accurate indicators of innovation and the recognition of trends in innovation are mandatory to successfully promote economic growth through technological progress...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6976149/ https://www.ncbi.nlm.nih.gov/pubmed/31967999 http://dx.doi.org/10.1371/journal.pone.0226685 |
_version_ | 1783490311539916800 |
---|---|
author | Lenz, David Winker, Peter |
author_facet | Lenz, David Winker, Peter |
author_sort | Lenz, David |
collection | PubMed |
description | Measuring the diffusion of innovations from textual data sources besides patent data has not been studied extensively. However, early and accurate indicators of innovation and the recognition of trends in innovation are mandatory to successfully promote economic growth through technological progress via evidence-based policy making. In this study, we propose Paragraph Vector Topic Model (PVTM) and apply it to technology-related news articles to analyze innovation-related topics over time and gain insights regarding their diffusion process. PVTM represents documents in a semantic space, which has been shown to capture latent variables of the underlying documents, e.g., the latent topics. Clusters of documents in the semantic space can then be interpreted and transformed into meaningful topics by means of Gaussian mixture modeling. In using PVTM, we identify innovation-related topics from 170, 000 technology news articles published over a span of 20 years and gather insights about their diffusion state by measuring the topic importance in the corpus over time. Our results suggest that PVTM is a credible alternative to widely used topic models for the discovery of latent topics in (technology-related) news articles. An examination of three exemplary topics shows that innovation diffusion could be assessed using topic importance measures derived from PVTM. Thereby, we find that PVTM diffusion indicators for certain topics are Granger causal to Google Trend indices with matching search terms. |
format | Online Article Text |
id | pubmed-6976149 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-69761492020-02-04 Measuring the diffusion of innovations with paragraph vector topic models Lenz, David Winker, Peter PLoS One Research Article Measuring the diffusion of innovations from textual data sources besides patent data has not been studied extensively. However, early and accurate indicators of innovation and the recognition of trends in innovation are mandatory to successfully promote economic growth through technological progress via evidence-based policy making. In this study, we propose Paragraph Vector Topic Model (PVTM) and apply it to technology-related news articles to analyze innovation-related topics over time and gain insights regarding their diffusion process. PVTM represents documents in a semantic space, which has been shown to capture latent variables of the underlying documents, e.g., the latent topics. Clusters of documents in the semantic space can then be interpreted and transformed into meaningful topics by means of Gaussian mixture modeling. In using PVTM, we identify innovation-related topics from 170, 000 technology news articles published over a span of 20 years and gather insights about their diffusion state by measuring the topic importance in the corpus over time. Our results suggest that PVTM is a credible alternative to widely used topic models for the discovery of latent topics in (technology-related) news articles. An examination of three exemplary topics shows that innovation diffusion could be assessed using topic importance measures derived from PVTM. Thereby, we find that PVTM diffusion indicators for certain topics are Granger causal to Google Trend indices with matching search terms. Public Library of Science 2020-01-22 /pmc/articles/PMC6976149/ /pubmed/31967999 http://dx.doi.org/10.1371/journal.pone.0226685 Text en © 2020 Lenz, Winker http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Lenz, David Winker, Peter Measuring the diffusion of innovations with paragraph vector topic models |
title | Measuring the diffusion of innovations with paragraph vector topic models |
title_full | Measuring the diffusion of innovations with paragraph vector topic models |
title_fullStr | Measuring the diffusion of innovations with paragraph vector topic models |
title_full_unstemmed | Measuring the diffusion of innovations with paragraph vector topic models |
title_short | Measuring the diffusion of innovations with paragraph vector topic models |
title_sort | measuring the diffusion of innovations with paragraph vector topic models |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6976149/ https://www.ncbi.nlm.nih.gov/pubmed/31967999 http://dx.doi.org/10.1371/journal.pone.0226685 |
work_keys_str_mv | AT lenzdavid measuringthediffusionofinnovationswithparagraphvectortopicmodels AT winkerpeter measuringthediffusionofinnovationswithparagraphvectortopicmodels |