Cargando…

Measuring the diffusion of innovations with paragraph vector topic models

Measuring the diffusion of innovations from textual data sources besides patent data has not been studied extensively. However, early and accurate indicators of innovation and the recognition of trends in innovation are mandatory to successfully promote economic growth through technological progress...

Descripción completa

Detalles Bibliográficos
Autores principales: Lenz, David, Winker, Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6976149/
https://www.ncbi.nlm.nih.gov/pubmed/31967999
http://dx.doi.org/10.1371/journal.pone.0226685
_version_ 1783490311539916800
author Lenz, David
Winker, Peter
author_facet Lenz, David
Winker, Peter
author_sort Lenz, David
collection PubMed
description Measuring the diffusion of innovations from textual data sources besides patent data has not been studied extensively. However, early and accurate indicators of innovation and the recognition of trends in innovation are mandatory to successfully promote economic growth through technological progress via evidence-based policy making. In this study, we propose Paragraph Vector Topic Model (PVTM) and apply it to technology-related news articles to analyze innovation-related topics over time and gain insights regarding their diffusion process. PVTM represents documents in a semantic space, which has been shown to capture latent variables of the underlying documents, e.g., the latent topics. Clusters of documents in the semantic space can then be interpreted and transformed into meaningful topics by means of Gaussian mixture modeling. In using PVTM, we identify innovation-related topics from 170, 000 technology news articles published over a span of 20 years and gather insights about their diffusion state by measuring the topic importance in the corpus over time. Our results suggest that PVTM is a credible alternative to widely used topic models for the discovery of latent topics in (technology-related) news articles. An examination of three exemplary topics shows that innovation diffusion could be assessed using topic importance measures derived from PVTM. Thereby, we find that PVTM diffusion indicators for certain topics are Granger causal to Google Trend indices with matching search terms.
format Online
Article
Text
id pubmed-6976149
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-69761492020-02-04 Measuring the diffusion of innovations with paragraph vector topic models Lenz, David Winker, Peter PLoS One Research Article Measuring the diffusion of innovations from textual data sources besides patent data has not been studied extensively. However, early and accurate indicators of innovation and the recognition of trends in innovation are mandatory to successfully promote economic growth through technological progress via evidence-based policy making. In this study, we propose Paragraph Vector Topic Model (PVTM) and apply it to technology-related news articles to analyze innovation-related topics over time and gain insights regarding their diffusion process. PVTM represents documents in a semantic space, which has been shown to capture latent variables of the underlying documents, e.g., the latent topics. Clusters of documents in the semantic space can then be interpreted and transformed into meaningful topics by means of Gaussian mixture modeling. In using PVTM, we identify innovation-related topics from 170, 000 technology news articles published over a span of 20 years and gather insights about their diffusion state by measuring the topic importance in the corpus over time. Our results suggest that PVTM is a credible alternative to widely used topic models for the discovery of latent topics in (technology-related) news articles. An examination of three exemplary topics shows that innovation diffusion could be assessed using topic importance measures derived from PVTM. Thereby, we find that PVTM diffusion indicators for certain topics are Granger causal to Google Trend indices with matching search terms. Public Library of Science 2020-01-22 /pmc/articles/PMC6976149/ /pubmed/31967999 http://dx.doi.org/10.1371/journal.pone.0226685 Text en © 2020 Lenz, Winker http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lenz, David
Winker, Peter
Measuring the diffusion of innovations with paragraph vector topic models
title Measuring the diffusion of innovations with paragraph vector topic models
title_full Measuring the diffusion of innovations with paragraph vector topic models
title_fullStr Measuring the diffusion of innovations with paragraph vector topic models
title_full_unstemmed Measuring the diffusion of innovations with paragraph vector topic models
title_short Measuring the diffusion of innovations with paragraph vector topic models
title_sort measuring the diffusion of innovations with paragraph vector topic models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6976149/
https://www.ncbi.nlm.nih.gov/pubmed/31967999
http://dx.doi.org/10.1371/journal.pone.0226685
work_keys_str_mv AT lenzdavid measuringthediffusionofinnovationswithparagraphvectortopicmodels
AT winkerpeter measuringthediffusionofinnovationswithparagraphvectortopicmodels