Cargando…
Predicting innovative firms using web mining and deep learning
Evidence-based STI (science, technology, and innovation) policy making requires accurate indicators of innovation in order to promote economic growth. However, traditional indicators from patents and questionnaire-based surveys often lack coverage, granularity as well as timeliness and may involve h...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8016297/ https://www.ncbi.nlm.nih.gov/pubmed/33793626 http://dx.doi.org/10.1371/journal.pone.0249071 |
_version_ | 1783673830615547904 |
---|---|
author | Kinne, Jan Lenz, David |
author_facet | Kinne, Jan Lenz, David |
author_sort | Kinne, Jan |
collection | PubMed |
description | Evidence-based STI (science, technology, and innovation) policy making requires accurate indicators of innovation in order to promote economic growth. However, traditional indicators from patents and questionnaire-based surveys often lack coverage, granularity as well as timeliness and may involve high data collection costs, especially when conducted at a large scale. Consequently, they struggle to provide policy makers and scientists with the full picture of the current state of the innovation system. In this paper, we propose a first approach on generating web-based innovation indicators which may have the potential to overcome some of the shortcomings of traditional indicators. Specifically, we develop a method to identify product innovator firms at a large scale and very low costs. We use traditional firm-level indicators from a questionnaire-based innovation survey (German Community Innovation Survey) to train an artificial neural network classification model on labelled (product innovator/no product innovator) web texts of surveyed firms. Subsequently, we apply this classification model to the web texts of hundreds of thousands of firms in Germany to predict whether they are product innovators or not. We then compare these predictions to firm-level patent statistics, survey extrapolation benchmark data, and regional innovation indicators. The results show that our approach produces reliable predictions and has the potential to be a valuable and highly cost-efficient addition to the existing set of innovation indicators, especially due to its coverage and regional granularity. |
format | Online Article Text |
id | pubmed-8016297 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-80162972021-04-08 Predicting innovative firms using web mining and deep learning Kinne, Jan Lenz, David PLoS One Research Article Evidence-based STI (science, technology, and innovation) policy making requires accurate indicators of innovation in order to promote economic growth. However, traditional indicators from patents and questionnaire-based surveys often lack coverage, granularity as well as timeliness and may involve high data collection costs, especially when conducted at a large scale. Consequently, they struggle to provide policy makers and scientists with the full picture of the current state of the innovation system. In this paper, we propose a first approach on generating web-based innovation indicators which may have the potential to overcome some of the shortcomings of traditional indicators. Specifically, we develop a method to identify product innovator firms at a large scale and very low costs. We use traditional firm-level indicators from a questionnaire-based innovation survey (German Community Innovation Survey) to train an artificial neural network classification model on labelled (product innovator/no product innovator) web texts of surveyed firms. Subsequently, we apply this classification model to the web texts of hundreds of thousands of firms in Germany to predict whether they are product innovators or not. We then compare these predictions to firm-level patent statistics, survey extrapolation benchmark data, and regional innovation indicators. The results show that our approach produces reliable predictions and has the potential to be a valuable and highly cost-efficient addition to the existing set of innovation indicators, especially due to its coverage and regional granularity. Public Library of Science 2021-04-01 /pmc/articles/PMC8016297/ /pubmed/33793626 http://dx.doi.org/10.1371/journal.pone.0249071 Text en © 2021 Kinne, Lenz http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Kinne, Jan Lenz, David Predicting innovative firms using web mining and deep learning |
title | Predicting innovative firms using web mining and deep learning |
title_full | Predicting innovative firms using web mining and deep learning |
title_fullStr | Predicting innovative firms using web mining and deep learning |
title_full_unstemmed | Predicting innovative firms using web mining and deep learning |
title_short | Predicting innovative firms using web mining and deep learning |
title_sort | predicting innovative firms using web mining and deep learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8016297/ https://www.ncbi.nlm.nih.gov/pubmed/33793626 http://dx.doi.org/10.1371/journal.pone.0249071 |
work_keys_str_mv | AT kinnejan predictinginnovativefirmsusingwebmininganddeeplearning AT lenzdavid predictinginnovativefirmsusingwebmininganddeeplearning |