Cargando…

Can we predict firms’ innovativeness? The identification of innovation performers in an Italian region through a supervised learning approach

The study shows the feasibility of predicting firms’ expenditures in innovation, as reported in the Community Innovation Survey, applying a supervised machine-learning approach on a sample of Italian firms. Using an integrated dataset of administrative records and balance sheet data, designed to inc...

Descripción completa

Detalles Bibliográficos
Autores principales: Gandin, Ilaria, Cozza, Claudio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6559647/
https://www.ncbi.nlm.nih.gov/pubmed/31185045
http://dx.doi.org/10.1371/journal.pone.0218175
_version_ 1783425836665274368
author Gandin, Ilaria
Cozza, Claudio
author_facet Gandin, Ilaria
Cozza, Claudio
author_sort Gandin, Ilaria
collection PubMed
description The study shows the feasibility of predicting firms’ expenditures in innovation, as reported in the Community Innovation Survey, applying a supervised machine-learning approach on a sample of Italian firms. Using an integrated dataset of administrative records and balance sheet data, designed to include all informative variables related to innovation but also easily accessible for most of the cohort, random forest algorithm is implemented to obtain a classification model aimed to identify firms that are potential innovation performers. The performance of the classifier, estimated in terms of AUC, is 0.794. Although innovation investments do not always result in patenting, the model is able to identify 71.92% of firms with patents. More encouraging results emerge from the analysis of the inner working of the model: predictors identified as most important—such as firm size, sector belonging and investment in intangible assets—confirm previous findings of literature, but in a completely different framework. The outcomes of this study are considered relevant for both economic analysts, because it demonstrates the potential of data-driven models for understanding the nature of innovation behaviour, and practitioners, such as policymakers or venture capitalists, who can benefit by evidence-based tools in the decision-making process.
format Online
Article
Text
id pubmed-6559647
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-65596472019-06-17 Can we predict firms’ innovativeness? The identification of innovation performers in an Italian region through a supervised learning approach Gandin, Ilaria Cozza, Claudio PLoS One Research Article The study shows the feasibility of predicting firms’ expenditures in innovation, as reported in the Community Innovation Survey, applying a supervised machine-learning approach on a sample of Italian firms. Using an integrated dataset of administrative records and balance sheet data, designed to include all informative variables related to innovation but also easily accessible for most of the cohort, random forest algorithm is implemented to obtain a classification model aimed to identify firms that are potential innovation performers. The performance of the classifier, estimated in terms of AUC, is 0.794. Although innovation investments do not always result in patenting, the model is able to identify 71.92% of firms with patents. More encouraging results emerge from the analysis of the inner working of the model: predictors identified as most important—such as firm size, sector belonging and investment in intangible assets—confirm previous findings of literature, but in a completely different framework. The outcomes of this study are considered relevant for both economic analysts, because it demonstrates the potential of data-driven models for understanding the nature of innovation behaviour, and practitioners, such as policymakers or venture capitalists, who can benefit by evidence-based tools in the decision-making process. Public Library of Science 2019-06-11 /pmc/articles/PMC6559647/ /pubmed/31185045 http://dx.doi.org/10.1371/journal.pone.0218175 Text en © 2019 Gandin, Cozza http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Gandin, Ilaria
Cozza, Claudio
Can we predict firms’ innovativeness? The identification of innovation performers in an Italian region through a supervised learning approach
title Can we predict firms’ innovativeness? The identification of innovation performers in an Italian region through a supervised learning approach
title_full Can we predict firms’ innovativeness? The identification of innovation performers in an Italian region through a supervised learning approach
title_fullStr Can we predict firms’ innovativeness? The identification of innovation performers in an Italian region through a supervised learning approach
title_full_unstemmed Can we predict firms’ innovativeness? The identification of innovation performers in an Italian region through a supervised learning approach
title_short Can we predict firms’ innovativeness? The identification of innovation performers in an Italian region through a supervised learning approach
title_sort can we predict firms’ innovativeness? the identification of innovation performers in an italian region through a supervised learning approach
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6559647/
https://www.ncbi.nlm.nih.gov/pubmed/31185045
http://dx.doi.org/10.1371/journal.pone.0218175
work_keys_str_mv AT gandinilaria canwepredictfirmsinnovativenesstheidentificationofinnovationperformersinanitalianregionthroughasupervisedlearningapproach
AT cozzaclaudio canwepredictfirmsinnovativenesstheidentificationofinnovationperformersinanitalianregionthroughasupervisedlearningapproach