Cargando…

Extraction of Important Factors in a High-Dimensional Data Space: An Application for High-Growth Firms

We introduce a new non-black-box method of extracting multiple areas in a high-dimensional big data space where data points that satisfy specific conditions are highly concentrated. First, we extract one-dimensional areas where the data that satisfy specific conditions are mostly gathered by using t...

Descripción completa

Detalles Bibliográficos
Autores principales: Wada, Takuya, Takayasu, Hideki, Takayasu, Misako
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10047971/
https://www.ncbi.nlm.nih.gov/pubmed/36981376
http://dx.doi.org/10.3390/e25030488
_version_ 1785014061435453440
author Wada, Takuya
Takayasu, Hideki
Takayasu, Misako
author_facet Wada, Takuya
Takayasu, Hideki
Takayasu, Misako
author_sort Wada, Takuya
collection PubMed
description We introduce a new non-black-box method of extracting multiple areas in a high-dimensional big data space where data points that satisfy specific conditions are highly concentrated. First, we extract one-dimensional areas where the data that satisfy specific conditions are mostly gathered by using the Bayesian method. Second, we construct higher-dimensional areas where the densities of focused data points are higher than the simple combination of the results for one dimension, and then we verify the results through data validation. Third, we apply this method to estimate the set of significant factors shared in successful firms with growth rates in sales at the top 1% level using 156-dimensional data of corporate financial reports for 12 years containing about 320,000 firms. We also categorize high-growth firms into 15 groups of different sets of factors.
format Online
Article
Text
id pubmed-10047971
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100479712023-03-29 Extraction of Important Factors in a High-Dimensional Data Space: An Application for High-Growth Firms Wada, Takuya Takayasu, Hideki Takayasu, Misako Entropy (Basel) Article We introduce a new non-black-box method of extracting multiple areas in a high-dimensional big data space where data points that satisfy specific conditions are highly concentrated. First, we extract one-dimensional areas where the data that satisfy specific conditions are mostly gathered by using the Bayesian method. Second, we construct higher-dimensional areas where the densities of focused data points are higher than the simple combination of the results for one dimension, and then we verify the results through data validation. Third, we apply this method to estimate the set of significant factors shared in successful firms with growth rates in sales at the top 1% level using 156-dimensional data of corporate financial reports for 12 years containing about 320,000 firms. We also categorize high-growth firms into 15 groups of different sets of factors. MDPI 2023-03-10 /pmc/articles/PMC10047971/ /pubmed/36981376 http://dx.doi.org/10.3390/e25030488 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wada, Takuya
Takayasu, Hideki
Takayasu, Misako
Extraction of Important Factors in a High-Dimensional Data Space: An Application for High-Growth Firms
title Extraction of Important Factors in a High-Dimensional Data Space: An Application for High-Growth Firms
title_full Extraction of Important Factors in a High-Dimensional Data Space: An Application for High-Growth Firms
title_fullStr Extraction of Important Factors in a High-Dimensional Data Space: An Application for High-Growth Firms
title_full_unstemmed Extraction of Important Factors in a High-Dimensional Data Space: An Application for High-Growth Firms
title_short Extraction of Important Factors in a High-Dimensional Data Space: An Application for High-Growth Firms
title_sort extraction of important factors in a high-dimensional data space: an application for high-growth firms
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10047971/
https://www.ncbi.nlm.nih.gov/pubmed/36981376
http://dx.doi.org/10.3390/e25030488
work_keys_str_mv AT wadatakuya extractionofimportantfactorsinahighdimensionaldataspaceanapplicationforhighgrowthfirms
AT takayasuhideki extractionofimportantfactorsinahighdimensionaldataspaceanapplicationforhighgrowthfirms
AT takayasumisako extractionofimportantfactorsinahighdimensionaldataspaceanapplicationforhighgrowthfirms