A nonparametric framework for inferring orders of categorical data from category-real pairs

Given a dataset of careers and incomes, how large is the difference in income between any pair of careers? Given a dataset of travel-time records, how much longer does a trip take when choosing public transportation mode A instead of B? In this paper, we propose a framework that is...

Bibliographic Details
Main Authors: Amornbunchornvej, Chainarong, Surasvadi, Navaporn, Plangprasopchok, Anon, Thajchayapong, Suttipong
Format: Online Article Text
Language: English
Published: Elsevier 2020
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7658719/
https://www.ncbi.nlm.nih.gov/pubmed/33210008
http://dx.doi.org/10.1016/j.heliyon.2020.e05435
author Amornbunchornvej, Chainarong
Surasvadi, Navaporn
Plangprasopchok, Anon
Thajchayapong, Suttipong
collection PubMed
description Given a dataset of careers and incomes, how large is the difference in income between any pair of careers? Given a dataset of travel-time records, how much longer does a trip take when choosing public transportation mode A instead of B? In this paper, we propose a framework that infers orders of categories as well as the magnitude of the difference in real values between each pair of categories, using estimation statistics. Our framework not only reports whether an order of categories exists, but also reports the magnitude of the difference between each consecutive pair of categories in the order. On large datasets, our framework scales well compared with existing frameworks. The proposed framework has been applied to two real-world case studies: 1) ordering careers by income for 350,000 households living in Khon Kaen province, Thailand, and 2) ordering sectors by closing prices for 1,060 companies in the NASDAQ stock market between 2000 and 2016. The career-ordering results demonstrate income inequality among different careers. The stock market results illustrate the dynamics of sector domination, which can change over time. Our approach can be applied in any research area that has category-real pairs. Our proposed Dominant-Distribution Network offers a novel way to gain insight when analyzing category orders. Software implementing this framework is available to researchers and practitioners as the R package EDOIF on CRAN.
format Online
Article
Text
id pubmed-7658719
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-7658719 2020-11-17 A nonparametric framework for inferring orders of categorical data from category-real pairs Amornbunchornvej, Chainarong Surasvadi, Navaporn Plangprasopchok, Anon Thajchayapong, Suttipong Heliyon Research Article
Elsevier 2020-11-06 /pmc/articles/PMC7658719/ /pubmed/33210008 http://dx.doi.org/10.1016/j.heliyon.2020.e05435 Text en © 2020 The Authors. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
title A nonparametric framework for inferring orders of categorical data from category-real pairs
topic Research Article