Cargando…
A nonparametric framework for inferring orders of categorical data from category-real pairs
Given a dataset of careers and incomes, how large a difference of incomes between any pair of careers would be? Given a dataset of travel time records, how long do we need to spend more when choosing a public transportation mode A instead of B to travel? In this paper, we propose a framework that is...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7658719/ https://www.ncbi.nlm.nih.gov/pubmed/33210008 http://dx.doi.org/10.1016/j.heliyon.2020.e05435 |
_version_ | 1783608731405123584 |
---|---|
author | Amornbunchornvej, Chainarong Surasvadi, Navaporn Plangprasopchok, Anon Thajchayapong, Suttipong |
author_facet | Amornbunchornvej, Chainarong Surasvadi, Navaporn Plangprasopchok, Anon Thajchayapong, Suttipong |
author_sort | Amornbunchornvej, Chainarong |
collection | PubMed |
description | Given a dataset of careers and incomes, how large a difference of incomes between any pair of careers would be? Given a dataset of travel time records, how long do we need to spend more when choosing a public transportation mode A instead of B to travel? In this paper, we propose a framework that is able to infer orders of categories as well as magnitudes of difference of real numbers between each pair of categories using an estimation statistics framework. Our framework not only reports whether an order of categories exists, but it also reports magnitudes of difference of each consecutive pair of categories in the order. In a large dataset, our framework is scalable well compared with existing frameworks. The proposed framework has been applied to two real-world case studies: 1) ordering careers by incomes from 350,000 households living in Khon Kaen province, Thailand, and 2) ordering sectors by closing prices from 1,060 companies in NASDAQ stock market between years 2000 and 2016. The results of careers ordering demonstrate income inequality among different careers. The stock market results illustrate dynamics of sector domination that can change over time. Our approach is able to be applied in any research area that has category-real pairs. Our proposed Dominant-Distribution Network provides a novel approach to gain new insight of analyzing category orders. A software of this framework is available for researchers or practitioners in an R CRAN package: EDOIF. |
format | Online Article Text |
id | pubmed-7658719 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-76587192020-11-17 A nonparametric framework for inferring orders of categorical data from category-real pairs Amornbunchornvej, Chainarong Surasvadi, Navaporn Plangprasopchok, Anon Thajchayapong, Suttipong Heliyon Research Article Given a dataset of careers and incomes, how large a difference of incomes between any pair of careers would be? Given a dataset of travel time records, how long do we need to spend more when choosing a public transportation mode A instead of B to travel? In this paper, we propose a framework that is able to infer orders of categories as well as magnitudes of difference of real numbers between each pair of categories using an estimation statistics framework. Our framework not only reports whether an order of categories exists, but it also reports magnitudes of difference of each consecutive pair of categories in the order. In a large dataset, our framework is scalable well compared with existing frameworks. The proposed framework has been applied to two real-world case studies: 1) ordering careers by incomes from 350,000 households living in Khon Kaen province, Thailand, and 2) ordering sectors by closing prices from 1,060 companies in NASDAQ stock market between years 2000 and 2016. The results of careers ordering demonstrate income inequality among different careers. The stock market results illustrate dynamics of sector domination that can change over time. Our approach is able to be applied in any research area that has category-real pairs. Our proposed Dominant-Distribution Network provides a novel approach to gain new insight of analyzing category orders. A software of this framework is available for researchers or practitioners in an R CRAN package: EDOIF. Elsevier 2020-11-06 /pmc/articles/PMC7658719/ /pubmed/33210008 http://dx.doi.org/10.1016/j.heliyon.2020.e05435 Text en © 2020 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Research Article Amornbunchornvej, Chainarong Surasvadi, Navaporn Plangprasopchok, Anon Thajchayapong, Suttipong A nonparametric framework for inferring orders of categorical data from category-real pairs |
title | A nonparametric framework for inferring orders of categorical data from category-real pairs |
title_full | A nonparametric framework for inferring orders of categorical data from category-real pairs |
title_fullStr | A nonparametric framework for inferring orders of categorical data from category-real pairs |
title_full_unstemmed | A nonparametric framework for inferring orders of categorical data from category-real pairs |
title_short | A nonparametric framework for inferring orders of categorical data from category-real pairs |
title_sort | nonparametric framework for inferring orders of categorical data from category-real pairs |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7658719/ https://www.ncbi.nlm.nih.gov/pubmed/33210008 http://dx.doi.org/10.1016/j.heliyon.2020.e05435 |
work_keys_str_mv | AT amornbunchornvejchainarong anonparametricframeworkforinferringordersofcategoricaldatafromcategoryrealpairs AT surasvadinavaporn anonparametricframeworkforinferringordersofcategoricaldatafromcategoryrealpairs AT plangprasopchokanon anonparametricframeworkforinferringordersofcategoricaldatafromcategoryrealpairs AT thajchayapongsuttipong anonparametricframeworkforinferringordersofcategoricaldatafromcategoryrealpairs AT amornbunchornvejchainarong nonparametricframeworkforinferringordersofcategoricaldatafromcategoryrealpairs AT surasvadinavaporn nonparametricframeworkforinferringordersofcategoricaldatafromcategoryrealpairs AT plangprasopchokanon nonparametricframeworkforinferringordersofcategoricaldatafromcategoryrealpairs AT thajchayapongsuttipong nonparametricframeworkforinferringordersofcategoricaldatafromcategoryrealpairs |