Cargando…

Location-Scale Matching for Approximate Quasi-Order Sampling

Quasi-orders are reflexive and transitive binary relations and have many applications. Examples are the dependencies of mastery among the problems of a psychological test, or methods such as item tree or Boolean analysis that mine for quasi-orders in empirical data. Data mining techniques are typica...

Descripción completa

Detalles Bibliográficos
Autores principales: Ünlü, Ali, Schrepp, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6573793/
https://www.ncbi.nlm.nih.gov/pubmed/31244703
http://dx.doi.org/10.3389/fpsyg.2019.01163
_version_ 1783427786222862336
author Ünlü, Ali
Schrepp, Martin
author_facet Ünlü, Ali
Schrepp, Martin
author_sort Ünlü, Ali
collection PubMed
description Quasi-orders are reflexive and transitive binary relations and have many applications. Examples are the dependencies of mastery among the problems of a psychological test, or methods such as item tree or Boolean analysis that mine for quasi-orders in empirical data. Data mining techniques are typically tested based on simulation studies with unbiased samples of randomly generated quasi-orders. In this paper, we develop techniques for the approximately representative sampling of quasi-orders. Polynomial regression curves are fitted for the mean and standard deviation of quasi-order size as a function of item number. The resulting regression graphs are seen to be quadratic and linear functions, respectively. The extrapolated values for the mean and standard deviation are used to propose two quasi-order sampling techniques. The discrete method matches these location and scale measures with a transformed discrete distribution directly obtained from the sample. The continuous method uses the normal density function with matched expectation and variance. The quasi-orders are constructed according to the biased randomized doubly inductive construction, however they are resampled to become approximately representative following the matched discrete and continuous distributions. In simulations, we investigate the usefulness of these methods. The location-scale matching approach can cope with very large item sets. Close to representative samples of random quasi-orders are constructed for item numbers up to n = 400.
format Online
Article
Text
id pubmed-6573793
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-65737932019-06-26 Location-Scale Matching for Approximate Quasi-Order Sampling Ünlü, Ali Schrepp, Martin Front Psychol Psychology Quasi-orders are reflexive and transitive binary relations and have many applications. Examples are the dependencies of mastery among the problems of a psychological test, or methods such as item tree or Boolean analysis that mine for quasi-orders in empirical data. Data mining techniques are typically tested based on simulation studies with unbiased samples of randomly generated quasi-orders. In this paper, we develop techniques for the approximately representative sampling of quasi-orders. Polynomial regression curves are fitted for the mean and standard deviation of quasi-order size as a function of item number. The resulting regression graphs are seen to be quadratic and linear functions, respectively. The extrapolated values for the mean and standard deviation are used to propose two quasi-order sampling techniques. The discrete method matches these location and scale measures with a transformed discrete distribution directly obtained from the sample. The continuous method uses the normal density function with matched expectation and variance. The quasi-orders are constructed according to the biased randomized doubly inductive construction, however they are resampled to become approximately representative following the matched discrete and continuous distributions. In simulations, we investigate the usefulness of these methods. The location-scale matching approach can cope with very large item sets. Close to representative samples of random quasi-orders are constructed for item numbers up to n = 400. Frontiers Media S.A. 2019-06-10 /pmc/articles/PMC6573793/ /pubmed/31244703 http://dx.doi.org/10.3389/fpsyg.2019.01163 Text en Copyright © 2019 Ünlü and Schrepp. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Ünlü, Ali
Schrepp, Martin
Location-Scale Matching for Approximate Quasi-Order Sampling
title Location-Scale Matching for Approximate Quasi-Order Sampling
title_full Location-Scale Matching for Approximate Quasi-Order Sampling
title_fullStr Location-Scale Matching for Approximate Quasi-Order Sampling
title_full_unstemmed Location-Scale Matching for Approximate Quasi-Order Sampling
title_short Location-Scale Matching for Approximate Quasi-Order Sampling
title_sort location-scale matching for approximate quasi-order sampling
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6573793/
https://www.ncbi.nlm.nih.gov/pubmed/31244703
http://dx.doi.org/10.3389/fpsyg.2019.01163
work_keys_str_mv AT unluali locationscalematchingforapproximatequasiordersampling
AT schreppmartin locationscalematchingforapproximatequasiordersampling