Cargando…

Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies

Binary classification has numerous applications. For one, lie detection methods typically aim to classify each tested person either as “liar” or as “truthteller” based on the given test results. To infer practical implications, as well as to compare different methods, it is essential to assess the d...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lukács, Gáspár, Specker, Eva
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2020
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7531802/ https://www.ncbi.nlm.nih.gov/pubmed/33007043 http://dx.doi.org/10.1371/journal.pone.0240259

_version_	1783589800144535552
author	Lukács, Gáspár Specker, Eva
author_facet	Lukács, Gáspár Specker, Eva
author_sort	Lukács, Gáspár
collection	PubMed
description	Binary classification has numerous applications. For one, lie detection methods typically aim to classify each tested person either as “liar” or as “truthteller” based on the given test results. To infer practical implications, as well as to compare different methods, it is essential to assess the diagnostic efficiency, such as demonstrating the number of correctly classified persons. However, this is not always straightforward. In Concealed Information Tests (CITs), the key predictor value (probe-irrelevant difference) for “truthtellers” is always similar (zero on average), and “liars” are always distinguished by a larger value (i.e., a larger number resulting from the CIT test, as compared to the zero baseline). Thereby, in general, the larger predictor values a given CIT method obtains for “liars” on average, the better this method is assumed to be. This has indeed been assumed in countless studies, and therefore, when comparing the classification efficiencies of two different designs, the mean difference of “liar” predictor values in the two designs were simply compared to each other (hence not collecting “truthteller” data to spare resources). We show, based on the meta-data of 12 different experimental designs collected in response time-based CIT studies, that differences in dispersion (i.e., variance in the data, e.g. the extent of random deviations from the zero average in case of “truthtellers”) can substantially influence classification efficiency–to the point that, in extreme cases, one design may even be superior in classification despite having a larger mean “liar” predictor value. However, we also introduce a computer simulation procedure to estimate classification efficiency in the absence of “truthteller” data, and validate this procedure via a meta-analysis comparing outcomes based on empirical data versus simulated data.
format	Online Article Text
id	pubmed-7531802
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-75318022020-10-08 Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies Lukács, Gáspár Specker, Eva PLoS One Research Article Binary classification has numerous applications. For one, lie detection methods typically aim to classify each tested person either as “liar” or as “truthteller” based on the given test results. To infer practical implications, as well as to compare different methods, it is essential to assess the diagnostic efficiency, such as demonstrating the number of correctly classified persons. However, this is not always straightforward. In Concealed Information Tests (CITs), the key predictor value (probe-irrelevant difference) for “truthtellers” is always similar (zero on average), and “liars” are always distinguished by a larger value (i.e., a larger number resulting from the CIT test, as compared to the zero baseline). Thereby, in general, the larger predictor values a given CIT method obtains for “liars” on average, the better this method is assumed to be. This has indeed been assumed in countless studies, and therefore, when comparing the classification efficiencies of two different designs, the mean difference of “liar” predictor values in the two designs were simply compared to each other (hence not collecting “truthteller” data to spare resources). We show, based on the meta-data of 12 different experimental designs collected in response time-based CIT studies, that differences in dispersion (i.e., variance in the data, e.g. the extent of random deviations from the zero average in case of “truthtellers”) can substantially influence classification efficiency–to the point that, in extreme cases, one design may even be superior in classification despite having a larger mean “liar” predictor value. However, we also introduce a computer simulation procedure to estimate classification efficiency in the absence of “truthteller” data, and validate this procedure via a meta-analysis comparing outcomes based on empirical data versus simulated data. Public Library of Science 2020-10-02 /pmc/articles/PMC7531802/ /pubmed/33007043 http://dx.doi.org/10.1371/journal.pone.0240259 Text en © 2020 Lukács, Specker http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Lukács, Gáspár Specker, Eva Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies
title	Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies
title_full	Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies
title_fullStr	Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies
title_full_unstemmed	Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies
title_short	Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies
title_sort	dispersion matters: diagnostics and control data computer simulation in concealed information test studies
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7531802/ https://www.ncbi.nlm.nih.gov/pubmed/33007043 http://dx.doi.org/10.1371/journal.pone.0240259
work_keys_str_mv	AT lukacsgaspar dispersionmattersdiagnosticsandcontroldatacomputersimulationinconcealedinformationteststudies AT speckereva dispersionmattersdiagnosticsandcontroldatacomputersimulationinconcealedinformationteststudies

Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies

Ejemplares similares