Cargando…

Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies

Binary classification has numerous applications. For one, lie detection methods typically aim to classify each tested person either as “liar” or as “truthteller” based on the given test results. To infer practical implications, as well as to compare different methods, it is essential to assess the d...

Descripción completa

Detalles Bibliográficos
Autores principales: Lukács, Gáspár, Specker, Eva
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7531802/
https://www.ncbi.nlm.nih.gov/pubmed/33007043
http://dx.doi.org/10.1371/journal.pone.0240259
_version_ 1783589800144535552
author Lukács, Gáspár
Specker, Eva
author_facet Lukács, Gáspár
Specker, Eva
author_sort Lukács, Gáspár
collection PubMed
description Binary classification has numerous applications. For one, lie detection methods typically aim to classify each tested person either as “liar” or as “truthteller” based on the given test results. To infer practical implications, as well as to compare different methods, it is essential to assess the diagnostic efficiency, such as demonstrating the number of correctly classified persons. However, this is not always straightforward. In Concealed Information Tests (CITs), the key predictor value (probe-irrelevant difference) for “truthtellers” is always similar (zero on average), and “liars” are always distinguished by a larger value (i.e., a larger number resulting from the CIT test, as compared to the zero baseline). Thereby, in general, the larger predictor values a given CIT method obtains for “liars” on average, the better this method is assumed to be. This has indeed been assumed in countless studies, and therefore, when comparing the classification efficiencies of two different designs, the mean difference of “liar” predictor values in the two designs were simply compared to each other (hence not collecting “truthteller” data to spare resources). We show, based on the meta-data of 12 different experimental designs collected in response time-based CIT studies, that differences in dispersion (i.e., variance in the data, e.g. the extent of random deviations from the zero average in case of “truthtellers”) can substantially influence classification efficiency–to the point that, in extreme cases, one design may even be superior in classification despite having a larger mean “liar” predictor value. However, we also introduce a computer simulation procedure to estimate classification efficiency in the absence of “truthteller” data, and validate this procedure via a meta-analysis comparing outcomes based on empirical data versus simulated data.
format Online
Article
Text
id pubmed-7531802
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-75318022020-10-08 Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies Lukács, Gáspár Specker, Eva PLoS One Research Article Binary classification has numerous applications. For one, lie detection methods typically aim to classify each tested person either as “liar” or as “truthteller” based on the given test results. To infer practical implications, as well as to compare different methods, it is essential to assess the diagnostic efficiency, such as demonstrating the number of correctly classified persons. However, this is not always straightforward. In Concealed Information Tests (CITs), the key predictor value (probe-irrelevant difference) for “truthtellers” is always similar (zero on average), and “liars” are always distinguished by a larger value (i.e., a larger number resulting from the CIT test, as compared to the zero baseline). Thereby, in general, the larger predictor values a given CIT method obtains for “liars” on average, the better this method is assumed to be. This has indeed been assumed in countless studies, and therefore, when comparing the classification efficiencies of two different designs, the mean difference of “liar” predictor values in the two designs were simply compared to each other (hence not collecting “truthteller” data to spare resources). We show, based on the meta-data of 12 different experimental designs collected in response time-based CIT studies, that differences in dispersion (i.e., variance in the data, e.g. the extent of random deviations from the zero average in case of “truthtellers”) can substantially influence classification efficiency–to the point that, in extreme cases, one design may even be superior in classification despite having a larger mean “liar” predictor value. However, we also introduce a computer simulation procedure to estimate classification efficiency in the absence of “truthteller” data, and validate this procedure via a meta-analysis comparing outcomes based on empirical data versus simulated data. Public Library of Science 2020-10-02 /pmc/articles/PMC7531802/ /pubmed/33007043 http://dx.doi.org/10.1371/journal.pone.0240259 Text en © 2020 Lukács, Specker http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lukács, Gáspár
Specker, Eva
Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies
title Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies
title_full Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies
title_fullStr Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies
title_full_unstemmed Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies
title_short Dispersion matters: Diagnostics and control data computer simulation in Concealed Information Test studies
title_sort dispersion matters: diagnostics and control data computer simulation in concealed information test studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7531802/
https://www.ncbi.nlm.nih.gov/pubmed/33007043
http://dx.doi.org/10.1371/journal.pone.0240259
work_keys_str_mv AT lukacsgaspar dispersionmattersdiagnosticsandcontroldatacomputersimulationinconcealedinformationteststudies
AT speckereva dispersionmattersdiagnosticsandcontroldatacomputersimulationinconcealedinformationteststudies