Cargando…
On the shape of timings distributions in free-text keystroke dynamics profiles
Keystroke dynamics is a soft biometric trait. Although the shape of the timing distributions in keystroke dynamics profiles is a central element for the accurate modeling of the behavioral patterns of the user, a simplified approach has been to presuppose normality. Careful consideration of the indi...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8606350/ https://www.ncbi.nlm.nih.gov/pubmed/34841114 http://dx.doi.org/10.1016/j.heliyon.2021.e08413 |
_version_ | 1784602326257893376 |
---|---|
author | González, Nahuel Calot, Enrique P. Ierache, Jorge S. Hasperué, Waldo |
author_facet | González, Nahuel Calot, Enrique P. Ierache, Jorge S. Hasperué, Waldo |
author_sort | González, Nahuel |
collection | PubMed |
description | Keystroke dynamics is a soft biometric trait. Although the shape of the timing distributions in keystroke dynamics profiles is a central element for the accurate modeling of the behavioral patterns of the user, a simplified approach has been to presuppose normality. Careful consideration of the individual shapes for the timing models could lead to improvements in the error rates of current methods or possibly inspire new ones. The main objective of this study is to compare several heavy-tailed and positively skewed candidate distributions in order to rank them according to their merit for fitting timing histograms in keystroke dynamics profiles. Results are summarized in three ways: counting how many times each candidate distribution provides the best fit and ranking them in order of success, measuring average information content, and ranking candidate distributions according to the frequency of hypothesis rejection with an Anderson-Darling goodness of fit test. Seven distributions with two parameters and seven with three were evaluated against three publicly available free-text keystroke dynamics datasets. The results confirm the established use in the research community of the log-normal distribution, in its two- and three-parameter variations, as excellent choices for modeling the shape of timings histograms in keystroke dynamics profiles. However, the log-logistic distribution emerges as a clear winner among all two- and three-parameter candidates, consistently surpassing the log-normal and all the rest under the three evaluation criteria for both hold and flight times. |
format | Online Article Text |
id | pubmed-8606350 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-86063502021-11-26 On the shape of timings distributions in free-text keystroke dynamics profiles González, Nahuel Calot, Enrique P. Ierache, Jorge S. Hasperué, Waldo Heliyon Research Article Keystroke dynamics is a soft biometric trait. Although the shape of the timing distributions in keystroke dynamics profiles is a central element for the accurate modeling of the behavioral patterns of the user, a simplified approach has been to presuppose normality. Careful consideration of the individual shapes for the timing models could lead to improvements in the error rates of current methods or possibly inspire new ones. The main objective of this study is to compare several heavy-tailed and positively skewed candidate distributions in order to rank them according to their merit for fitting timing histograms in keystroke dynamics profiles. Results are summarized in three ways: counting how many times each candidate distribution provides the best fit and ranking them in order of success, measuring average information content, and ranking candidate distributions according to the frequency of hypothesis rejection with an Anderson-Darling goodness of fit test. Seven distributions with two parameters and seven with three were evaluated against three publicly available free-text keystroke dynamics datasets. The results confirm the established use in the research community of the log-normal distribution, in its two- and three-parameter variations, as excellent choices for modeling the shape of timings histograms in keystroke dynamics profiles. However, the log-logistic distribution emerges as a clear winner among all two- and three-parameter candidates, consistently surpassing the log-normal and all the rest under the three evaluation criteria for both hold and flight times. Elsevier 2021-11-17 /pmc/articles/PMC8606350/ /pubmed/34841114 http://dx.doi.org/10.1016/j.heliyon.2021.e08413 Text en © 2021 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article González, Nahuel Calot, Enrique P. Ierache, Jorge S. Hasperué, Waldo On the shape of timings distributions in free-text keystroke dynamics profiles |
title | On the shape of timings distributions in free-text keystroke dynamics profiles |
title_full | On the shape of timings distributions in free-text keystroke dynamics profiles |
title_fullStr | On the shape of timings distributions in free-text keystroke dynamics profiles |
title_full_unstemmed | On the shape of timings distributions in free-text keystroke dynamics profiles |
title_short | On the shape of timings distributions in free-text keystroke dynamics profiles |
title_sort | on the shape of timings distributions in free-text keystroke dynamics profiles |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8606350/ https://www.ncbi.nlm.nih.gov/pubmed/34841114 http://dx.doi.org/10.1016/j.heliyon.2021.e08413 |
work_keys_str_mv | AT gonzaleznahuel ontheshapeoftimingsdistributionsinfreetextkeystrokedynamicsprofiles AT calotenriquep ontheshapeoftimingsdistributionsinfreetextkeystrokedynamicsprofiles AT ierachejorges ontheshapeoftimingsdistributionsinfreetextkeystrokedynamicsprofiles AT hasperuewaldo ontheshapeoftimingsdistributionsinfreetextkeystrokedynamicsprofiles |