Cargando…

Document Liveness Challenge Dataset (DLC-2021)

Various government and commercial services, including, but not limited to, e-government, fintech, banking, and sharing economy services, widely use smartphones to simplify service access and user authorization. Many organizations involved in these areas use identity document analysis systems in orde...

Descripción completa

Detalles Bibliográficos
Autores principales: Polevoy, Dmitry V., Sigareva, Irina V., Ershova, Daria M., Arlazarov, Vladimir V., Nikolaev, Dmitry P., Ming, Zuheng, Luqman, Muhammad Muzzamil, Burie, Jean-Christophe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9323793/
https://www.ncbi.nlm.nih.gov/pubmed/35877624
http://dx.doi.org/10.3390/jimaging8070181
_version_ 1784756640910671872
author Polevoy, Dmitry V.
Sigareva, Irina V.
Ershova, Daria M.
Arlazarov, Vladimir V.
Nikolaev, Dmitry P.
Ming, Zuheng
Luqman, Muhammad Muzzamil
Burie, Jean-Christophe
author_facet Polevoy, Dmitry V.
Sigareva, Irina V.
Ershova, Daria M.
Arlazarov, Vladimir V.
Nikolaev, Dmitry P.
Ming, Zuheng
Luqman, Muhammad Muzzamil
Burie, Jean-Christophe
author_sort Polevoy, Dmitry V.
collection PubMed
description Various government and commercial services, including, but not limited to, e-government, fintech, banking, and sharing economy services, widely use smartphones to simplify service access and user authorization. Many organizations involved in these areas use identity document analysis systems in order to improve user personal-data-input processes. The tasks of such systems are not only ID document data recognition and extraction but also fraud prevention by detecting document forgery or by checking whether the document is genuine. Modern systems of this kind are often expected to operate in unconstrained environments. A significant amount of research has been published on the topic of mobile ID document analysis, but the main difficulty for such research is the lack of public datasets due to the fact that the subject is protected by security requirements. In this paper, we present the DLC-2021 dataset, which consists of 1424 video clips captured in a wide range of real-world conditions, focused on tasks relating to ID document forensics. The novelty of the dataset is that it contains shots from video with color laminated mock ID documents, color unlaminated copies, grayscale unlaminated copies, and screen recaptures of the documents. The proposed dataset complies with the GDPR because it contains images of synthetic IDs with generated owner photos and artificial personal information. For the presented dataset, benchmark baselines are provided for tasks such as screen recapture detection and glare detection. The data presented are openly available in Zenodo.
format Online
Article
Text
id pubmed-9323793
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93237932022-07-27 Document Liveness Challenge Dataset (DLC-2021) Polevoy, Dmitry V. Sigareva, Irina V. Ershova, Daria M. Arlazarov, Vladimir V. Nikolaev, Dmitry P. Ming, Zuheng Luqman, Muhammad Muzzamil Burie, Jean-Christophe J Imaging Article Various government and commercial services, including, but not limited to, e-government, fintech, banking, and sharing economy services, widely use smartphones to simplify service access and user authorization. Many organizations involved in these areas use identity document analysis systems in order to improve user personal-data-input processes. The tasks of such systems are not only ID document data recognition and extraction but also fraud prevention by detecting document forgery or by checking whether the document is genuine. Modern systems of this kind are often expected to operate in unconstrained environments. A significant amount of research has been published on the topic of mobile ID document analysis, but the main difficulty for such research is the lack of public datasets due to the fact that the subject is protected by security requirements. In this paper, we present the DLC-2021 dataset, which consists of 1424 video clips captured in a wide range of real-world conditions, focused on tasks relating to ID document forensics. The novelty of the dataset is that it contains shots from video with color laminated mock ID documents, color unlaminated copies, grayscale unlaminated copies, and screen recaptures of the documents. The proposed dataset complies with the GDPR because it contains images of synthetic IDs with generated owner photos and artificial personal information. For the presented dataset, benchmark baselines are provided for tasks such as screen recapture detection and glare detection. The data presented are openly available in Zenodo. MDPI 2022-06-28 /pmc/articles/PMC9323793/ /pubmed/35877624 http://dx.doi.org/10.3390/jimaging8070181 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Polevoy, Dmitry V.
Sigareva, Irina V.
Ershova, Daria M.
Arlazarov, Vladimir V.
Nikolaev, Dmitry P.
Ming, Zuheng
Luqman, Muhammad Muzzamil
Burie, Jean-Christophe
Document Liveness Challenge Dataset (DLC-2021)
title Document Liveness Challenge Dataset (DLC-2021)
title_full Document Liveness Challenge Dataset (DLC-2021)
title_fullStr Document Liveness Challenge Dataset (DLC-2021)
title_full_unstemmed Document Liveness Challenge Dataset (DLC-2021)
title_short Document Liveness Challenge Dataset (DLC-2021)
title_sort document liveness challenge dataset (dlc-2021)
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9323793/
https://www.ncbi.nlm.nih.gov/pubmed/35877624
http://dx.doi.org/10.3390/jimaging8070181
work_keys_str_mv AT polevoydmitryv documentlivenesschallengedatasetdlc2021
AT sigarevairinav documentlivenesschallengedatasetdlc2021
AT ershovadariam documentlivenesschallengedatasetdlc2021
AT arlazarovvladimirv documentlivenesschallengedatasetdlc2021
AT nikolaevdmitryp documentlivenesschallengedatasetdlc2021
AT mingzuheng documentlivenesschallengedatasetdlc2021
AT luqmanmuhammadmuzzamil documentlivenesschallengedatasetdlc2021
AT buriejeanchristophe documentlivenesschallengedatasetdlc2021