Cargando…

Using cascade CNN-LSTM-FCNs to identify AI-altered video based on eye state sequence

Deep learning is notably successful in data analysis, computer vision, and human control. Nevertheless, this approach has inevitably allowed the development of DeepFake video sequences and images that could be altered so that the changes are not easily or explicitly detectable. Such alterations have...

Descripción completa

Detalles Bibliográficos
Autores principales:	Saealal, Muhammad Salihin, Ibrahim, Mohd Zamri, Mulvaney, David. J., Shapiai, Mohd Ibrahim, Fadilah, Norasyikin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9754287/ https://www.ncbi.nlm.nih.gov/pubmed/36520851 http://dx.doi.org/10.1371/journal.pone.0278989

_version_	1784851154677530624
author	Saealal, Muhammad Salihin Ibrahim, Mohd Zamri Mulvaney, David. J. Shapiai, Mohd Ibrahim Fadilah, Norasyikin
author_facet	Saealal, Muhammad Salihin Ibrahim, Mohd Zamri Mulvaney, David. J. Shapiai, Mohd Ibrahim Fadilah, Norasyikin
author_sort	Saealal, Muhammad Salihin
collection	PubMed
description	Deep learning is notably successful in data analysis, computer vision, and human control. Nevertheless, this approach has inevitably allowed the development of DeepFake video sequences and images that could be altered so that the changes are not easily or explicitly detectable. Such alterations have been recently used to spread false news or disinformation. This study aims to identify Deepfaked videos and images and alert viewers to the possible falsity of the information. The current work presented a novel means of revealing fake face videos by cascading the convolution network with recurrent neural networks and fully connected network (FCN) models. The system detection approach utilizes the eye-blinking state in temporal video frames. Notwithstanding, it is deemed challenging to precisely depict (i) artificiality in fake videos and (ii) spatial information within the individual frame through this physiological signal. Spatial features were extracted using the VGG16 network and trained with the ImageNet dataset. The temporal features were then extracted in every 20 sequences through the LSTM network. On another note, the pre-processed eye-blinking state served as a probability to generate a novel BPD dataset. This newly-acquired dataset was fed to three models for training purposes with each entailing four, three, and six hidden layers, respectively. Every model constitutes a unique architecture and specific dropout value. Resultantly, the model optimally and accurately identified tampered videos within the dataset. The study model was assessed using the current BPD dataset based on one of the most complex datasets (FaceForensic++) with 90.8% accuracy. Such precision was successfully maintained in datasets that were not used in the training process. The training process was also accelerated by lowering the computation prerequisites.
format	Online Article Text
id	pubmed-9754287
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-97542872022-12-16 Using cascade CNN-LSTM-FCNs to identify AI-altered video based on eye state sequence Saealal, Muhammad Salihin Ibrahim, Mohd Zamri Mulvaney, David. J. Shapiai, Mohd Ibrahim Fadilah, Norasyikin PLoS One Research Article Deep learning is notably successful in data analysis, computer vision, and human control. Nevertheless, this approach has inevitably allowed the development of DeepFake video sequences and images that could be altered so that the changes are not easily or explicitly detectable. Such alterations have been recently used to spread false news or disinformation. This study aims to identify Deepfaked videos and images and alert viewers to the possible falsity of the information. The current work presented a novel means of revealing fake face videos by cascading the convolution network with recurrent neural networks and fully connected network (FCN) models. The system detection approach utilizes the eye-blinking state in temporal video frames. Notwithstanding, it is deemed challenging to precisely depict (i) artificiality in fake videos and (ii) spatial information within the individual frame through this physiological signal. Spatial features were extracted using the VGG16 network and trained with the ImageNet dataset. The temporal features were then extracted in every 20 sequences through the LSTM network. On another note, the pre-processed eye-blinking state served as a probability to generate a novel BPD dataset. This newly-acquired dataset was fed to three models for training purposes with each entailing four, three, and six hidden layers, respectively. Every model constitutes a unique architecture and specific dropout value. Resultantly, the model optimally and accurately identified tampered videos within the dataset. The study model was assessed using the current BPD dataset based on one of the most complex datasets (FaceForensic++) with 90.8% accuracy. Such precision was successfully maintained in datasets that were not used in the training process. The training process was also accelerated by lowering the computation prerequisites. Public Library of Science 2022-12-15 /pmc/articles/PMC9754287/ /pubmed/36520851 http://dx.doi.org/10.1371/journal.pone.0278989 Text en © 2022 Saealal et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Saealal, Muhammad Salihin Ibrahim, Mohd Zamri Mulvaney, David. J. Shapiai, Mohd Ibrahim Fadilah, Norasyikin Using cascade CNN-LSTM-FCNs to identify AI-altered video based on eye state sequence
title	Using cascade CNN-LSTM-FCNs to identify AI-altered video based on eye state sequence
title_full	Using cascade CNN-LSTM-FCNs to identify AI-altered video based on eye state sequence
title_fullStr	Using cascade CNN-LSTM-FCNs to identify AI-altered video based on eye state sequence
title_full_unstemmed	Using cascade CNN-LSTM-FCNs to identify AI-altered video based on eye state sequence
title_short	Using cascade CNN-LSTM-FCNs to identify AI-altered video based on eye state sequence
title_sort	using cascade cnn-lstm-fcns to identify ai-altered video based on eye state sequence
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9754287/ https://www.ncbi.nlm.nih.gov/pubmed/36520851 http://dx.doi.org/10.1371/journal.pone.0278989
work_keys_str_mv	AT saealalmuhammadsalihin usingcascadecnnlstmfcnstoidentifyaialteredvideobasedoneyestatesequence AT ibrahimmohdzamri usingcascadecnnlstmfcnstoidentifyaialteredvideobasedoneyestatesequence AT mulvaneydavidj usingcascadecnnlstmfcnstoidentifyaialteredvideobasedoneyestatesequence AT shapiaimohdibrahim usingcascadecnnlstmfcnstoidentifyaialteredvideobasedoneyestatesequence AT fadilahnorasyikin usingcascadecnnlstmfcnstoidentifyaialteredvideobasedoneyestatesequence

Using cascade CNN-LSTM-FCNs to identify AI-altered video based on eye state sequence

Ejemplares similares