Cargando…
CNN-Based Multi-Modal Camera Model Identification on Video Sequences
Identifying the source camera of images and videos has gained significant importance in multimedia forensics. It allows tracing back data to their creator, thus enabling to solve copyright infringement cases and expose the authors of hideous crimes. In this paper, we focus on the problem of camera m...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8404940/ https://www.ncbi.nlm.nih.gov/pubmed/34460771 http://dx.doi.org/10.3390/jimaging7080135 |
_version_ | 1783746238609358848 |
---|---|
author | Dal Cortivo, Davide Mandelli, Sara Bestagini, Paolo Tubaro, Stefano |
author_facet | Dal Cortivo, Davide Mandelli, Sara Bestagini, Paolo Tubaro, Stefano |
author_sort | Dal Cortivo, Davide |
collection | PubMed |
description | Identifying the source camera of images and videos has gained significant importance in multimedia forensics. It allows tracing back data to their creator, thus enabling to solve copyright infringement cases and expose the authors of hideous crimes. In this paper, we focus on the problem of camera model identification for video sequences, that is, given a video under analysis, detecting the camera model used for its acquisition. To this purpose, we develop two different CNN-based camera model identification methods, working in a novel multi-modal scenario. Differently from mono-modal methods, which use only the visual or audio information from the investigated video to tackle the identification task, the proposed multi-modal methods jointly exploit audio and visual information. We test our proposed methodologies on the well-known Vision dataset, which collects almost 2000 video sequences belonging to different devices. Experiments are performed, considering native videos directly acquired by their acquisition devices and videos uploaded on social media platforms, such as YouTube and WhatsApp. The achieved results show that the proposed multi-modal approaches significantly outperform their mono-modal counterparts, representing a valuable strategy for the tackled problem and opening future research to even more challenging scenarios. |
format | Online Article Text |
id | pubmed-8404940 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-84049402021-10-28 CNN-Based Multi-Modal Camera Model Identification on Video Sequences Dal Cortivo, Davide Mandelli, Sara Bestagini, Paolo Tubaro, Stefano J Imaging Article Identifying the source camera of images and videos has gained significant importance in multimedia forensics. It allows tracing back data to their creator, thus enabling to solve copyright infringement cases and expose the authors of hideous crimes. In this paper, we focus on the problem of camera model identification for video sequences, that is, given a video under analysis, detecting the camera model used for its acquisition. To this purpose, we develop two different CNN-based camera model identification methods, working in a novel multi-modal scenario. Differently from mono-modal methods, which use only the visual or audio information from the investigated video to tackle the identification task, the proposed multi-modal methods jointly exploit audio and visual information. We test our proposed methodologies on the well-known Vision dataset, which collects almost 2000 video sequences belonging to different devices. Experiments are performed, considering native videos directly acquired by their acquisition devices and videos uploaded on social media platforms, such as YouTube and WhatsApp. The achieved results show that the proposed multi-modal approaches significantly outperform their mono-modal counterparts, representing a valuable strategy for the tackled problem and opening future research to even more challenging scenarios. MDPI 2021-08-05 /pmc/articles/PMC8404940/ /pubmed/34460771 http://dx.doi.org/10.3390/jimaging7080135 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Dal Cortivo, Davide Mandelli, Sara Bestagini, Paolo Tubaro, Stefano CNN-Based Multi-Modal Camera Model Identification on Video Sequences |
title | CNN-Based Multi-Modal Camera Model Identification on Video Sequences |
title_full | CNN-Based Multi-Modal Camera Model Identification on Video Sequences |
title_fullStr | CNN-Based Multi-Modal Camera Model Identification on Video Sequences |
title_full_unstemmed | CNN-Based Multi-Modal Camera Model Identification on Video Sequences |
title_short | CNN-Based Multi-Modal Camera Model Identification on Video Sequences |
title_sort | cnn-based multi-modal camera model identification on video sequences |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8404940/ https://www.ncbi.nlm.nih.gov/pubmed/34460771 http://dx.doi.org/10.3390/jimaging7080135 |
work_keys_str_mv | AT dalcortivodavide cnnbasedmultimodalcameramodelidentificationonvideosequences AT mandellisara cnnbasedmultimodalcameramodelidentificationonvideosequences AT bestaginipaolo cnnbasedmultimodalcameramodelidentificationonvideosequences AT tubarostefano cnnbasedmultimodalcameramodelidentificationonvideosequences |