Cargando…

Affective Video Retrieval: Violence Detection in Hollywood Movies by Large-Scale Segmental Feature Extraction

Without doubt general video and sound, as found in large multimedia archives, carry emotional information. Thus, audio and video retrieval by certain emotional categories or dimensions could play a central role for tomorrow's intelligent systems, enabling search for movies with a particular moo...

Descripción completa

Detalles Bibliográficos
Autores principales: Eyben, Florian, Weninger, Felix, Lehment, Nicolas, Schuller, Björn, Rigoll, Gerhard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3876976/
https://www.ncbi.nlm.nih.gov/pubmed/24391704
http://dx.doi.org/10.1371/journal.pone.0078506
_version_ 1782297563496448000
author Eyben, Florian
Weninger, Felix
Lehment, Nicolas
Schuller, Björn
Rigoll, Gerhard
author_facet Eyben, Florian
Weninger, Felix
Lehment, Nicolas
Schuller, Björn
Rigoll, Gerhard
author_sort Eyben, Florian
collection PubMed
description Without doubt general video and sound, as found in large multimedia archives, carry emotional information. Thus, audio and video retrieval by certain emotional categories or dimensions could play a central role for tomorrow's intelligent systems, enabling search for movies with a particular mood, computer aided scene and sound design in order to elicit certain emotions in the audience, etc. Yet, the lion's share of research in affective computing is exclusively focusing on signals conveyed by humans, such as affective speech. Uniting the fields of multimedia retrieval and affective computing is believed to lend to a multiplicity of interesting retrieval applications, and at the same time to benefit affective computing research, by moving its methodology “out of the lab” to real-world, diverse data. In this contribution, we address the problem of finding “disturbing” scenes in movies, a scenario that is highly relevant for computer-aided parental guidance. We apply large-scale segmental feature extraction combined with audio-visual classification to the particular task of detecting violence. Our system performs fully data-driven analysis including automatic segmentation. We evaluate the system in terms of mean average precision (MAP) on the official data set of the MediaEval 2012 evaluation campaign's Affect Task, which consists of 18 original Hollywood movies, achieving up to .398 MAP on unseen test data in full realism. An in-depth analysis of the worth of individual features with respect to the target class and the system errors is carried out and reveals the importance of peak-related audio feature extraction and low-level histogram-based video analysis.
format Online
Article
Text
id pubmed-3876976
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38769762014-01-03 Affective Video Retrieval: Violence Detection in Hollywood Movies by Large-Scale Segmental Feature Extraction Eyben, Florian Weninger, Felix Lehment, Nicolas Schuller, Björn Rigoll, Gerhard PLoS One Research Article Without doubt general video and sound, as found in large multimedia archives, carry emotional information. Thus, audio and video retrieval by certain emotional categories or dimensions could play a central role for tomorrow's intelligent systems, enabling search for movies with a particular mood, computer aided scene and sound design in order to elicit certain emotions in the audience, etc. Yet, the lion's share of research in affective computing is exclusively focusing on signals conveyed by humans, such as affective speech. Uniting the fields of multimedia retrieval and affective computing is believed to lend to a multiplicity of interesting retrieval applications, and at the same time to benefit affective computing research, by moving its methodology “out of the lab” to real-world, diverse data. In this contribution, we address the problem of finding “disturbing” scenes in movies, a scenario that is highly relevant for computer-aided parental guidance. We apply large-scale segmental feature extraction combined with audio-visual classification to the particular task of detecting violence. Our system performs fully data-driven analysis including automatic segmentation. We evaluate the system in terms of mean average precision (MAP) on the official data set of the MediaEval 2012 evaluation campaign's Affect Task, which consists of 18 original Hollywood movies, achieving up to .398 MAP on unseen test data in full realism. An in-depth analysis of the worth of individual features with respect to the target class and the system errors is carried out and reveals the importance of peak-related audio feature extraction and low-level histogram-based video analysis. Public Library of Science 2013-12-31 /pmc/articles/PMC3876976/ /pubmed/24391704 http://dx.doi.org/10.1371/journal.pone.0078506 Text en © 2013 Eyben et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Eyben, Florian
Weninger, Felix
Lehment, Nicolas
Schuller, Björn
Rigoll, Gerhard
Affective Video Retrieval: Violence Detection in Hollywood Movies by Large-Scale Segmental Feature Extraction
title Affective Video Retrieval: Violence Detection in Hollywood Movies by Large-Scale Segmental Feature Extraction
title_full Affective Video Retrieval: Violence Detection in Hollywood Movies by Large-Scale Segmental Feature Extraction
title_fullStr Affective Video Retrieval: Violence Detection in Hollywood Movies by Large-Scale Segmental Feature Extraction
title_full_unstemmed Affective Video Retrieval: Violence Detection in Hollywood Movies by Large-Scale Segmental Feature Extraction
title_short Affective Video Retrieval: Violence Detection in Hollywood Movies by Large-Scale Segmental Feature Extraction
title_sort affective video retrieval: violence detection in hollywood movies by large-scale segmental feature extraction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3876976/
https://www.ncbi.nlm.nih.gov/pubmed/24391704
http://dx.doi.org/10.1371/journal.pone.0078506
work_keys_str_mv AT eybenflorian affectivevideoretrievalviolencedetectioninhollywoodmoviesbylargescalesegmentalfeatureextraction
AT weningerfelix affectivevideoretrievalviolencedetectioninhollywoodmoviesbylargescalesegmentalfeatureextraction
AT lehmentnicolas affectivevideoretrievalviolencedetectioninhollywoodmoviesbylargescalesegmentalfeatureextraction
AT schullerbjorn affectivevideoretrievalviolencedetectioninhollywoodmoviesbylargescalesegmentalfeatureextraction
AT rigollgerhard affectivevideoretrievalviolencedetectioninhollywoodmoviesbylargescalesegmentalfeatureextraction