A Multimodal Analytics Platform for Journalists Analyzing Large-Scale, Heterogeneous Multilingual, and Multimedia Content

Bibliographic Details
Main Authors: Vrochidis, Stefanos; Moumtzidou, Anastasia; Gialampoukidis, Ilias; Liparas, Dimitris; Casamayor, Gerard; Wanner, Leo; Heise, Nicolaus; Wagner, Tilman; Bilous, Andriy; Jamin, Emmanuel; Simeonov, Boyan; Alexiev, Vladimir; Busch, Reinhard; Arapakis, Ioannis; Kompatsiaris, Ioannis
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2018
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7805659/
https://www.ncbi.nlm.nih.gov/pubmed/33501002
http://dx.doi.org/10.3389/frobt.2018.00123
Description
Summary: Analysts and journalists must deal with very large, heterogeneous, and multilingual data volumes that need to be analyzed, understood, and aggregated. An automated and simplified editorial and authoring process could significantly reduce time, labor, and costs. There is therefore a need for unified access to multilingual and multicultural news story material beyond the national level, ensuring context-aware, spatiotemporal, and semantic interpretation, and correlating and summarizing the interpreted material into a coherent gist. In this paper, we present a platform that integrates multimodal analytics techniques to support journalists in handling large streams of real-time and diverse information. Specifically, the platform automatically crawls and indexes multilingual and multimedia information from heterogeneous resources. Textual information is automatically summarized and can be translated on demand into the journalist's language. High-level information is extracted from both textual and multimedia content for fast inspection using concept clouds. The textual and multimedia content is semantically integrated and indexed using a common representation, so that it is accessible through a web-based search engine. The platform was evaluated by several groups of journalists, and the results indicated user satisfaction.