Cargando…

Text-Image-Video Summary Generation Using Joint Integer Linear Programming

Automatically generating a summary for asynchronous data can help users to keep up with the rapid growth of multi-modal information on the Internet. However, the current multi-modal systems usually generate summaries composed of text and images. In this paper, we propose a novel research problem of...

Descripción completa

Detalles Bibliográficos
Autores principales: Jangra, Anubhav, Jatowt, Adam, Hasanuzzaman, Mohammad, Saha, Sriparna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148046/
http://dx.doi.org/10.1007/978-3-030-45442-5_24
Descripción
Sumario:Automatically generating a summary for asynchronous data can help users to keep up with the rapid growth of multi-modal information on the Internet. However, the current multi-modal systems usually generate summaries composed of text and images. In this paper, we propose a novel research problem of text-image-video summary generation (TIVS). We first develop a multi-modal dataset containing text documents, images and videos. We then propose a novel joint integer linear programming multi-modal summarization (JILP-MMS) framework. We report the performance of our model on the developed dataset.