Cargando…

Text-Image-Video Summary Generation Using Joint Integer Linear Programming

Automatically generating a summary for asynchronous data can help users to keep up with the rapid growth of multi-modal information on the Internet. However, the current multi-modal systems usually generate summaries composed of text and images. In this paper, we propose a novel research problem of...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jangra, Anubhav, Jatowt, Adam, Hasanuzzaman, Mohammad, Saha, Sriparna
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148046/ http://dx.doi.org/10.1007/978-3-030-45442-5_24

Descripción
Sumario:	Automatically generating a summary for asynchronous data can help users to keep up with the rapid growth of multi-modal information on the Internet. However, the current multi-modal systems usually generate summaries composed of text and images. In this paper, we propose a novel research problem of text-image-video summary generation (TIVS). We first develop a multi-modal dataset containing text documents, images and videos. We then propose a novel joint integer linear programming multi-modal summarization (JILP-MMS) framework. We report the performance of our model on the developed dataset.

Text-Image-Video Summary Generation Using Joint Integer Linear Programming

Ejemplares similares