Cargando…

TIPS: A Framework for Text Summarising with Illustrative Pictures

We propose an algorithm to generate graphical summarising of longer text passages using a set of illustrative pictures (TIPS). TIPS is an algorithm using a voting process that uses results of individual “weak” algorithms. The proposed method includes a summarising algorithm that generates a digest o...

Descripción completa

Detalles Bibliográficos
Autores principales:	Golec, Justyna, Hachaj, Tomasz, Sokal, Grzegorz
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8700518/ https://www.ncbi.nlm.nih.gov/pubmed/34945920 http://dx.doi.org/10.3390/e23121614

_version_	1784620776302837760
author	Golec, Justyna Hachaj, Tomasz Sokal, Grzegorz
author_facet	Golec, Justyna Hachaj, Tomasz Sokal, Grzegorz
author_sort	Golec, Justyna
collection	PubMed
description	We propose an algorithm to generate graphical summarising of longer text passages using a set of illustrative pictures (TIPS). TIPS is an algorithm using a voting process that uses results of individual “weak” algorithms. The proposed method includes a summarising algorithm that generates a digest of the input document. Each sentence of the text summary is used as the input for further processing by the sentence transformer separately. A sentence transformer performs text embedding and a group of CLIP similarity-based algorithms trained on different image embedding finds semantic distances between images in the illustration image database and the input text. A voting process extracts the most matching images to the text. The TIPS algorithm allows the integration of the best (highest scored) results of the different recommendation algorithms by diminishing the influence of images that are a disjointed part of the recommendations of the component algorithms. TIPS returns a set of illustrative images that describe each sentence of the text summary. Three human judges found that the use of TIPS resulted in an increase in matching highly relevant images to text, ranging from 5% to 8% and images relevant to text ranging from 3% to 7% compared to the approach based on single-embedding schema.
format	Online Article Text
id	pubmed-8700518
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-87005182021-12-24 TIPS: A Framework for Text Summarising with Illustrative Pictures Golec, Justyna Hachaj, Tomasz Sokal, Grzegorz Entropy (Basel) Article We propose an algorithm to generate graphical summarising of longer text passages using a set of illustrative pictures (TIPS). TIPS is an algorithm using a voting process that uses results of individual “weak” algorithms. The proposed method includes a summarising algorithm that generates a digest of the input document. Each sentence of the text summary is used as the input for further processing by the sentence transformer separately. A sentence transformer performs text embedding and a group of CLIP similarity-based algorithms trained on different image embedding finds semantic distances between images in the illustration image database and the input text. A voting process extracts the most matching images to the text. The TIPS algorithm allows the integration of the best (highest scored) results of the different recommendation algorithms by diminishing the influence of images that are a disjointed part of the recommendations of the component algorithms. TIPS returns a set of illustrative images that describe each sentence of the text summary. Three human judges found that the use of TIPS resulted in an increase in matching highly relevant images to text, ranging from 5% to 8% and images relevant to text ranging from 3% to 7% compared to the approach based on single-embedding schema. MDPI 2021-11-30 /pmc/articles/PMC8700518/ /pubmed/34945920 http://dx.doi.org/10.3390/e23121614 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Golec, Justyna Hachaj, Tomasz Sokal, Grzegorz TIPS: A Framework for Text Summarising with Illustrative Pictures
title	TIPS: A Framework for Text Summarising with Illustrative Pictures
title_full	TIPS: A Framework for Text Summarising with Illustrative Pictures
title_fullStr	TIPS: A Framework for Text Summarising with Illustrative Pictures
title_full_unstemmed	TIPS: A Framework for Text Summarising with Illustrative Pictures
title_short	TIPS: A Framework for Text Summarising with Illustrative Pictures
title_sort	tips: a framework for text summarising with illustrative pictures
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8700518/ https://www.ncbi.nlm.nih.gov/pubmed/34945920 http://dx.doi.org/10.3390/e23121614
work_keys_str_mv	AT golecjustyna tipsaframeworkfortextsummarisingwithillustrativepictures AT hachajtomasz tipsaframeworkfortextsummarisingwithillustrativepictures AT sokalgrzegorz tipsaframeworkfortextsummarisingwithillustrativepictures

TIPS: A Framework for Text Summarising with Illustrative Pictures

Ejemplares similares