Cargando…

Qualitative Analysis of Text Summarization Techniques and Its Applications in Health Domain

For the better utilization of the enormous amount of data available to us on the Internet and in different archives, summarization is a valuable method. Manual summarization by experts is an almost impossible and time-consuming activity. People could not access, read, or use such a big pile of infor...

Descripción completa

Detalles Bibliográficos
Autores principales: Yadav, Divakar, Lalit, Naman, Kaushik, Riya, Singh, Yogendra, Mohit, Dinesh, Yadav, Arun Kr., Bhadane, Kishor V., Kumar, Adarsh, Khan, Baseem
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8849812/
https://www.ncbi.nlm.nih.gov/pubmed/35186058
http://dx.doi.org/10.1155/2022/3411881
_version_ 1784652499654803456
author Yadav, Divakar
Lalit, Naman
Kaushik, Riya
Singh, Yogendra
Mohit,
Dinesh,
Yadav, Arun Kr.
Bhadane, Kishor V.
Kumar, Adarsh
Khan, Baseem
author_facet Yadav, Divakar
Lalit, Naman
Kaushik, Riya
Singh, Yogendra
Mohit,
Dinesh,
Yadav, Arun Kr.
Bhadane, Kishor V.
Kumar, Adarsh
Khan, Baseem
author_sort Yadav, Divakar
collection PubMed
description For the better utilization of the enormous amount of data available to us on the Internet and in different archives, summarization is a valuable method. Manual summarization by experts is an almost impossible and time-consuming activity. People could not access, read, or use such a big pile of information for their needs. Therefore, summary generation is essential and beneficial in the current scenario. This paper presents an efficient qualitative analysis of the different algorithms used for text summarization. We implemented five different algorithms, namely, term frequency-inverse document frequency (TF-IDF), LexRank, TextRank, BertSum, and PEGASUS, for a summary generation. These algorithms are chosen based on various factors. After reviewing the state-of-the-art literature, it generates good summaries results. The performance of these algorithms is compared on two different datasets, i.e., Reddit-TIFU and MultiNews, and their results are measured using Recall-Oriented Understudy for Gisting Evaluation (ROUGE) measure to perform analysis to decide the best algorithm among these and generate the summary. After performing a qualitative analysis of the above algorithms, we observe that for both the datasets, i.e., Reddit-TIFU and MultiNews, PEGASUS had the best average F-score for abstractive text summarization and TextRank algorithms for extractive text summarization, with a better average F-score.
format Online
Article
Text
id pubmed-8849812
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-88498122022-02-17 Qualitative Analysis of Text Summarization Techniques and Its Applications in Health Domain Yadav, Divakar Lalit, Naman Kaushik, Riya Singh, Yogendra Mohit, Dinesh, Yadav, Arun Kr. Bhadane, Kishor V. Kumar, Adarsh Khan, Baseem Comput Intell Neurosci Research Article For the better utilization of the enormous amount of data available to us on the Internet and in different archives, summarization is a valuable method. Manual summarization by experts is an almost impossible and time-consuming activity. People could not access, read, or use such a big pile of information for their needs. Therefore, summary generation is essential and beneficial in the current scenario. This paper presents an efficient qualitative analysis of the different algorithms used for text summarization. We implemented five different algorithms, namely, term frequency-inverse document frequency (TF-IDF), LexRank, TextRank, BertSum, and PEGASUS, for a summary generation. These algorithms are chosen based on various factors. After reviewing the state-of-the-art literature, it generates good summaries results. The performance of these algorithms is compared on two different datasets, i.e., Reddit-TIFU and MultiNews, and their results are measured using Recall-Oriented Understudy for Gisting Evaluation (ROUGE) measure to perform analysis to decide the best algorithm among these and generate the summary. After performing a qualitative analysis of the above algorithms, we observe that for both the datasets, i.e., Reddit-TIFU and MultiNews, PEGASUS had the best average F-score for abstractive text summarization and TextRank algorithms for extractive text summarization, with a better average F-score. Hindawi 2022-02-09 /pmc/articles/PMC8849812/ /pubmed/35186058 http://dx.doi.org/10.1155/2022/3411881 Text en Copyright © 2022 Divakar Yadav et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Yadav, Divakar
Lalit, Naman
Kaushik, Riya
Singh, Yogendra
Mohit,
Dinesh,
Yadav, Arun Kr.
Bhadane, Kishor V.
Kumar, Adarsh
Khan, Baseem
Qualitative Analysis of Text Summarization Techniques and Its Applications in Health Domain
title Qualitative Analysis of Text Summarization Techniques and Its Applications in Health Domain
title_full Qualitative Analysis of Text Summarization Techniques and Its Applications in Health Domain
title_fullStr Qualitative Analysis of Text Summarization Techniques and Its Applications in Health Domain
title_full_unstemmed Qualitative Analysis of Text Summarization Techniques and Its Applications in Health Domain
title_short Qualitative Analysis of Text Summarization Techniques and Its Applications in Health Domain
title_sort qualitative analysis of text summarization techniques and its applications in health domain
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8849812/
https://www.ncbi.nlm.nih.gov/pubmed/35186058
http://dx.doi.org/10.1155/2022/3411881
work_keys_str_mv AT yadavdivakar qualitativeanalysisoftextsummarizationtechniquesanditsapplicationsinhealthdomain
AT lalitnaman qualitativeanalysisoftextsummarizationtechniquesanditsapplicationsinhealthdomain
AT kaushikriya qualitativeanalysisoftextsummarizationtechniquesanditsapplicationsinhealthdomain
AT singhyogendra qualitativeanalysisoftextsummarizationtechniquesanditsapplicationsinhealthdomain
AT mohit qualitativeanalysisoftextsummarizationtechniquesanditsapplicationsinhealthdomain
AT dinesh qualitativeanalysisoftextsummarizationtechniquesanditsapplicationsinhealthdomain
AT yadavarunkr qualitativeanalysisoftextsummarizationtechniquesanditsapplicationsinhealthdomain
AT bhadanekishorv qualitativeanalysisoftextsummarizationtechniquesanditsapplicationsinhealthdomain
AT kumaradarsh qualitativeanalysisoftextsummarizationtechniquesanditsapplicationsinhealthdomain
AT khanbaseem qualitativeanalysisoftextsummarizationtechniquesanditsapplicationsinhealthdomain