Cargando…

Word synonym relationships for text analysis: A graph-based approach

Keyword extraction refers to the process of detecting the most relevant terms and expressions in a given text in a timely manner. In the information explosion era, keyword extraction has attracted increasing attention. The importance of keyword extraction in text summarization, text comparisons, and...

Descripción completa

Detalles Bibliográficos
Autor principal: Alrasheed, Hend
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8315826/
https://www.ncbi.nlm.nih.gov/pubmed/34315172
http://dx.doi.org/10.1371/journal.pone.0255127
_version_ 1783729782355132416
author Alrasheed, Hend
author_facet Alrasheed, Hend
author_sort Alrasheed, Hend
collection PubMed
description Keyword extraction refers to the process of detecting the most relevant terms and expressions in a given text in a timely manner. In the information explosion era, keyword extraction has attracted increasing attention. The importance of keyword extraction in text summarization, text comparisons, and document categorization has led to an emphasis on graph-based keyword extraction techniques because they can capture more structural information compared to other classic text analysis methods. In this paper, we propose a simple unsupervised text mining approach that aims to extract a set of keywords from a given text and analyze its topic diversity using graph analysis tools. Initially, the text is represented as a directed graph using synonym relationships. Then, community detection and other measures are used to identify keywords in the text. The set of extracted keywords is used to assess topic diversity within the text and analyze its sentiment. The proposed approach relies on grouping semantically similar candidate words. This approach ensures that the set of extracted keywords is comprehensive. Differing from other graph-based keyword extraction approaches, the proposed method does not require user parameters during graph construction and word scoring. The proposed approach achieved significant results compared to other keyword extraction techniques.
format Online
Article
Text
id pubmed-8315826
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83158262021-07-31 Word synonym relationships for text analysis: A graph-based approach Alrasheed, Hend PLoS One Research Article Keyword extraction refers to the process of detecting the most relevant terms and expressions in a given text in a timely manner. In the information explosion era, keyword extraction has attracted increasing attention. The importance of keyword extraction in text summarization, text comparisons, and document categorization has led to an emphasis on graph-based keyword extraction techniques because they can capture more structural information compared to other classic text analysis methods. In this paper, we propose a simple unsupervised text mining approach that aims to extract a set of keywords from a given text and analyze its topic diversity using graph analysis tools. Initially, the text is represented as a directed graph using synonym relationships. Then, community detection and other measures are used to identify keywords in the text. The set of extracted keywords is used to assess topic diversity within the text and analyze its sentiment. The proposed approach relies on grouping semantically similar candidate words. This approach ensures that the set of extracted keywords is comprehensive. Differing from other graph-based keyword extraction approaches, the proposed method does not require user parameters during graph construction and word scoring. The proposed approach achieved significant results compared to other keyword extraction techniques. Public Library of Science 2021-07-27 /pmc/articles/PMC8315826/ /pubmed/34315172 http://dx.doi.org/10.1371/journal.pone.0255127 Text en © 2021 Hend Alrasheed https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Alrasheed, Hend
Word synonym relationships for text analysis: A graph-based approach
title Word synonym relationships for text analysis: A graph-based approach
title_full Word synonym relationships for text analysis: A graph-based approach
title_fullStr Word synonym relationships for text analysis: A graph-based approach
title_full_unstemmed Word synonym relationships for text analysis: A graph-based approach
title_short Word synonym relationships for text analysis: A graph-based approach
title_sort word synonym relationships for text analysis: a graph-based approach
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8315826/
https://www.ncbi.nlm.nih.gov/pubmed/34315172
http://dx.doi.org/10.1371/journal.pone.0255127
work_keys_str_mv AT alrasheedhend wordsynonymrelationshipsfortextanalysisagraphbasedapproach