Cargando…
Word synonym relationships for text analysis: A graph-based approach
Keyword extraction refers to the process of detecting the most relevant terms and expressions in a given text in a timely manner. In the information explosion era, keyword extraction has attracted increasing attention. The importance of keyword extraction in text summarization, text comparisons, and...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8315826/ https://www.ncbi.nlm.nih.gov/pubmed/34315172 http://dx.doi.org/10.1371/journal.pone.0255127 |
_version_ | 1783729782355132416 |
---|---|
author | Alrasheed, Hend |
author_facet | Alrasheed, Hend |
author_sort | Alrasheed, Hend |
collection | PubMed |
description | Keyword extraction refers to the process of detecting the most relevant terms and expressions in a given text in a timely manner. In the information explosion era, keyword extraction has attracted increasing attention. The importance of keyword extraction in text summarization, text comparisons, and document categorization has led to an emphasis on graph-based keyword extraction techniques because they can capture more structural information compared to other classic text analysis methods. In this paper, we propose a simple unsupervised text mining approach that aims to extract a set of keywords from a given text and analyze its topic diversity using graph analysis tools. Initially, the text is represented as a directed graph using synonym relationships. Then, community detection and other measures are used to identify keywords in the text. The set of extracted keywords is used to assess topic diversity within the text and analyze its sentiment. The proposed approach relies on grouping semantically similar candidate words. This approach ensures that the set of extracted keywords is comprehensive. Differing from other graph-based keyword extraction approaches, the proposed method does not require user parameters during graph construction and word scoring. The proposed approach achieved significant results compared to other keyword extraction techniques. |
format | Online Article Text |
id | pubmed-8315826 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-83158262021-07-31 Word synonym relationships for text analysis: A graph-based approach Alrasheed, Hend PLoS One Research Article Keyword extraction refers to the process of detecting the most relevant terms and expressions in a given text in a timely manner. In the information explosion era, keyword extraction has attracted increasing attention. The importance of keyword extraction in text summarization, text comparisons, and document categorization has led to an emphasis on graph-based keyword extraction techniques because they can capture more structural information compared to other classic text analysis methods. In this paper, we propose a simple unsupervised text mining approach that aims to extract a set of keywords from a given text and analyze its topic diversity using graph analysis tools. Initially, the text is represented as a directed graph using synonym relationships. Then, community detection and other measures are used to identify keywords in the text. The set of extracted keywords is used to assess topic diversity within the text and analyze its sentiment. The proposed approach relies on grouping semantically similar candidate words. This approach ensures that the set of extracted keywords is comprehensive. Differing from other graph-based keyword extraction approaches, the proposed method does not require user parameters during graph construction and word scoring. The proposed approach achieved significant results compared to other keyword extraction techniques. Public Library of Science 2021-07-27 /pmc/articles/PMC8315826/ /pubmed/34315172 http://dx.doi.org/10.1371/journal.pone.0255127 Text en © 2021 Hend Alrasheed https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Alrasheed, Hend Word synonym relationships for text analysis: A graph-based approach |
title | Word synonym relationships for text analysis: A graph-based approach |
title_full | Word synonym relationships for text analysis: A graph-based approach |
title_fullStr | Word synonym relationships for text analysis: A graph-based approach |
title_full_unstemmed | Word synonym relationships for text analysis: A graph-based approach |
title_short | Word synonym relationships for text analysis: A graph-based approach |
title_sort | word synonym relationships for text analysis: a graph-based approach |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8315826/ https://www.ncbi.nlm.nih.gov/pubmed/34315172 http://dx.doi.org/10.1371/journal.pone.0255127 |
work_keys_str_mv | AT alrasheedhend wordsynonymrelationshipsfortextanalysisagraphbasedapproach |