Cargando…

Text Mining in Biomedical Domain with Emphasis on Document Clustering

OBJECTIVES: With the exponential increase in the number of articles published every year in the biomedical domain, there is a need to build automated systems to extract unknown information from the articles published. Text mining techniques enable the extraction of unknown knowledge from unstructure...

Descripción completa

Detalles Bibliográficos
Autor principal: Renganathan, Vinaitheerthan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korean Society of Medical Informatics 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5572517/
https://www.ncbi.nlm.nih.gov/pubmed/28875048
http://dx.doi.org/10.4258/hir.2017.23.3.141
_version_ 1783259538893307904
author Renganathan, Vinaitheerthan
author_facet Renganathan, Vinaitheerthan
author_sort Renganathan, Vinaitheerthan
collection PubMed
description OBJECTIVES: With the exponential increase in the number of articles published every year in the biomedical domain, there is a need to build automated systems to extract unknown information from the articles published. Text mining techniques enable the extraction of unknown knowledge from unstructured documents. METHODS: This paper reviews text mining processes in detail and the software tools available to carry out text mining. It also reviews the roles and applications of text mining in the biomedical domain. RESULTS: Text mining processes, such as search and retrieval of documents, pre-processing of documents, natural language processing, methods for text clustering, and methods for text classification are described in detail. CONCLUSIONS: Text mining techniques can facilitate the mining of vast amounts of knowledge on a given topic from published biomedical research articles and draw meaningful conclusions that are not possible otherwise.
format Online
Article
Text
id pubmed-5572517
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Korean Society of Medical Informatics
record_format MEDLINE/PubMed
spelling pubmed-55725172017-09-05 Text Mining in Biomedical Domain with Emphasis on Document Clustering Renganathan, Vinaitheerthan Healthc Inform Res Review Article OBJECTIVES: With the exponential increase in the number of articles published every year in the biomedical domain, there is a need to build automated systems to extract unknown information from the articles published. Text mining techniques enable the extraction of unknown knowledge from unstructured documents. METHODS: This paper reviews text mining processes in detail and the software tools available to carry out text mining. It also reviews the roles and applications of text mining in the biomedical domain. RESULTS: Text mining processes, such as search and retrieval of documents, pre-processing of documents, natural language processing, methods for text clustering, and methods for text classification are described in detail. CONCLUSIONS: Text mining techniques can facilitate the mining of vast amounts of knowledge on a given topic from published biomedical research articles and draw meaningful conclusions that are not possible otherwise. Korean Society of Medical Informatics 2017-07 2017-07-31 /pmc/articles/PMC5572517/ /pubmed/28875048 http://dx.doi.org/10.4258/hir.2017.23.3.141 Text en © 2017 The Korean Society of Medical Informatics http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Review Article
Renganathan, Vinaitheerthan
Text Mining in Biomedical Domain with Emphasis on Document Clustering
title Text Mining in Biomedical Domain with Emphasis on Document Clustering
title_full Text Mining in Biomedical Domain with Emphasis on Document Clustering
title_fullStr Text Mining in Biomedical Domain with Emphasis on Document Clustering
title_full_unstemmed Text Mining in Biomedical Domain with Emphasis on Document Clustering
title_short Text Mining in Biomedical Domain with Emphasis on Document Clustering
title_sort text mining in biomedical domain with emphasis on document clustering
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5572517/
https://www.ncbi.nlm.nih.gov/pubmed/28875048
http://dx.doi.org/10.4258/hir.2017.23.3.141
work_keys_str_mv AT renganathanvinaitheerthan textmininginbiomedicaldomainwithemphasisondocumentclustering