Cargando…

Clustering of PubMed abstracts using nearer terms of the domain

Literature search is a process in which external developers provide alternative representations for efficient data mining of biomedical literature such as ranking search results, displaying summarized knowledge of semantics and clustering results into topics. In clustering search results, prominent...

Descripción completa

Detalles Bibliográficos
Autores principales:	David, Mary Rajathei, Samuel, Selvaraj
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Biomedical Informatics 2012
Materias:	Hypothesis
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3282271/ https://www.ncbi.nlm.nih.gov/pubmed/22359430

_version_	1782224059550924800
author	David, Mary Rajathei Samuel, Selvaraj
author_facet	David, Mary Rajathei Samuel, Selvaraj
author_sort	David, Mary Rajathei
collection	PubMed
description	Literature search is a process in which external developers provide alternative representations for efficient data mining of biomedical literature such as ranking search results, displaying summarized knowledge of semantics and clustering results into topics. In clustering search results, prominent vocabularies, such as GO (Gene Ontology), MeSH(Medical Subject Headings) and frequent terms extracted from retrieved PubMed abstracts have been used as topics for grouping. In this study, we have proposed FNeTD (Frequent Nearer Terms of the Domain) method for PubMed abstracts clustering. This is achieved through a two-step process viz; i) identifying frequent words or phrases in the abstracts through the frequent multi-word extraction algorithm and ii) identifying nearer terms of the domain from the extracted frequent phrases using the nearest neighbors search. The efficiency of the clustering of PubMed abstracts using nearer terms of the domain was measured using F-score. The present study suggests that nearer terms of the domain can be used for clustering the search results.
format	Online Article Text
id	pubmed-3282271
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	Biomedical Informatics
record_format	MEDLINE/PubMed
spelling	pubmed-32822712012-02-22 Clustering of PubMed abstracts using nearer terms of the domain David, Mary Rajathei Samuel, Selvaraj Bioinformation Hypothesis Literature search is a process in which external developers provide alternative representations for efficient data mining of biomedical literature such as ranking search results, displaying summarized knowledge of semantics and clustering results into topics. In clustering search results, prominent vocabularies, such as GO (Gene Ontology), MeSH(Medical Subject Headings) and frequent terms extracted from retrieved PubMed abstracts have been used as topics for grouping. In this study, we have proposed FNeTD (Frequent Nearer Terms of the Domain) method for PubMed abstracts clustering. This is achieved through a two-step process viz; i) identifying frequent words or phrases in the abstracts through the frequent multi-word extraction algorithm and ii) identifying nearer terms of the domain from the extracted frequent phrases using the nearest neighbors search. The efficiency of the clustering of PubMed abstracts using nearer terms of the domain was measured using F-score. The present study suggests that nearer terms of the domain can be used for clustering the search results. Biomedical Informatics 2012-01-06 /pmc/articles/PMC3282271/ /pubmed/22359430 Text en © 2012 Biomedical Informatics This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited.
spellingShingle	Hypothesis David, Mary Rajathei Samuel, Selvaraj Clustering of PubMed abstracts using nearer terms of the domain
title	Clustering of PubMed abstracts using nearer terms of the domain
title_full	Clustering of PubMed abstracts using nearer terms of the domain
title_fullStr	Clustering of PubMed abstracts using nearer terms of the domain
title_full_unstemmed	Clustering of PubMed abstracts using nearer terms of the domain
title_short	Clustering of PubMed abstracts using nearer terms of the domain
title_sort	clustering of pubmed abstracts using nearer terms of the domain
topic	Hypothesis
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3282271/ https://www.ncbi.nlm.nih.gov/pubmed/22359430
work_keys_str_mv	AT davidmaryrajathei clusteringofpubmedabstractsusingnearertermsofthedomain AT samuelselvaraj clusteringofpubmedabstractsusingnearertermsofthedomain

Clustering of PubMed abstracts using nearer terms of the domain

Ejemplares similares