Cargando…

A MEDLINE categorization algorithm

BACKGROUND: Categorization is designed to enhance resource description by organizing content description so as to enable the reader to grasp quickly and easily what are the main topics discussed in it. The objective of this work is to propose a categorization algorithm to classify a set of scientifi...

Descripción completa

Detalles Bibliográficos
Autores principales: Darmoni, Stefan J, Névéol, Aurelie, Renard, Jean-Marie, Gehanno, Jean-Francois, Soualmia, Lina F, Dahamna, Badisse, Thirion, Benoit
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1456955/
https://www.ncbi.nlm.nih.gov/pubmed/16464249
http://dx.doi.org/10.1186/1472-6947-6-7
_version_ 1782127409373380608
author Darmoni, Stefan J
Névéol, Aurelie
Renard, Jean-Marie
Gehanno, Jean-Francois
Soualmia, Lina F
Dahamna, Badisse
Thirion, Benoit
author_facet Darmoni, Stefan J
Névéol, Aurelie
Renard, Jean-Marie
Gehanno, Jean-Francois
Soualmia, Lina F
Dahamna, Badisse
Thirion, Benoit
author_sort Darmoni, Stefan J
collection PubMed
description BACKGROUND: Categorization is designed to enhance resource description by organizing content description so as to enable the reader to grasp quickly and easily what are the main topics discussed in it. The objective of this work is to propose a categorization algorithm to classify a set of scientific articles indexed with the MeSH thesaurus, and in particular those of the MEDLINE bibliographic database. In a large bibliographic database such as MEDLINE, finding materials of particular interest to a specialty group, or relevant to a particular audience, can be difficult. The categorization refines the retrieval of indexed material. In the CISMeF terminology, metaterms can be considered as super-concepts. They were primarily conceived to improve recall in the CISMeF quality-controlled health gateway. METHODS: The MEDLINE categorization algorithm (MCA) is based on semantic links existing between MeSH terms and metaterms on the one hand and between MeSH subheadings and metaterms on the other hand. These links are used to automatically infer a list of metaterms from any MeSH term/subheading indexing. Medical librarians manually select the semantic links. RESULTS: The MEDLINE categorization algorithm lists the medical specialties relevant to a MEDLINE file by decreasing order of their importance. The MEDLINE categorization algorithm is available on a Web site. It can run on any MEDLINE file in a batch mode. As an example, the top 3 medical specialties for the set of 60 articles published in BioMed Central Medical Informatics & Decision Making, which are currently indexed in MEDLINE are: information science, organization and administration and medical informatics. CONCLUSION: We have presented a MEDLINE categorization algorithm in order to classify the medical specialties addressed in any MEDLINE file in the form of a ranked list of relevant specialties. The categorization method introduced in this paper is based on the manual indexing of resources with MeSH (terms/subheadings) pairs by NLM indexers. This algorithm may be used as a new bibliometric tool.
format Text
id pubmed-1456955
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-14569552006-05-04 A MEDLINE categorization algorithm Darmoni, Stefan J Névéol, Aurelie Renard, Jean-Marie Gehanno, Jean-Francois Soualmia, Lina F Dahamna, Badisse Thirion, Benoit BMC Med Inform Decis Mak Research Article BACKGROUND: Categorization is designed to enhance resource description by organizing content description so as to enable the reader to grasp quickly and easily what are the main topics discussed in it. The objective of this work is to propose a categorization algorithm to classify a set of scientific articles indexed with the MeSH thesaurus, and in particular those of the MEDLINE bibliographic database. In a large bibliographic database such as MEDLINE, finding materials of particular interest to a specialty group, or relevant to a particular audience, can be difficult. The categorization refines the retrieval of indexed material. In the CISMeF terminology, metaterms can be considered as super-concepts. They were primarily conceived to improve recall in the CISMeF quality-controlled health gateway. METHODS: The MEDLINE categorization algorithm (MCA) is based on semantic links existing between MeSH terms and metaterms on the one hand and between MeSH subheadings and metaterms on the other hand. These links are used to automatically infer a list of metaterms from any MeSH term/subheading indexing. Medical librarians manually select the semantic links. RESULTS: The MEDLINE categorization algorithm lists the medical specialties relevant to a MEDLINE file by decreasing order of their importance. The MEDLINE categorization algorithm is available on a Web site. It can run on any MEDLINE file in a batch mode. As an example, the top 3 medical specialties for the set of 60 articles published in BioMed Central Medical Informatics & Decision Making, which are currently indexed in MEDLINE are: information science, organization and administration and medical informatics. CONCLUSION: We have presented a MEDLINE categorization algorithm in order to classify the medical specialties addressed in any MEDLINE file in the form of a ranked list of relevant specialties. The categorization method introduced in this paper is based on the manual indexing of resources with MeSH (terms/subheadings) pairs by NLM indexers. This algorithm may be used as a new bibliometric tool. BioMed Central 2006-02-07 /pmc/articles/PMC1456955/ /pubmed/16464249 http://dx.doi.org/10.1186/1472-6947-6-7 Text en Copyright © 2006 Darmoni et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Darmoni, Stefan J
Névéol, Aurelie
Renard, Jean-Marie
Gehanno, Jean-Francois
Soualmia, Lina F
Dahamna, Badisse
Thirion, Benoit
A MEDLINE categorization algorithm
title A MEDLINE categorization algorithm
title_full A MEDLINE categorization algorithm
title_fullStr A MEDLINE categorization algorithm
title_full_unstemmed A MEDLINE categorization algorithm
title_short A MEDLINE categorization algorithm
title_sort medline categorization algorithm
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1456955/
https://www.ncbi.nlm.nih.gov/pubmed/16464249
http://dx.doi.org/10.1186/1472-6947-6-7
work_keys_str_mv AT darmonistefanj amedlinecategorizationalgorithm
AT neveolaurelie amedlinecategorizationalgorithm
AT renardjeanmarie amedlinecategorizationalgorithm
AT gehannojeanfrancois amedlinecategorizationalgorithm
AT soualmialinaf amedlinecategorizationalgorithm
AT dahamnabadisse amedlinecategorizationalgorithm
AT thirionbenoit amedlinecategorizationalgorithm
AT darmonistefanj medlinecategorizationalgorithm
AT neveolaurelie medlinecategorizationalgorithm
AT renardjeanmarie medlinecategorizationalgorithm
AT gehannojeanfrancois medlinecategorizationalgorithm
AT soualmialinaf medlinecategorizationalgorithm
AT dahamnabadisse medlinecategorizationalgorithm
AT thirionbenoit medlinecategorizationalgorithm