Improving information retrieval with multiple health terminologies in a quality-controlled gateway

BACKGROUND: The Catalog and Index of French-language Health Internet resources (CISMeF) is a quality-controlled health gateway, primarily for Web resources in French (n=89,751). Recently, we achieved a major improvement in the structure of the catalogue by setting up multiple terminologies, based on...

Bibliographic Details
Main authors: Soualmia, Lina F, Sakji, Saoussen, Letord, Catherine, Rollin, Laetitia, Massari, Philippe, Darmoni, Stéfan J
Format: Online Article Text
Language: English
Published: BioMed Central 2013
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4341235/
https://www.ncbi.nlm.nih.gov/pubmed/25825660
http://dx.doi.org/10.1186/2047-2501-1-8
author Soualmia, Lina F
Sakji, Saoussen
Letord, Catherine
Rollin, Laetitia
Massari, Philippe
Darmoni, Stéfan J
collection PubMed
description BACKGROUND: The Catalog and Index of French-language Health Internet resources (CISMeF) is a quality-controlled health gateway, primarily for Web resources in French (n=89,751). Recently, we achieved a major improvement in the structure of the catalogue by setting up multiple terminologies, based on twelve health terminologies available in French, to overcome the potential weakness of the MeSH thesaurus, which has been our main and pivotal terminology for indexing and retrieval since 1995. The main aim of this study was to estimate the added value of exploiting several terminologies and their semantic relationships to improve Web resource indexing and retrieval in CISMeF, in order to provide additional health resources that meet users' expectations. METHODS: Twelve terminologies were integrated into the CISMeF information system to set up multiple-terminology indexing and retrieval. The same set of thirty queries was run: (i) by exploiting the hierarchical structure of the MeSH, and (ii) by exploiting the additional twelve terminologies and their semantic links. The two search modes were evaluated and compared. RESULTS: The overall coverage of the multiple-terminology search mode improved on the coverage obtained with the MeSH alone (16,283 vs. 14,159; +15%). Of these additional findings, an estimated 56.6% were relevant, 24.7% intermediate and 18.7% irrelevant. CONCLUSION: The multiple-terminology approach improved information retrieval. These results suggest that integrating additional health terminologies can improve recall. Since the study was performed, 21 other terminologies have been added, which should enable broader studies of multiple-terminology information retrieval.
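The coverage figures reported in the abstract can be checked with a few lines of arithmetic. The sketch below is only an illustration: the variable and function names are hypothetical, and it assumes that "+15%" is the relative increase of the multiple-terminologies count over the MeSH-only count, which the abstract does not spell out.

```python
# Figures taken from the RESULTS section of the abstract.
mesh_only = 14_159          # coverage with the MeSH hierarchy alone
multi_terminology = 16_283  # coverage with the additional terminologies

def relative_gain(baseline: int, improved: int) -> float:
    """Relative increase of `improved` over `baseline` (assumed definition)."""
    return (improved - baseline) / baseline

# Number of additional results returned by the multiple-terminologies mode.
additional = multi_terminology - mesh_only
gain = relative_gain(mesh_only, multi_terminology)

# Relevance breakdown reported for the additional findings (percentages).
shares = {"relevant": 56.6, "intermediate": 24.7, "irrelevant": 18.7}
total_share = round(sum(shares.values()), 1)

print(f"additional results: {additional}")
print(f"relative gain: {gain:.1%}")
print(f"relevance shares sum to: {total_share}%")
```

Under that assumption, the difference between the two counts and the rounded gain agree with the reported "+15%", and the three relevance categories account for all of the additional findings.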
format Online
Article
Text
id pubmed-4341235
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4341235 2015-03-30 Improving information retrieval with multiple health terminologies in a quality-controlled gateway Soualmia, Lina F Sakji, Saoussen Letord, Catherine Rollin, Laetitia Massari, Philippe Darmoni, Stéfan J Health Inf Sci Syst Research BioMed Central 2013-02-04 /pmc/articles/PMC4341235/ /pubmed/25825660 http://dx.doi.org/10.1186/2047-2501-1-8 Text en © Soualmia et al.; licensee BioMed Central Ltd. 2013 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Improving information retrieval with multiple health terminologies in a quality-controlled gateway
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4341235/
https://www.ncbi.nlm.nih.gov/pubmed/25825660
http://dx.doi.org/10.1186/2047-2501-1-8