Cargando…

Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research

BACKGROUND: PubMed is a widely used database for scientists to find biomedical-related literature. Due to the complexity of the selected research subject and its interdisciplinary nature, as well as the exponential growth in the number of disparate pieces of biomedical literature, it is an overwhelm...

Descripción completa

Detalles Bibliográficos
Autores principales: Su, Yan, Andrews, James, Huang, Hong, Wang, Yue, Kong, Liangliang, Cannon, Peter, Xu, Ping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4878086/
https://www.ncbi.nlm.nih.gov/pubmed/27215352
http://dx.doi.org/10.1186/s12911-016-0298-z
_version_ 1782433508276305920
author Su, Yan
Andrews, James
Huang, Hong
Wang, Yue
Kong, Liangliang
Cannon, Peter
Xu, Ping
author_facet Su, Yan
Andrews, James
Huang, Hong
Wang, Yue
Kong, Liangliang
Cannon, Peter
Xu, Ping
author_sort Su, Yan
collection PubMed
description BACKGROUND: PubMed is a widely used database for scientists to find biomedical-related literature. Due to the complexity of the selected research subject and its interdisciplinary nature, as well as the exponential growth in the number of disparate pieces of biomedical literature, it is an overwhelming challenge for scientists to define the right search strategies and quickly locate all related information. Specialized subsets and groupings of controlled vocabularies, such as Medical Subject Headings (MeSH), can enhance information retrieval in specialized domains, such as stem cell research. There is a need to develop effective search strategies and convenient solutions for knowledge organization in stem cell research. The understanding of the interrelationships between these MeSH terms also facilitates the building of knowledge organization systems in related subject fields. METHODS: This study collected empirical data for MeSH-related terms from stem cell literature and developed a novel approach that uses both automation and expert-selection to create a set of terms that supports enhanced retrieval. The selected MeSH terms were reconstructed into a classified thesaurus that can guide researchers towards a successful search and knowledge organization of stem cell literature. RESULTS: First, 4253 MeSH terms were harvested from a sample of 5527 stem cell related research papers from the PubMed database. Next, unrelated terms were filtered out based on term frequency and specificity. Precision and recall measures were used to help identify additional valuable terms, which were mostly non-MeSH terms. The study identified 15 terms that specifically referred to stem cell research for information retrieval, which would yield a higher precision (97.7 %) and recall (94.4 %) rates in comparison to other approaches. In addition, 128 root MeSH terms were selected to conduct knowledge organization of stem cell research in categories of anatomy, disease, and others. CONCLUSIONS: This study presented a novel strategy and procedure to reengineer term selections of the MeSH thesaurus for literature retrieval and knowledge organization using stem cell research as a case. It could help scientists to select their own search terms and build up a thesaurus-based knowledge organization system in interested and interdisciplinary research subject areas. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-016-0298-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4878086
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48780862016-05-25 Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research Su, Yan Andrews, James Huang, Hong Wang, Yue Kong, Liangliang Cannon, Peter Xu, Ping BMC Med Inform Decis Mak Research Article BACKGROUND: PubMed is a widely used database for scientists to find biomedical-related literature. Due to the complexity of the selected research subject and its interdisciplinary nature, as well as the exponential growth in the number of disparate pieces of biomedical literature, it is an overwhelming challenge for scientists to define the right search strategies and quickly locate all related information. Specialized subsets and groupings of controlled vocabularies, such as Medical Subject Headings (MeSH), can enhance information retrieval in specialized domains, such as stem cell research. There is a need to develop effective search strategies and convenient solutions for knowledge organization in stem cell research. The understanding of the interrelationships between these MeSH terms also facilitates the building of knowledge organization systems in related subject fields. METHODS: This study collected empirical data for MeSH-related terms from stem cell literature and developed a novel approach that uses both automation and expert-selection to create a set of terms that supports enhanced retrieval. The selected MeSH terms were reconstructed into a classified thesaurus that can guide researchers towards a successful search and knowledge organization of stem cell literature. RESULTS: First, 4253 MeSH terms were harvested from a sample of 5527 stem cell related research papers from the PubMed database. Next, unrelated terms were filtered out based on term frequency and specificity. Precision and recall measures were used to help identify additional valuable terms, which were mostly non-MeSH terms. The study identified 15 terms that specifically referred to stem cell research for information retrieval, which would yield a higher precision (97.7 %) and recall (94.4 %) rates in comparison to other approaches. In addition, 128 root MeSH terms were selected to conduct knowledge organization of stem cell research in categories of anatomy, disease, and others. CONCLUSIONS: This study presented a novel strategy and procedure to reengineer term selections of the MeSH thesaurus for literature retrieval and knowledge organization using stem cell research as a case. It could help scientists to select their own search terms and build up a thesaurus-based knowledge organization system in interested and interdisciplinary research subject areas. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-016-0298-z) contains supplementary material, which is available to authorized users. BioMed Central 2016-05-23 /pmc/articles/PMC4878086/ /pubmed/27215352 http://dx.doi.org/10.1186/s12911-016-0298-z Text en © Su et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Su, Yan
Andrews, James
Huang, Hong
Wang, Yue
Kong, Liangliang
Cannon, Peter
Xu, Ping
Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research
title Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research
title_full Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research
title_fullStr Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research
title_full_unstemmed Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research
title_short Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research
title_sort reengineering of mesh thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4878086/
https://www.ncbi.nlm.nih.gov/pubmed/27215352
http://dx.doi.org/10.1186/s12911-016-0298-z
work_keys_str_mv AT suyan reengineeringofmeshthesaurifortermselectiontooptimizeliteratureretrievalandknowledgereconstructioninsupportofstemcellresearch
AT andrewsjames reengineeringofmeshthesaurifortermselectiontooptimizeliteratureretrievalandknowledgereconstructioninsupportofstemcellresearch
AT huanghong reengineeringofmeshthesaurifortermselectiontooptimizeliteratureretrievalandknowledgereconstructioninsupportofstemcellresearch
AT wangyue reengineeringofmeshthesaurifortermselectiontooptimizeliteratureretrievalandknowledgereconstructioninsupportofstemcellresearch
AT kongliangliang reengineeringofmeshthesaurifortermselectiontooptimizeliteratureretrievalandknowledgereconstructioninsupportofstemcellresearch
AT cannonpeter reengineeringofmeshthesaurifortermselectiontooptimizeliteratureretrievalandknowledgereconstructioninsupportofstemcellresearch
AT xuping reengineeringofmeshthesaurifortermselectiontooptimizeliteratureretrievalandknowledgereconstructioninsupportofstemcellresearch