Cargando…
Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research
BACKGROUND: PubMed is a widely used database for scientists to find biomedical-related literature. Due to the complexity of the selected research subject and its interdisciplinary nature, as well as the exponential growth in the number of disparate pieces of biomedical literature, it is an overwhelm...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4878086/ https://www.ncbi.nlm.nih.gov/pubmed/27215352 http://dx.doi.org/10.1186/s12911-016-0298-z |
_version_ | 1782433508276305920 |
---|---|
author | Su, Yan Andrews, James Huang, Hong Wang, Yue Kong, Liangliang Cannon, Peter Xu, Ping |
author_facet | Su, Yan Andrews, James Huang, Hong Wang, Yue Kong, Liangliang Cannon, Peter Xu, Ping |
author_sort | Su, Yan |
collection | PubMed |
description | BACKGROUND: PubMed is a widely used database for scientists to find biomedical-related literature. Due to the complexity of the selected research subject and its interdisciplinary nature, as well as the exponential growth in the number of disparate pieces of biomedical literature, it is an overwhelming challenge for scientists to define the right search strategies and quickly locate all related information. Specialized subsets and groupings of controlled vocabularies, such as Medical Subject Headings (MeSH), can enhance information retrieval in specialized domains, such as stem cell research. There is a need to develop effective search strategies and convenient solutions for knowledge organization in stem cell research. The understanding of the interrelationships between these MeSH terms also facilitates the building of knowledge organization systems in related subject fields. METHODS: This study collected empirical data for MeSH-related terms from stem cell literature and developed a novel approach that uses both automation and expert-selection to create a set of terms that supports enhanced retrieval. The selected MeSH terms were reconstructed into a classified thesaurus that can guide researchers towards a successful search and knowledge organization of stem cell literature. RESULTS: First, 4253 MeSH terms were harvested from a sample of 5527 stem cell related research papers from the PubMed database. Next, unrelated terms were filtered out based on term frequency and specificity. Precision and recall measures were used to help identify additional valuable terms, which were mostly non-MeSH terms. The study identified 15 terms that specifically referred to stem cell research for information retrieval, which would yield a higher precision (97.7 %) and recall (94.4 %) rates in comparison to other approaches. In addition, 128 root MeSH terms were selected to conduct knowledge organization of stem cell research in categories of anatomy, disease, and others. CONCLUSIONS: This study presented a novel strategy and procedure to reengineer term selections of the MeSH thesaurus for literature retrieval and knowledge organization using stem cell research as a case. It could help scientists to select their own search terms and build up a thesaurus-based knowledge organization system in interested and interdisciplinary research subject areas. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-016-0298-z) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4878086 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-48780862016-05-25 Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research Su, Yan Andrews, James Huang, Hong Wang, Yue Kong, Liangliang Cannon, Peter Xu, Ping BMC Med Inform Decis Mak Research Article BACKGROUND: PubMed is a widely used database for scientists to find biomedical-related literature. Due to the complexity of the selected research subject and its interdisciplinary nature, as well as the exponential growth in the number of disparate pieces of biomedical literature, it is an overwhelming challenge for scientists to define the right search strategies and quickly locate all related information. Specialized subsets and groupings of controlled vocabularies, such as Medical Subject Headings (MeSH), can enhance information retrieval in specialized domains, such as stem cell research. There is a need to develop effective search strategies and convenient solutions for knowledge organization in stem cell research. The understanding of the interrelationships between these MeSH terms also facilitates the building of knowledge organization systems in related subject fields. METHODS: This study collected empirical data for MeSH-related terms from stem cell literature and developed a novel approach that uses both automation and expert-selection to create a set of terms that supports enhanced retrieval. The selected MeSH terms were reconstructed into a classified thesaurus that can guide researchers towards a successful search and knowledge organization of stem cell literature. RESULTS: First, 4253 MeSH terms were harvested from a sample of 5527 stem cell related research papers from the PubMed database. Next, unrelated terms were filtered out based on term frequency and specificity. Precision and recall measures were used to help identify additional valuable terms, which were mostly non-MeSH terms. The study identified 15 terms that specifically referred to stem cell research for information retrieval, which would yield a higher precision (97.7 %) and recall (94.4 %) rates in comparison to other approaches. In addition, 128 root MeSH terms were selected to conduct knowledge organization of stem cell research in categories of anatomy, disease, and others. CONCLUSIONS: This study presented a novel strategy and procedure to reengineer term selections of the MeSH thesaurus for literature retrieval and knowledge organization using stem cell research as a case. It could help scientists to select their own search terms and build up a thesaurus-based knowledge organization system in interested and interdisciplinary research subject areas. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-016-0298-z) contains supplementary material, which is available to authorized users. BioMed Central 2016-05-23 /pmc/articles/PMC4878086/ /pubmed/27215352 http://dx.doi.org/10.1186/s12911-016-0298-z Text en © Su et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Su, Yan Andrews, James Huang, Hong Wang, Yue Kong, Liangliang Cannon, Peter Xu, Ping Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research |
title | Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research |
title_full | Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research |
title_fullStr | Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research |
title_full_unstemmed | Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research |
title_short | Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research |
title_sort | reengineering of mesh thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4878086/ https://www.ncbi.nlm.nih.gov/pubmed/27215352 http://dx.doi.org/10.1186/s12911-016-0298-z |
work_keys_str_mv | AT suyan reengineeringofmeshthesaurifortermselectiontooptimizeliteratureretrievalandknowledgereconstructioninsupportofstemcellresearch AT andrewsjames reengineeringofmeshthesaurifortermselectiontooptimizeliteratureretrievalandknowledgereconstructioninsupportofstemcellresearch AT huanghong reengineeringofmeshthesaurifortermselectiontooptimizeliteratureretrievalandknowledgereconstructioninsupportofstemcellresearch AT wangyue reengineeringofmeshthesaurifortermselectiontooptimizeliteratureretrievalandknowledgereconstructioninsupportofstemcellresearch AT kongliangliang reengineeringofmeshthesaurifortermselectiontooptimizeliteratureretrievalandknowledgereconstructioninsupportofstemcellresearch AT cannonpeter reengineeringofmeshthesaurifortermselectiontooptimizeliteratureretrievalandknowledgereconstructioninsupportofstemcellresearch AT xuping reengineeringofmeshthesaurifortermselectiontooptimizeliteratureretrievalandknowledgereconstructioninsupportofstemcellresearch |