Cargando…

A MeSH-based text mining method for identifying novel prebiotics

Prebiotics contribute to the well-being of their host by altering the composition of the gut microbiota. Discovering new prebiotics is a challenging and arduous task due to strict inclusion criteria; thus, highly limited numbers of prebiotic candidates have been identified. Notably, the large number...

Descripción completa

Detalles Bibliográficos
Autores principales: Shan, Guangyu, Lu, Yiming, Min, Bo, Qu, Wubin, Zhang, Chenggang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Wolters Kluwer Health 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5266046/
https://www.ncbi.nlm.nih.gov/pubmed/27930574
http://dx.doi.org/10.1097/MD.0000000000005585
_version_ 1782500394801299456
author Shan, Guangyu
Lu, Yiming
Min, Bo
Qu, Wubin
Zhang, Chenggang
author_facet Shan, Guangyu
Lu, Yiming
Min, Bo
Qu, Wubin
Zhang, Chenggang
author_sort Shan, Guangyu
collection PubMed
description Prebiotics contribute to the well-being of their host by altering the composition of the gut microbiota. Discovering new prebiotics is a challenging and arduous task due to strict inclusion criteria; thus, highly limited numbers of prebiotic candidates have been identified. Notably, the large numbers of published studies may contain substantial information attached to various features of known prebiotics that can be used to predict new candidates. In this paper, we propose a medical subject headings (MeSH)-based text mining method for identifying new prebiotics with structured texts obtained from PubMed. We defined an optimal feature set for prebiotics prediction using a systematic feature-ranking algorithm with which a variety of carbohydrates can be accurately classified into different clusters in accordance with their chemical and biological attributes. The optimal feature set was used to separate positive prebiotics from other carbohydrates, and a cross-validation procedure was employed to assess the prediction accuracy of the model. Our method achieved a specificity of 0.876 and a sensitivity of 0.838. Finally, we identified a high-confidence list of candidates of prebiotics that are strongly supported by the literature. Our study demonstrates that text mining from high-volume biomedical literature is a promising approach in searching for new prebiotics.
format Online
Article
Text
id pubmed-5266046
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Wolters Kluwer Health
record_format MEDLINE/PubMed
spelling pubmed-52660462017-02-06 A MeSH-based text mining method for identifying novel prebiotics Shan, Guangyu Lu, Yiming Min, Bo Qu, Wubin Zhang, Chenggang Medicine (Baltimore) 5500 Prebiotics contribute to the well-being of their host by altering the composition of the gut microbiota. Discovering new prebiotics is a challenging and arduous task due to strict inclusion criteria; thus, highly limited numbers of prebiotic candidates have been identified. Notably, the large numbers of published studies may contain substantial information attached to various features of known prebiotics that can be used to predict new candidates. In this paper, we propose a medical subject headings (MeSH)-based text mining method for identifying new prebiotics with structured texts obtained from PubMed. We defined an optimal feature set for prebiotics prediction using a systematic feature-ranking algorithm with which a variety of carbohydrates can be accurately classified into different clusters in accordance with their chemical and biological attributes. The optimal feature set was used to separate positive prebiotics from other carbohydrates, and a cross-validation procedure was employed to assess the prediction accuracy of the model. Our method achieved a specificity of 0.876 and a sensitivity of 0.838. Finally, we identified a high-confidence list of candidates of prebiotics that are strongly supported by the literature. Our study demonstrates that text mining from high-volume biomedical literature is a promising approach in searching for new prebiotics. Wolters Kluwer Health 2016-12-09 /pmc/articles/PMC5266046/ /pubmed/27930574 http://dx.doi.org/10.1097/MD.0000000000005585 Text en Copyright © 2016 the Author(s). Published by Wolters Kluwer Health, Inc. All rights reserved. http://creativecommons.org/licenses/by-nc-nd/4.0 This is an open access article distributed under the terms of the Creative Commons Attribution-Non Commercial-No Derivatives License 4.0 (CCBY-NC-ND), where it is permissible to download and share the work provided it is properly cited. The work cannot be changed in any way or used commercially. http://creativecommons.org/licenses/by-nc-nd/4.0
spellingShingle 5500
Shan, Guangyu
Lu, Yiming
Min, Bo
Qu, Wubin
Zhang, Chenggang
A MeSH-based text mining method for identifying novel prebiotics
title A MeSH-based text mining method for identifying novel prebiotics
title_full A MeSH-based text mining method for identifying novel prebiotics
title_fullStr A MeSH-based text mining method for identifying novel prebiotics
title_full_unstemmed A MeSH-based text mining method for identifying novel prebiotics
title_short A MeSH-based text mining method for identifying novel prebiotics
title_sort mesh-based text mining method for identifying novel prebiotics
topic 5500
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5266046/
https://www.ncbi.nlm.nih.gov/pubmed/27930574
http://dx.doi.org/10.1097/MD.0000000000005585
work_keys_str_mv AT shanguangyu ameshbasedtextminingmethodforidentifyingnovelprebiotics
AT luyiming ameshbasedtextminingmethodforidentifyingnovelprebiotics
AT minbo ameshbasedtextminingmethodforidentifyingnovelprebiotics
AT quwubin ameshbasedtextminingmethodforidentifyingnovelprebiotics
AT zhangchenggang ameshbasedtextminingmethodforidentifyingnovelprebiotics
AT shanguangyu meshbasedtextminingmethodforidentifyingnovelprebiotics
AT luyiming meshbasedtextminingmethodforidentifyingnovelprebiotics
AT minbo meshbasedtextminingmethodforidentifyingnovelprebiotics
AT quwubin meshbasedtextminingmethodforidentifyingnovelprebiotics
AT zhangchenggang meshbasedtextminingmethodforidentifyingnovelprebiotics