Cargando…
A MeSH-based text mining method for identifying novel prebiotics
Prebiotics contribute to the well-being of their host by altering the composition of the gut microbiota. Discovering new prebiotics is a challenging and arduous task due to strict inclusion criteria; thus, highly limited numbers of prebiotic candidates have been identified. Notably, the large number...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Wolters Kluwer Health
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5266046/ https://www.ncbi.nlm.nih.gov/pubmed/27930574 http://dx.doi.org/10.1097/MD.0000000000005585 |
_version_ | 1782500394801299456 |
---|---|
author | Shan, Guangyu Lu, Yiming Min, Bo Qu, Wubin Zhang, Chenggang |
author_facet | Shan, Guangyu Lu, Yiming Min, Bo Qu, Wubin Zhang, Chenggang |
author_sort | Shan, Guangyu |
collection | PubMed |
description | Prebiotics contribute to the well-being of their host by altering the composition of the gut microbiota. Discovering new prebiotics is a challenging and arduous task due to strict inclusion criteria; thus, highly limited numbers of prebiotic candidates have been identified. Notably, the large numbers of published studies may contain substantial information attached to various features of known prebiotics that can be used to predict new candidates. In this paper, we propose a medical subject headings (MeSH)-based text mining method for identifying new prebiotics with structured texts obtained from PubMed. We defined an optimal feature set for prebiotics prediction using a systematic feature-ranking algorithm with which a variety of carbohydrates can be accurately classified into different clusters in accordance with their chemical and biological attributes. The optimal feature set was used to separate positive prebiotics from other carbohydrates, and a cross-validation procedure was employed to assess the prediction accuracy of the model. Our method achieved a specificity of 0.876 and a sensitivity of 0.838. Finally, we identified a high-confidence list of candidates of prebiotics that are strongly supported by the literature. Our study demonstrates that text mining from high-volume biomedical literature is a promising approach in searching for new prebiotics. |
format | Online Article Text |
id | pubmed-5266046 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Wolters Kluwer Health |
record_format | MEDLINE/PubMed |
spelling | pubmed-52660462017-02-06 A MeSH-based text mining method for identifying novel prebiotics Shan, Guangyu Lu, Yiming Min, Bo Qu, Wubin Zhang, Chenggang Medicine (Baltimore) 5500 Prebiotics contribute to the well-being of their host by altering the composition of the gut microbiota. Discovering new prebiotics is a challenging and arduous task due to strict inclusion criteria; thus, highly limited numbers of prebiotic candidates have been identified. Notably, the large numbers of published studies may contain substantial information attached to various features of known prebiotics that can be used to predict new candidates. In this paper, we propose a medical subject headings (MeSH)-based text mining method for identifying new prebiotics with structured texts obtained from PubMed. We defined an optimal feature set for prebiotics prediction using a systematic feature-ranking algorithm with which a variety of carbohydrates can be accurately classified into different clusters in accordance with their chemical and biological attributes. The optimal feature set was used to separate positive prebiotics from other carbohydrates, and a cross-validation procedure was employed to assess the prediction accuracy of the model. Our method achieved a specificity of 0.876 and a sensitivity of 0.838. Finally, we identified a high-confidence list of candidates of prebiotics that are strongly supported by the literature. Our study demonstrates that text mining from high-volume biomedical literature is a promising approach in searching for new prebiotics. Wolters Kluwer Health 2016-12-09 /pmc/articles/PMC5266046/ /pubmed/27930574 http://dx.doi.org/10.1097/MD.0000000000005585 Text en Copyright © 2016 the Author(s). Published by Wolters Kluwer Health, Inc. All rights reserved. http://creativecommons.org/licenses/by-nc-nd/4.0 This is an open access article distributed under the terms of the Creative Commons Attribution-Non Commercial-No Derivatives License 4.0 (CCBY-NC-ND), where it is permissible to download and share the work provided it is properly cited. The work cannot be changed in any way or used commercially. http://creativecommons.org/licenses/by-nc-nd/4.0 |
spellingShingle | 5500 Shan, Guangyu Lu, Yiming Min, Bo Qu, Wubin Zhang, Chenggang A MeSH-based text mining method for identifying novel prebiotics |
title | A MeSH-based text mining method for identifying novel prebiotics |
title_full | A MeSH-based text mining method for identifying novel prebiotics |
title_fullStr | A MeSH-based text mining method for identifying novel prebiotics |
title_full_unstemmed | A MeSH-based text mining method for identifying novel prebiotics |
title_short | A MeSH-based text mining method for identifying novel prebiotics |
title_sort | mesh-based text mining method for identifying novel prebiotics |
topic | 5500 |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5266046/ https://www.ncbi.nlm.nih.gov/pubmed/27930574 http://dx.doi.org/10.1097/MD.0000000000005585 |
work_keys_str_mv | AT shanguangyu ameshbasedtextminingmethodforidentifyingnovelprebiotics AT luyiming ameshbasedtextminingmethodforidentifyingnovelprebiotics AT minbo ameshbasedtextminingmethodforidentifyingnovelprebiotics AT quwubin ameshbasedtextminingmethodforidentifyingnovelprebiotics AT zhangchenggang ameshbasedtextminingmethodforidentifyingnovelprebiotics AT shanguangyu meshbasedtextminingmethodforidentifyingnovelprebiotics AT luyiming meshbasedtextminingmethodforidentifyingnovelprebiotics AT minbo meshbasedtextminingmethodforidentifyingnovelprebiotics AT quwubin meshbasedtextminingmethodforidentifyingnovelprebiotics AT zhangchenggang meshbasedtextminingmethodforidentifyingnovelprebiotics |