Cargando…

Large scale text mining for deriving useful insights: A case study focused on microbiome

Text mining has been shown to be an auxiliary but key driver for modeling, data harmonization, and interpretation in bio-medicine. Scientific literature holds a wealth of information and embodies cumulative knowledge and remains the core basis on which mechanistic pathways, molecular databases, and...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahmed, Syed Ashif Jardary Al, Bapatdhar, Nishad, Kumar, Bipin Pradeep, Ghosh, Samik, Yachie, Ayako, Palaniappan, Sucheendra K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9473635/
https://www.ncbi.nlm.nih.gov/pubmed/36117696
http://dx.doi.org/10.3389/fphys.2022.933069
_version_ 1784789546338091008
author Ahmed, Syed Ashif Jardary Al
Bapatdhar, Nishad
Kumar, Bipin Pradeep
Ghosh, Samik
Yachie, Ayako
Palaniappan, Sucheendra K.
author_facet Ahmed, Syed Ashif Jardary Al
Bapatdhar, Nishad
Kumar, Bipin Pradeep
Ghosh, Samik
Yachie, Ayako
Palaniappan, Sucheendra K.
author_sort Ahmed, Syed Ashif Jardary Al
collection PubMed
description Text mining has been shown to be an auxiliary but key driver for modeling, data harmonization, and interpretation in bio-medicine. Scientific literature holds a wealth of information and embodies cumulative knowledge and remains the core basis on which mechanistic pathways, molecular databases, and models are built and refined. Text mining provides the necessary tools to automatically harness the potential of text. In this study, we show the potential of large-scale text mining for deriving novel insights, with a focus on the growing field of microbiome. We first collected the complete set of abstracts relevant to the microbiome from PubMed and used our text mining and intelligence platform Taxila for analysis. We drive the usefulness of text mining using two case studies. First, we analyze the geographical distribution of research and study locations for the field of microbiome by extracting geo mentions from text. Using this analysis, we were able to draw useful insights on the state of research in microbiome w. r.t geographical distributions and economic drivers. Next, to understand the relationships between diseases, microbiome, and food which are central to the field, we construct semantic relationship networks between these different concepts central to the field of microbiome. We show how such networks can be useful to derive useful insight with no prior knowledge encoded.
format Online
Article
Text
id pubmed-9473635
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-94736352022-09-15 Large scale text mining for deriving useful insights: A case study focused on microbiome Ahmed, Syed Ashif Jardary Al Bapatdhar, Nishad Kumar, Bipin Pradeep Ghosh, Samik Yachie, Ayako Palaniappan, Sucheendra K. Front Physiol Physiology Text mining has been shown to be an auxiliary but key driver for modeling, data harmonization, and interpretation in bio-medicine. Scientific literature holds a wealth of information and embodies cumulative knowledge and remains the core basis on which mechanistic pathways, molecular databases, and models are built and refined. Text mining provides the necessary tools to automatically harness the potential of text. In this study, we show the potential of large-scale text mining for deriving novel insights, with a focus on the growing field of microbiome. We first collected the complete set of abstracts relevant to the microbiome from PubMed and used our text mining and intelligence platform Taxila for analysis. We drive the usefulness of text mining using two case studies. First, we analyze the geographical distribution of research and study locations for the field of microbiome by extracting geo mentions from text. Using this analysis, we were able to draw useful insights on the state of research in microbiome w. r.t geographical distributions and economic drivers. Next, to understand the relationships between diseases, microbiome, and food which are central to the field, we construct semantic relationship networks between these different concepts central to the field of microbiome. We show how such networks can be useful to derive useful insight with no prior knowledge encoded. Frontiers Media S.A. 2022-08-31 /pmc/articles/PMC9473635/ /pubmed/36117696 http://dx.doi.org/10.3389/fphys.2022.933069 Text en Copyright © 2022 Ahmed, Bapatdhar, Kumar, Ghosh, Yachie and Palaniappan. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Physiology
Ahmed, Syed Ashif Jardary Al
Bapatdhar, Nishad
Kumar, Bipin Pradeep
Ghosh, Samik
Yachie, Ayako
Palaniappan, Sucheendra K.
Large scale text mining for deriving useful insights: A case study focused on microbiome
title Large scale text mining for deriving useful insights: A case study focused on microbiome
title_full Large scale text mining for deriving useful insights: A case study focused on microbiome
title_fullStr Large scale text mining for deriving useful insights: A case study focused on microbiome
title_full_unstemmed Large scale text mining for deriving useful insights: A case study focused on microbiome
title_short Large scale text mining for deriving useful insights: A case study focused on microbiome
title_sort large scale text mining for deriving useful insights: a case study focused on microbiome
topic Physiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9473635/
https://www.ncbi.nlm.nih.gov/pubmed/36117696
http://dx.doi.org/10.3389/fphys.2022.933069
work_keys_str_mv AT ahmedsyedashifjardaryal largescaletextminingforderivingusefulinsightsacasestudyfocusedonmicrobiome
AT bapatdharnishad largescaletextminingforderivingusefulinsightsacasestudyfocusedonmicrobiome
AT kumarbipinpradeep largescaletextminingforderivingusefulinsightsacasestudyfocusedonmicrobiome
AT ghoshsamik largescaletextminingforderivingusefulinsightsacasestudyfocusedonmicrobiome
AT yachieayako largescaletextminingforderivingusefulinsightsacasestudyfocusedonmicrobiome
AT palaniappansucheendrak largescaletextminingforderivingusefulinsightsacasestudyfocusedonmicrobiome