Cargando…

Integration of text mining and biological network analysis: Identification of essential genes in sulfate-reducing bacteria

The growth and survival of an organism in a particular environment is highly depends on the certain indispensable genes, termed as essential genes. Sulfate-reducing bacteria (SRB) are obligate anaerobes which thrives on sulfate reduction for its energy requirements. The present study used Oleidesulf...

Descripción completa

Detalles Bibliográficos
Autores principales: Saxena, Priya, Rauniyar, Shailabh, Thakur, Payal, Singh, Ram Nageena, Bomgni, Alain, Alaba, Mathew O., Tripathi, Abhilash Kumar, Gnimpieba, Etienne Z., Lushbough, Carol, Sani, Rajesh Kumar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10133479/
https://www.ncbi.nlm.nih.gov/pubmed/37125195
http://dx.doi.org/10.3389/fmicb.2023.1086021
_version_ 1785031574752854016
author Saxena, Priya
Rauniyar, Shailabh
Thakur, Payal
Singh, Ram Nageena
Bomgni, Alain
Alaba, Mathew O.
Tripathi, Abhilash Kumar
Gnimpieba, Etienne Z.
Lushbough, Carol
Sani, Rajesh Kumar
author_facet Saxena, Priya
Rauniyar, Shailabh
Thakur, Payal
Singh, Ram Nageena
Bomgni, Alain
Alaba, Mathew O.
Tripathi, Abhilash Kumar
Gnimpieba, Etienne Z.
Lushbough, Carol
Sani, Rajesh Kumar
author_sort Saxena, Priya
collection PubMed
description The growth and survival of an organism in a particular environment is highly depends on the certain indispensable genes, termed as essential genes. Sulfate-reducing bacteria (SRB) are obligate anaerobes which thrives on sulfate reduction for its energy requirements. The present study used Oleidesulfovibrio alaskensis G20 (OA G20) as a model SRB to categorize the essential genes based on their key metabolic pathways. Herein, we reported a feedback loop framework for gene of interest discovery, from bio-problem to gene set of interest, leveraging expert annotation with computational prediction. Defined bio-problem was applied to retrieve the genes of SRB from literature databases (PubMed, and PubMed Central) and annotated them to the genome of OA G20. Retrieved gene list was further used to enrich protein–protein interaction and was corroborated to the pangenome analysis, to categorize the enriched gene sets and the respective pathways under essential and non-essential. Interestingly, the sat gene (dde_2265) from the sulfur metabolism was the bridging gene between all the enriched pathways. Gene clusters involved in essential pathways were linked with the genes from seleno-compound metabolism, amino acid metabolism, secondary metabolite synthesis, and cofactor biosynthesis. Furthermore, pangenome analysis demonstrated the gene distribution, where 69.83% of the 116 enriched genes were mapped under “persistent,” inferring the essentiality of these genes. Likewise, 21.55% of the enriched genes, which involves specially the formate dehydrogenases and metallic hydrogenases, appeared under “shell.” Our methodology suggested that semi-automated text mining and network analysis may play a crucial role in deciphering the previously unexplored genes and key mechanisms which can help to generate a baseline prior to perform any experimental studies.
format Online
Article
Text
id pubmed-10133479
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-101334792023-04-28 Integration of text mining and biological network analysis: Identification of essential genes in sulfate-reducing bacteria Saxena, Priya Rauniyar, Shailabh Thakur, Payal Singh, Ram Nageena Bomgni, Alain Alaba, Mathew O. Tripathi, Abhilash Kumar Gnimpieba, Etienne Z. Lushbough, Carol Sani, Rajesh Kumar Front Microbiol Microbiology The growth and survival of an organism in a particular environment is highly depends on the certain indispensable genes, termed as essential genes. Sulfate-reducing bacteria (SRB) are obligate anaerobes which thrives on sulfate reduction for its energy requirements. The present study used Oleidesulfovibrio alaskensis G20 (OA G20) as a model SRB to categorize the essential genes based on their key metabolic pathways. Herein, we reported a feedback loop framework for gene of interest discovery, from bio-problem to gene set of interest, leveraging expert annotation with computational prediction. Defined bio-problem was applied to retrieve the genes of SRB from literature databases (PubMed, and PubMed Central) and annotated them to the genome of OA G20. Retrieved gene list was further used to enrich protein–protein interaction and was corroborated to the pangenome analysis, to categorize the enriched gene sets and the respective pathways under essential and non-essential. Interestingly, the sat gene (dde_2265) from the sulfur metabolism was the bridging gene between all the enriched pathways. Gene clusters involved in essential pathways were linked with the genes from seleno-compound metabolism, amino acid metabolism, secondary metabolite synthesis, and cofactor biosynthesis. Furthermore, pangenome analysis demonstrated the gene distribution, where 69.83% of the 116 enriched genes were mapped under “persistent,” inferring the essentiality of these genes. Likewise, 21.55% of the enriched genes, which involves specially the formate dehydrogenases and metallic hydrogenases, appeared under “shell.” Our methodology suggested that semi-automated text mining and network analysis may play a crucial role in deciphering the previously unexplored genes and key mechanisms which can help to generate a baseline prior to perform any experimental studies. Frontiers Media S.A. 2023-04-13 /pmc/articles/PMC10133479/ /pubmed/37125195 http://dx.doi.org/10.3389/fmicb.2023.1086021 Text en Copyright © 2023 Saxena, Rauniyar, Thakur, Singh, Bomgni, Alaba, Tripathi, Gnimpieba, Lushbough and Sani. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Saxena, Priya
Rauniyar, Shailabh
Thakur, Payal
Singh, Ram Nageena
Bomgni, Alain
Alaba, Mathew O.
Tripathi, Abhilash Kumar
Gnimpieba, Etienne Z.
Lushbough, Carol
Sani, Rajesh Kumar
Integration of text mining and biological network analysis: Identification of essential genes in sulfate-reducing bacteria
title Integration of text mining and biological network analysis: Identification of essential genes in sulfate-reducing bacteria
title_full Integration of text mining and biological network analysis: Identification of essential genes in sulfate-reducing bacteria
title_fullStr Integration of text mining and biological network analysis: Identification of essential genes in sulfate-reducing bacteria
title_full_unstemmed Integration of text mining and biological network analysis: Identification of essential genes in sulfate-reducing bacteria
title_short Integration of text mining and biological network analysis: Identification of essential genes in sulfate-reducing bacteria
title_sort integration of text mining and biological network analysis: identification of essential genes in sulfate-reducing bacteria
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10133479/
https://www.ncbi.nlm.nih.gov/pubmed/37125195
http://dx.doi.org/10.3389/fmicb.2023.1086021
work_keys_str_mv AT saxenapriya integrationoftextminingandbiologicalnetworkanalysisidentificationofessentialgenesinsulfatereducingbacteria
AT rauniyarshailabh integrationoftextminingandbiologicalnetworkanalysisidentificationofessentialgenesinsulfatereducingbacteria
AT thakurpayal integrationoftextminingandbiologicalnetworkanalysisidentificationofessentialgenesinsulfatereducingbacteria
AT singhramnageena integrationoftextminingandbiologicalnetworkanalysisidentificationofessentialgenesinsulfatereducingbacteria
AT bomgnialain integrationoftextminingandbiologicalnetworkanalysisidentificationofessentialgenesinsulfatereducingbacteria
AT alabamathewo integrationoftextminingandbiologicalnetworkanalysisidentificationofessentialgenesinsulfatereducingbacteria
AT tripathiabhilashkumar integrationoftextminingandbiologicalnetworkanalysisidentificationofessentialgenesinsulfatereducingbacteria
AT gnimpiebaetiennez integrationoftextminingandbiologicalnetworkanalysisidentificationofessentialgenesinsulfatereducingbacteria
AT lushboughcarol integrationoftextminingandbiologicalnetworkanalysisidentificationofessentialgenesinsulfatereducingbacteria
AT sanirajeshkumar integrationoftextminingandbiologicalnetworkanalysisidentificationofessentialgenesinsulfatereducingbacteria