Cargando…

Computational Strategies for the Identification of a Transcriptional Biomarker Panel to Sense Cellular Growth States in Bacillus subtilis

A goal of the biotechnology industry is to be able to recognise detrimental cellular states that may lead to suboptimal or anomalous growth in a bacterial population. Our current knowledge of how different environmental treatments modulate gene regulation and bring about physiology adaptations is li...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Yiming, Smith, Wendy, Harwood, Colin, Wipat, Anil, Bacardit, Jaume
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8036383/
https://www.ncbi.nlm.nih.gov/pubmed/33916259
http://dx.doi.org/10.3390/s21072436
_version_ 1783676897273577472
author Huang, Yiming
Smith, Wendy
Harwood, Colin
Wipat, Anil
Bacardit, Jaume
author_facet Huang, Yiming
Smith, Wendy
Harwood, Colin
Wipat, Anil
Bacardit, Jaume
author_sort Huang, Yiming
collection PubMed
description A goal of the biotechnology industry is to be able to recognise detrimental cellular states that may lead to suboptimal or anomalous growth in a bacterial population. Our current knowledge of how different environmental treatments modulate gene regulation and bring about physiology adaptations is limited, and hence it is difficult to determine the mechanisms that lead to their effects. Patterns of gene expression, revealed using technologies such as microarrays or RNA-seq, can provide useful biomarkers of different gene regulatory states indicative of a bacterium’s physiological status. It is desirable to have only a few key genes as the biomarkers to reduce the costs of determining the transcriptional state by opening the way for methods such as quantitative RT-PCR and amplicon panels. In this paper, we used unsupervised machine learning to construct a transcriptional landscape model from condition-dependent transcriptome data, from which we have identified 10 clusters of samples with differentiated gene expression profiles and linked to different cellular growth states. Using an iterative feature elimination strategy, we identified a minimal panel of 10 biomarker genes that achieved 100% cross-validation accuracy in predicting the cluster assignment. Moreover, we designed and evaluated a variety of data processing strategies to ensure our methods were able to generate meaningful transcriptional landscape models, capturing relevant biological processes. Overall, the computational strategies introduced in this study facilitate the identification of a detailed set of relevant cellular growth states, and how to sense them using a reduced biomarker panel.
format Online
Article
Text
id pubmed-8036383
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-80363832021-04-12 Computational Strategies for the Identification of a Transcriptional Biomarker Panel to Sense Cellular Growth States in Bacillus subtilis Huang, Yiming Smith, Wendy Harwood, Colin Wipat, Anil Bacardit, Jaume Sensors (Basel) Article A goal of the biotechnology industry is to be able to recognise detrimental cellular states that may lead to suboptimal or anomalous growth in a bacterial population. Our current knowledge of how different environmental treatments modulate gene regulation and bring about physiology adaptations is limited, and hence it is difficult to determine the mechanisms that lead to their effects. Patterns of gene expression, revealed using technologies such as microarrays or RNA-seq, can provide useful biomarkers of different gene regulatory states indicative of a bacterium’s physiological status. It is desirable to have only a few key genes as the biomarkers to reduce the costs of determining the transcriptional state by opening the way for methods such as quantitative RT-PCR and amplicon panels. In this paper, we used unsupervised machine learning to construct a transcriptional landscape model from condition-dependent transcriptome data, from which we have identified 10 clusters of samples with differentiated gene expression profiles and linked to different cellular growth states. Using an iterative feature elimination strategy, we identified a minimal panel of 10 biomarker genes that achieved 100% cross-validation accuracy in predicting the cluster assignment. Moreover, we designed and evaluated a variety of data processing strategies to ensure our methods were able to generate meaningful transcriptional landscape models, capturing relevant biological processes. Overall, the computational strategies introduced in this study facilitate the identification of a detailed set of relevant cellular growth states, and how to sense them using a reduced biomarker panel. MDPI 2021-04-01 /pmc/articles/PMC8036383/ /pubmed/33916259 http://dx.doi.org/10.3390/s21072436 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Huang, Yiming
Smith, Wendy
Harwood, Colin
Wipat, Anil
Bacardit, Jaume
Computational Strategies for the Identification of a Transcriptional Biomarker Panel to Sense Cellular Growth States in Bacillus subtilis
title Computational Strategies for the Identification of a Transcriptional Biomarker Panel to Sense Cellular Growth States in Bacillus subtilis
title_full Computational Strategies for the Identification of a Transcriptional Biomarker Panel to Sense Cellular Growth States in Bacillus subtilis
title_fullStr Computational Strategies for the Identification of a Transcriptional Biomarker Panel to Sense Cellular Growth States in Bacillus subtilis
title_full_unstemmed Computational Strategies for the Identification of a Transcriptional Biomarker Panel to Sense Cellular Growth States in Bacillus subtilis
title_short Computational Strategies for the Identification of a Transcriptional Biomarker Panel to Sense Cellular Growth States in Bacillus subtilis
title_sort computational strategies for the identification of a transcriptional biomarker panel to sense cellular growth states in bacillus subtilis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8036383/
https://www.ncbi.nlm.nih.gov/pubmed/33916259
http://dx.doi.org/10.3390/s21072436
work_keys_str_mv AT huangyiming computationalstrategiesfortheidentificationofatranscriptionalbiomarkerpaneltosensecellulargrowthstatesinbacillussubtilis
AT smithwendy computationalstrategiesfortheidentificationofatranscriptionalbiomarkerpaneltosensecellulargrowthstatesinbacillussubtilis
AT harwoodcolin computationalstrategiesfortheidentificationofatranscriptionalbiomarkerpaneltosensecellulargrowthstatesinbacillussubtilis
AT wipatanil computationalstrategiesfortheidentificationofatranscriptionalbiomarkerpaneltosensecellulargrowthstatesinbacillussubtilis
AT bacarditjaume computationalstrategiesfortheidentificationofatranscriptionalbiomarkerpaneltosensecellulargrowthstatesinbacillussubtilis