Cargando…
Computational Strategies for the Identification of a Transcriptional Biomarker Panel to Sense Cellular Growth States in Bacillus subtilis
A goal of the biotechnology industry is to be able to recognise detrimental cellular states that may lead to suboptimal or anomalous growth in a bacterial population. Our current knowledge of how different environmental treatments modulate gene regulation and bring about physiology adaptations is li...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8036383/ https://www.ncbi.nlm.nih.gov/pubmed/33916259 http://dx.doi.org/10.3390/s21072436 |
_version_ | 1783676897273577472 |
---|---|
author | Huang, Yiming Smith, Wendy Harwood, Colin Wipat, Anil Bacardit, Jaume |
author_facet | Huang, Yiming Smith, Wendy Harwood, Colin Wipat, Anil Bacardit, Jaume |
author_sort | Huang, Yiming |
collection | PubMed |
description | A goal of the biotechnology industry is to be able to recognise detrimental cellular states that may lead to suboptimal or anomalous growth in a bacterial population. Our current knowledge of how different environmental treatments modulate gene regulation and bring about physiology adaptations is limited, and hence it is difficult to determine the mechanisms that lead to their effects. Patterns of gene expression, revealed using technologies such as microarrays or RNA-seq, can provide useful biomarkers of different gene regulatory states indicative of a bacterium’s physiological status. It is desirable to have only a few key genes as the biomarkers to reduce the costs of determining the transcriptional state by opening the way for methods such as quantitative RT-PCR and amplicon panels. In this paper, we used unsupervised machine learning to construct a transcriptional landscape model from condition-dependent transcriptome data, from which we have identified 10 clusters of samples with differentiated gene expression profiles and linked to different cellular growth states. Using an iterative feature elimination strategy, we identified a minimal panel of 10 biomarker genes that achieved 100% cross-validation accuracy in predicting the cluster assignment. Moreover, we designed and evaluated a variety of data processing strategies to ensure our methods were able to generate meaningful transcriptional landscape models, capturing relevant biological processes. Overall, the computational strategies introduced in this study facilitate the identification of a detailed set of relevant cellular growth states, and how to sense them using a reduced biomarker panel. |
format | Online Article Text |
id | pubmed-8036383 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-80363832021-04-12 Computational Strategies for the Identification of a Transcriptional Biomarker Panel to Sense Cellular Growth States in Bacillus subtilis Huang, Yiming Smith, Wendy Harwood, Colin Wipat, Anil Bacardit, Jaume Sensors (Basel) Article A goal of the biotechnology industry is to be able to recognise detrimental cellular states that may lead to suboptimal or anomalous growth in a bacterial population. Our current knowledge of how different environmental treatments modulate gene regulation and bring about physiology adaptations is limited, and hence it is difficult to determine the mechanisms that lead to their effects. Patterns of gene expression, revealed using technologies such as microarrays or RNA-seq, can provide useful biomarkers of different gene regulatory states indicative of a bacterium’s physiological status. It is desirable to have only a few key genes as the biomarkers to reduce the costs of determining the transcriptional state by opening the way for methods such as quantitative RT-PCR and amplicon panels. In this paper, we used unsupervised machine learning to construct a transcriptional landscape model from condition-dependent transcriptome data, from which we have identified 10 clusters of samples with differentiated gene expression profiles and linked to different cellular growth states. Using an iterative feature elimination strategy, we identified a minimal panel of 10 biomarker genes that achieved 100% cross-validation accuracy in predicting the cluster assignment. Moreover, we designed and evaluated a variety of data processing strategies to ensure our methods were able to generate meaningful transcriptional landscape models, capturing relevant biological processes. Overall, the computational strategies introduced in this study facilitate the identification of a detailed set of relevant cellular growth states, and how to sense them using a reduced biomarker panel. MDPI 2021-04-01 /pmc/articles/PMC8036383/ /pubmed/33916259 http://dx.doi.org/10.3390/s21072436 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Huang, Yiming Smith, Wendy Harwood, Colin Wipat, Anil Bacardit, Jaume Computational Strategies for the Identification of a Transcriptional Biomarker Panel to Sense Cellular Growth States in Bacillus subtilis |
title | Computational Strategies for the Identification of a Transcriptional Biomarker Panel to Sense Cellular Growth States in Bacillus subtilis |
title_full | Computational Strategies for the Identification of a Transcriptional Biomarker Panel to Sense Cellular Growth States in Bacillus subtilis |
title_fullStr | Computational Strategies for the Identification of a Transcriptional Biomarker Panel to Sense Cellular Growth States in Bacillus subtilis |
title_full_unstemmed | Computational Strategies for the Identification of a Transcriptional Biomarker Panel to Sense Cellular Growth States in Bacillus subtilis |
title_short | Computational Strategies for the Identification of a Transcriptional Biomarker Panel to Sense Cellular Growth States in Bacillus subtilis |
title_sort | computational strategies for the identification of a transcriptional biomarker panel to sense cellular growth states in bacillus subtilis |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8036383/ https://www.ncbi.nlm.nih.gov/pubmed/33916259 http://dx.doi.org/10.3390/s21072436 |
work_keys_str_mv | AT huangyiming computationalstrategiesfortheidentificationofatranscriptionalbiomarkerpaneltosensecellulargrowthstatesinbacillussubtilis AT smithwendy computationalstrategiesfortheidentificationofatranscriptionalbiomarkerpaneltosensecellulargrowthstatesinbacillussubtilis AT harwoodcolin computationalstrategiesfortheidentificationofatranscriptionalbiomarkerpaneltosensecellulargrowthstatesinbacillussubtilis AT wipatanil computationalstrategiesfortheidentificationofatranscriptionalbiomarkerpaneltosensecellulargrowthstatesinbacillussubtilis AT bacarditjaume computationalstrategiesfortheidentificationofatranscriptionalbiomarkerpaneltosensecellulargrowthstatesinbacillussubtilis |