Cargando…
iTAP: integrated transcriptomics and phenotype database for stress response of Escherichia coli and Saccharomyces cerevisiae
BACKGROUND: Organisms are subject to various stress conditions, which affect both the organism’s gene expression and phenotype. It is critical to understand microbial responses to stress conditions and uncover the underlying molecular mechanisms. To this end, it is necessary to build a database that...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4676837/ https://www.ncbi.nlm.nih.gov/pubmed/26653323 http://dx.doi.org/10.1186/s13104-015-1759-7 |
_version_ | 1782405239344726016 |
---|---|
author | Sundararaman, Niveda Ash, Christine Guo, Weihua Button, Rebecca Singh, Jugroop Feng, Xueyang |
author_facet | Sundararaman, Niveda Ash, Christine Guo, Weihua Button, Rebecca Singh, Jugroop Feng, Xueyang |
author_sort | Sundararaman, Niveda |
collection | PubMed |
description | BACKGROUND: Organisms are subject to various stress conditions, which affect both the organism’s gene expression and phenotype. It is critical to understand microbial responses to stress conditions and uncover the underlying molecular mechanisms. To this end, it is necessary to build a database that collects transcriptomics and phenotypic data of microbes growing under various stress factors for in-depth systems biology analysis. Despite of numerous databases that collect gene expression profiles, to our best knowledge, there are few, if any, databases that collect both transcriptomics and phenotype data simultaneously. In light of this, we have developed an open source, web-based database, namely integrated transcriptomics and phenotype (iTAP) database, that records and links the transcriptomics and phenotype data for two model microorganisms, Escherichia coli and Saccharomyces cerevisiae in response to exposure of various stress conditions. RESULTS: To collect the data, we chose relevant research papers from the PubMed database containing all the necessary information for data curation including experimental conditions, transcriptomics data, and phenotype data. The transcriptomics data, including the p value and fold change, were obtained through the comparison of test strains against control strains using Gene Expression Omnibus’s GEO2R analyzer. The phenotype data, including the cell growth rate and the productivity, volumetric rate, and mass-based yield of byproducts, were calculated independently from charts or graphs within the reference papers. Since the phenotype data was never reported in a standardized format, the curation of correlated transcriptomics–phenotype datasets became extremely tedious and time-consuming. Despite the challenges, till now, we successfully correlated 57 and 143 datasets of transcriptomics and phenotype for E. coli and S. cerevisiae, respectively, and applied a regression model within the iTAP database to accurately predict over 93 and 73 % of the growth rates of E. coli and S. cerevisiae, respectively, directly from the transcriptomics data. CONCLUSION: This is the first time that transcriptomics and phenotype data are categorized and correlated in an open-source database. This allows biologists to access the database and utilize it to predict the phenotype of microorganisms from their transcriptomics data. The iTAP database is freely available at https://sites.google.com/a/vt.edu/biomolecular-engineering-lab/software. |
format | Online Article Text |
id | pubmed-4676837 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-46768372015-12-13 iTAP: integrated transcriptomics and phenotype database for stress response of Escherichia coli and Saccharomyces cerevisiae Sundararaman, Niveda Ash, Christine Guo, Weihua Button, Rebecca Singh, Jugroop Feng, Xueyang BMC Res Notes Technical Note BACKGROUND: Organisms are subject to various stress conditions, which affect both the organism’s gene expression and phenotype. It is critical to understand microbial responses to stress conditions and uncover the underlying molecular mechanisms. To this end, it is necessary to build a database that collects transcriptomics and phenotypic data of microbes growing under various stress factors for in-depth systems biology analysis. Despite of numerous databases that collect gene expression profiles, to our best knowledge, there are few, if any, databases that collect both transcriptomics and phenotype data simultaneously. In light of this, we have developed an open source, web-based database, namely integrated transcriptomics and phenotype (iTAP) database, that records and links the transcriptomics and phenotype data for two model microorganisms, Escherichia coli and Saccharomyces cerevisiae in response to exposure of various stress conditions. RESULTS: To collect the data, we chose relevant research papers from the PubMed database containing all the necessary information for data curation including experimental conditions, transcriptomics data, and phenotype data. The transcriptomics data, including the p value and fold change, were obtained through the comparison of test strains against control strains using Gene Expression Omnibus’s GEO2R analyzer. The phenotype data, including the cell growth rate and the productivity, volumetric rate, and mass-based yield of byproducts, were calculated independently from charts or graphs within the reference papers. Since the phenotype data was never reported in a standardized format, the curation of correlated transcriptomics–phenotype datasets became extremely tedious and time-consuming. Despite the challenges, till now, we successfully correlated 57 and 143 datasets of transcriptomics and phenotype for E. coli and S. cerevisiae, respectively, and applied a regression model within the iTAP database to accurately predict over 93 and 73 % of the growth rates of E. coli and S. cerevisiae, respectively, directly from the transcriptomics data. CONCLUSION: This is the first time that transcriptomics and phenotype data are categorized and correlated in an open-source database. This allows biologists to access the database and utilize it to predict the phenotype of microorganisms from their transcriptomics data. The iTAP database is freely available at https://sites.google.com/a/vt.edu/biomolecular-engineering-lab/software. BioMed Central 2015-12-12 /pmc/articles/PMC4676837/ /pubmed/26653323 http://dx.doi.org/10.1186/s13104-015-1759-7 Text en © Sundararaman et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Technical Note Sundararaman, Niveda Ash, Christine Guo, Weihua Button, Rebecca Singh, Jugroop Feng, Xueyang iTAP: integrated transcriptomics and phenotype database for stress response of Escherichia coli and Saccharomyces cerevisiae |
title | iTAP: integrated transcriptomics and phenotype database for stress response of Escherichia coli and Saccharomyces cerevisiae |
title_full | iTAP: integrated transcriptomics and phenotype database for stress response of Escherichia coli and Saccharomyces cerevisiae |
title_fullStr | iTAP: integrated transcriptomics and phenotype database for stress response of Escherichia coli and Saccharomyces cerevisiae |
title_full_unstemmed | iTAP: integrated transcriptomics and phenotype database for stress response of Escherichia coli and Saccharomyces cerevisiae |
title_short | iTAP: integrated transcriptomics and phenotype database for stress response of Escherichia coli and Saccharomyces cerevisiae |
title_sort | itap: integrated transcriptomics and phenotype database for stress response of escherichia coli and saccharomyces cerevisiae |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4676837/ https://www.ncbi.nlm.nih.gov/pubmed/26653323 http://dx.doi.org/10.1186/s13104-015-1759-7 |
work_keys_str_mv | AT sundararamanniveda itapintegratedtranscriptomicsandphenotypedatabaseforstressresponseofescherichiacoliandsaccharomycescerevisiae AT ashchristine itapintegratedtranscriptomicsandphenotypedatabaseforstressresponseofescherichiacoliandsaccharomycescerevisiae AT guoweihua itapintegratedtranscriptomicsandphenotypedatabaseforstressresponseofescherichiacoliandsaccharomycescerevisiae AT buttonrebecca itapintegratedtranscriptomicsandphenotypedatabaseforstressresponseofescherichiacoliandsaccharomycescerevisiae AT singhjugroop itapintegratedtranscriptomicsandphenotypedatabaseforstressresponseofescherichiacoliandsaccharomycescerevisiae AT fengxueyang itapintegratedtranscriptomicsandphenotypedatabaseforstressresponseofescherichiacoliandsaccharomycescerevisiae |