Cargando…
Relating gene expression data on two-component systems to functional annotations in Escherichia coli
BACKGROUND: Obtaining physiological insights from microarray experiments requires computational techniques that relate gene expression data to functional information. Traditionally, this has been done in two consecutive steps. The first step identifies important genes through clustering or statistic...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2478693/ https://www.ncbi.nlm.nih.gov/pubmed/18578884 http://dx.doi.org/10.1186/1471-2105-9-294 |
_version_ | 1782157619933216768 |
---|---|
author | Denton, Anne M Wu, Jianfei Townsend, Megan K Sule, Preeti Prüß, Birgit M |
author_facet | Denton, Anne M Wu, Jianfei Townsend, Megan K Sule, Preeti Prüß, Birgit M |
author_sort | Denton, Anne M |
collection | PubMed |
description | BACKGROUND: Obtaining physiological insights from microarray experiments requires computational techniques that relate gene expression data to functional information. Traditionally, this has been done in two consecutive steps. The first step identifies important genes through clustering or statistical techniques, while the second step assigns biological functions to the identified groups. Recently, techniques have been developed that identify such relationships in a single step. RESULTS: We have developed an algorithm that relates patterns of gene expression in a set of microarray experiments to functional groups in one step. Our only assumption is that patterns co-occur frequently. The effectiveness of the algorithm is demonstrated as part of a study of regulation by two-component systems in Escherichia coli. The significance of the relationships between expression data and functional annotations is evaluated based on density histograms that are constructed using product similarity among expression vectors. We present a biological analysis of three of the resulting functional groups of proteins, develop hypotheses for further biological studies, and test one of these hypotheses experimentally. A comparison with other algorithms and a different data set is presented. CONCLUSION: Our new algorithm is able to find interesting and biologically meaningful relationships, not found by other algorithms, in previously analyzed data sets. Scaling of the algorithm to large data sets can be achieved based on a theoretical model. |
format | Text |
id | pubmed-2478693 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-24786932008-07-22 Relating gene expression data on two-component systems to functional annotations in Escherichia coli Denton, Anne M Wu, Jianfei Townsend, Megan K Sule, Preeti Prüß, Birgit M BMC Bioinformatics Research Article BACKGROUND: Obtaining physiological insights from microarray experiments requires computational techniques that relate gene expression data to functional information. Traditionally, this has been done in two consecutive steps. The first step identifies important genes through clustering or statistical techniques, while the second step assigns biological functions to the identified groups. Recently, techniques have been developed that identify such relationships in a single step. RESULTS: We have developed an algorithm that relates patterns of gene expression in a set of microarray experiments to functional groups in one step. Our only assumption is that patterns co-occur frequently. The effectiveness of the algorithm is demonstrated as part of a study of regulation by two-component systems in Escherichia coli. The significance of the relationships between expression data and functional annotations is evaluated based on density histograms that are constructed using product similarity among expression vectors. We present a biological analysis of three of the resulting functional groups of proteins, develop hypotheses for further biological studies, and test one of these hypotheses experimentally. A comparison with other algorithms and a different data set is presented. CONCLUSION: Our new algorithm is able to find interesting and biologically meaningful relationships, not found by other algorithms, in previously analyzed data sets. Scaling of the algorithm to large data sets can be achieved based on a theoretical model. BioMed Central 2008-06-25 /pmc/articles/PMC2478693/ /pubmed/18578884 http://dx.doi.org/10.1186/1471-2105-9-294 Text en Copyright © 2008 Denton et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Denton, Anne M Wu, Jianfei Townsend, Megan K Sule, Preeti Prüß, Birgit M Relating gene expression data on two-component systems to functional annotations in Escherichia coli |
title | Relating gene expression data on two-component systems to functional annotations in Escherichia coli |
title_full | Relating gene expression data on two-component systems to functional annotations in Escherichia coli |
title_fullStr | Relating gene expression data on two-component systems to functional annotations in Escherichia coli |
title_full_unstemmed | Relating gene expression data on two-component systems to functional annotations in Escherichia coli |
title_short | Relating gene expression data on two-component systems to functional annotations in Escherichia coli |
title_sort | relating gene expression data on two-component systems to functional annotations in escherichia coli |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2478693/ https://www.ncbi.nlm.nih.gov/pubmed/18578884 http://dx.doi.org/10.1186/1471-2105-9-294 |
work_keys_str_mv | AT dentonannem relatinggeneexpressiondataontwocomponentsystemstofunctionalannotationsinescherichiacoli AT wujianfei relatinggeneexpressiondataontwocomponentsystemstofunctionalannotationsinescherichiacoli AT townsendmegank relatinggeneexpressiondataontwocomponentsystemstofunctionalannotationsinescherichiacoli AT sulepreeti relatinggeneexpressiondataontwocomponentsystemstofunctionalannotationsinescherichiacoli AT prußbirgitm relatinggeneexpressiondataontwocomponentsystemstofunctionalannotationsinescherichiacoli |