Cargando…

Relating gene expression data on two-component systems to functional annotations in Escherichia coli

BACKGROUND: Obtaining physiological insights from microarray experiments requires computational techniques that relate gene expression data to functional information. Traditionally, this has been done in two consecutive steps. The first step identifies important genes through clustering or statistic...

Descripción completa

Detalles Bibliográficos
Autores principales: Denton, Anne M, Wu, Jianfei, Townsend, Megan K, Sule, Preeti, Prüß, Birgit M
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2478693/
https://www.ncbi.nlm.nih.gov/pubmed/18578884
http://dx.doi.org/10.1186/1471-2105-9-294
_version_ 1782157619933216768
author Denton, Anne M
Wu, Jianfei
Townsend, Megan K
Sule, Preeti
Prüß, Birgit M
author_facet Denton, Anne M
Wu, Jianfei
Townsend, Megan K
Sule, Preeti
Prüß, Birgit M
author_sort Denton, Anne M
collection PubMed
description BACKGROUND: Obtaining physiological insights from microarray experiments requires computational techniques that relate gene expression data to functional information. Traditionally, this has been done in two consecutive steps. The first step identifies important genes through clustering or statistical techniques, while the second step assigns biological functions to the identified groups. Recently, techniques have been developed that identify such relationships in a single step. RESULTS: We have developed an algorithm that relates patterns of gene expression in a set of microarray experiments to functional groups in one step. Our only assumption is that patterns co-occur frequently. The effectiveness of the algorithm is demonstrated as part of a study of regulation by two-component systems in Escherichia coli. The significance of the relationships between expression data and functional annotations is evaluated based on density histograms that are constructed using product similarity among expression vectors. We present a biological analysis of three of the resulting functional groups of proteins, develop hypotheses for further biological studies, and test one of these hypotheses experimentally. A comparison with other algorithms and a different data set is presented. CONCLUSION: Our new algorithm is able to find interesting and biologically meaningful relationships, not found by other algorithms, in previously analyzed data sets. Scaling of the algorithm to large data sets can be achieved based on a theoretical model.
format Text
id pubmed-2478693
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-24786932008-07-22 Relating gene expression data on two-component systems to functional annotations in Escherichia coli Denton, Anne M Wu, Jianfei Townsend, Megan K Sule, Preeti Prüß, Birgit M BMC Bioinformatics Research Article BACKGROUND: Obtaining physiological insights from microarray experiments requires computational techniques that relate gene expression data to functional information. Traditionally, this has been done in two consecutive steps. The first step identifies important genes through clustering or statistical techniques, while the second step assigns biological functions to the identified groups. Recently, techniques have been developed that identify such relationships in a single step. RESULTS: We have developed an algorithm that relates patterns of gene expression in a set of microarray experiments to functional groups in one step. Our only assumption is that patterns co-occur frequently. The effectiveness of the algorithm is demonstrated as part of a study of regulation by two-component systems in Escherichia coli. The significance of the relationships between expression data and functional annotations is evaluated based on density histograms that are constructed using product similarity among expression vectors. We present a biological analysis of three of the resulting functional groups of proteins, develop hypotheses for further biological studies, and test one of these hypotheses experimentally. A comparison with other algorithms and a different data set is presented. CONCLUSION: Our new algorithm is able to find interesting and biologically meaningful relationships, not found by other algorithms, in previously analyzed data sets. Scaling of the algorithm to large data sets can be achieved based on a theoretical model. BioMed Central 2008-06-25 /pmc/articles/PMC2478693/ /pubmed/18578884 http://dx.doi.org/10.1186/1471-2105-9-294 Text en Copyright © 2008 Denton et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Denton, Anne M
Wu, Jianfei
Townsend, Megan K
Sule, Preeti
Prüß, Birgit M
Relating gene expression data on two-component systems to functional annotations in Escherichia coli
title Relating gene expression data on two-component systems to functional annotations in Escherichia coli
title_full Relating gene expression data on two-component systems to functional annotations in Escherichia coli
title_fullStr Relating gene expression data on two-component systems to functional annotations in Escherichia coli
title_full_unstemmed Relating gene expression data on two-component systems to functional annotations in Escherichia coli
title_short Relating gene expression data on two-component systems to functional annotations in Escherichia coli
title_sort relating gene expression data on two-component systems to functional annotations in escherichia coli
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2478693/
https://www.ncbi.nlm.nih.gov/pubmed/18578884
http://dx.doi.org/10.1186/1471-2105-9-294
work_keys_str_mv AT dentonannem relatinggeneexpressiondataontwocomponentsystemstofunctionalannotationsinescherichiacoli
AT wujianfei relatinggeneexpressiondataontwocomponentsystemstofunctionalannotationsinescherichiacoli
AT townsendmegank relatinggeneexpressiondataontwocomponentsystemstofunctionalannotationsinescherichiacoli
AT sulepreeti relatinggeneexpressiondataontwocomponentsystemstofunctionalannotationsinescherichiacoli
AT prußbirgitm relatinggeneexpressiondataontwocomponentsystemstofunctionalannotationsinescherichiacoli