Cargando…

Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles

Machine learning approaches offer the potential to systematically identify transcriptional regulatory interactions from a compendium of microarray expression profiles. However, experimental validation of the performance of these methods at the genome scale has remained elusive. Here we assess the gl...

Descripción completa

Detalles Bibliográficos
Autores principales: Faith, Jeremiah J, Hayete, Boris, Thaden, Joshua T, Mogno, Ilaria, Wierzbowski, Jamey, Cottarel, Guillaume, Kasif, Simon, Collins, James J, Gardner, Timothy S
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1764438/
https://www.ncbi.nlm.nih.gov/pubmed/17214507
http://dx.doi.org/10.1371/journal.pbio.0050008
_version_ 1782131614731468800
author Faith, Jeremiah J
Hayete, Boris
Thaden, Joshua T
Mogno, Ilaria
Wierzbowski, Jamey
Cottarel, Guillaume
Kasif, Simon
Collins, James J
Gardner, Timothy S
author_facet Faith, Jeremiah J
Hayete, Boris
Thaden, Joshua T
Mogno, Ilaria
Wierzbowski, Jamey
Cottarel, Guillaume
Kasif, Simon
Collins, James J
Gardner, Timothy S
author_sort Faith, Jeremiah J
collection PubMed
description Machine learning approaches offer the potential to systematically identify transcriptional regulatory interactions from a compendium of microarray expression profiles. However, experimental validation of the performance of these methods at the genome scale has remained elusive. Here we assess the global performance of four existing classes of inference algorithms using 445 Escherichia coli Affymetrix arrays and 3,216 known E. coli regulatory interactions from RegulonDB. We also developed and applied the context likelihood of relatedness (CLR) algorithm, a novel extension of the relevance networks class of algorithms. CLR demonstrates an average precision gain of 36% relative to the next-best performing algorithm. At a 60% true positive rate, CLR identifies 1,079 regulatory interactions, of which 338 were in the previously known network and 741 were novel predictions. We tested the predicted interactions for three transcription factors with chromatin immunoprecipitation, confirming 21 novel interactions and verifying our RegulonDB-based performance estimates. CLR also identified a regulatory link providing central metabolic control of iron transport, which we confirmed with real-time quantitative PCR. The compendium of expression data compiled in this study, coupled with RegulonDB, provides a valuable model system for further improvement of network inference algorithms using experimental data.
format Text
id pubmed-1764438
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-17644382007-01-16 Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles Faith, Jeremiah J Hayete, Boris Thaden, Joshua T Mogno, Ilaria Wierzbowski, Jamey Cottarel, Guillaume Kasif, Simon Collins, James J Gardner, Timothy S PLoS Biol Research Article Machine learning approaches offer the potential to systematically identify transcriptional regulatory interactions from a compendium of microarray expression profiles. However, experimental validation of the performance of these methods at the genome scale has remained elusive. Here we assess the global performance of four existing classes of inference algorithms using 445 Escherichia coli Affymetrix arrays and 3,216 known E. coli regulatory interactions from RegulonDB. We also developed and applied the context likelihood of relatedness (CLR) algorithm, a novel extension of the relevance networks class of algorithms. CLR demonstrates an average precision gain of 36% relative to the next-best performing algorithm. At a 60% true positive rate, CLR identifies 1,079 regulatory interactions, of which 338 were in the previously known network and 741 were novel predictions. We tested the predicted interactions for three transcription factors with chromatin immunoprecipitation, confirming 21 novel interactions and verifying our RegulonDB-based performance estimates. CLR also identified a regulatory link providing central metabolic control of iron transport, which we confirmed with real-time quantitative PCR. The compendium of expression data compiled in this study, coupled with RegulonDB, provides a valuable model system for further improvement of network inference algorithms using experimental data. Public Library of Science 2007-01 2007-01-09 /pmc/articles/PMC1764438/ /pubmed/17214507 http://dx.doi.org/10.1371/journal.pbio.0050008 Text en © 2007 Faith et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Faith, Jeremiah J
Hayete, Boris
Thaden, Joshua T
Mogno, Ilaria
Wierzbowski, Jamey
Cottarel, Guillaume
Kasif, Simon
Collins, James J
Gardner, Timothy S
Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles
title Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles
title_full Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles
title_fullStr Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles
title_full_unstemmed Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles
title_short Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles
title_sort large-scale mapping and validation of escherichia coli transcriptional regulation from a compendium of expression profiles
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1764438/
https://www.ncbi.nlm.nih.gov/pubmed/17214507
http://dx.doi.org/10.1371/journal.pbio.0050008
work_keys_str_mv AT faithjeremiahj largescalemappingandvalidationofescherichiacolitranscriptionalregulationfromacompendiumofexpressionprofiles
AT hayeteboris largescalemappingandvalidationofescherichiacolitranscriptionalregulationfromacompendiumofexpressionprofiles
AT thadenjoshuat largescalemappingandvalidationofescherichiacolitranscriptionalregulationfromacompendiumofexpressionprofiles
AT mognoilaria largescalemappingandvalidationofescherichiacolitranscriptionalregulationfromacompendiumofexpressionprofiles
AT wierzbowskijamey largescalemappingandvalidationofescherichiacolitranscriptionalregulationfromacompendiumofexpressionprofiles
AT cottarelguillaume largescalemappingandvalidationofescherichiacolitranscriptionalregulationfromacompendiumofexpressionprofiles
AT kasifsimon largescalemappingandvalidationofescherichiacolitranscriptionalregulationfromacompendiumofexpressionprofiles
AT collinsjamesj largescalemappingandvalidationofescherichiacolitranscriptionalregulationfromacompendiumofexpressionprofiles
AT gardnertimothys largescalemappingandvalidationofescherichiacolitranscriptionalregulationfromacompendiumofexpressionprofiles