Cargando…

A cis-regulatory logic simulator

BACKGROUND: A major goal of computational studies of gene regulation is to accurately predict the expression of genes based on the cis-regulatory content of their promoters. The development of computational methods to decode the interactions among cis-regulatory elements has been slow, in part, beca...

Descripción completa

Detalles Bibliográficos
Autores principales: Zeigler, Robert D, Gertz, Jason, Cohen, Barak A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2375358/
https://www.ncbi.nlm.nih.gov/pubmed/17662143
http://dx.doi.org/10.1186/1471-2105-8-272
_version_ 1782154637653049344
author Zeigler, Robert D
Gertz, Jason
Cohen, Barak A
author_facet Zeigler, Robert D
Gertz, Jason
Cohen, Barak A
author_sort Zeigler, Robert D
collection PubMed
description BACKGROUND: A major goal of computational studies of gene regulation is to accurately predict the expression of genes based on the cis-regulatory content of their promoters. The development of computational methods to decode the interactions among cis-regulatory elements has been slow, in part, because it is difficult to know, without extensive experimental validation, whether a particular method identifies the correct cis-regulatory interactions that underlie a given set of expression data. There is an urgent need for test expression data in which the interactions among cis-regulatory sites that produce the data are known. The ability to rapidly generate such data sets would facilitate the development and comparison of computational methods that predict gene expression patterns from promoter sequence. RESULTS: We developed a gene expression simulator which generates expression data using user-defined interactions between cis-regulatory sites. The simulator can incorporate additive, cooperative, competitive, and synergistic interactions between regulatory elements. Constraints on the spacing, distance, and orientation of regulatory elements and their interactions may also be defined and Gaussian noise can be added to the expression values. The simulator allows for a data transformation that simulates the sigmoid shape of expression levels from real promoters. We found good agreement between sets of simulated promoters and predicted regulatory modules from real expression data. We present several data sets that may be useful for testing new methodologies for predicting gene expression from promoter sequence. CONCLUSION: We developed a flexible gene expression simulator that rapidly generates large numbers of simulated promoters and their corresponding transcriptional output based on specified interactions between cis-regulatory sites. When appropriate rule sets are used, the data generated by our simulator faithfully reproduces experimentally derived data sets. We anticipate that using simulated gene expression data sets will facilitate the direct comparison of computational strategies to predict gene expression from promoter sequence. The source code is available online and as additional material. The test sets are available as additional material.
format Text
id pubmed-2375358
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23753582008-05-09 A cis-regulatory logic simulator Zeigler, Robert D Gertz, Jason Cohen, Barak A BMC Bioinformatics Methodology Article BACKGROUND: A major goal of computational studies of gene regulation is to accurately predict the expression of genes based on the cis-regulatory content of their promoters. The development of computational methods to decode the interactions among cis-regulatory elements has been slow, in part, because it is difficult to know, without extensive experimental validation, whether a particular method identifies the correct cis-regulatory interactions that underlie a given set of expression data. There is an urgent need for test expression data in which the interactions among cis-regulatory sites that produce the data are known. The ability to rapidly generate such data sets would facilitate the development and comparison of computational methods that predict gene expression patterns from promoter sequence. RESULTS: We developed a gene expression simulator which generates expression data using user-defined interactions between cis-regulatory sites. The simulator can incorporate additive, cooperative, competitive, and synergistic interactions between regulatory elements. Constraints on the spacing, distance, and orientation of regulatory elements and their interactions may also be defined and Gaussian noise can be added to the expression values. The simulator allows for a data transformation that simulates the sigmoid shape of expression levels from real promoters. We found good agreement between sets of simulated promoters and predicted regulatory modules from real expression data. We present several data sets that may be useful for testing new methodologies for predicting gene expression from promoter sequence. CONCLUSION: We developed a flexible gene expression simulator that rapidly generates large numbers of simulated promoters and their corresponding transcriptional output based on specified interactions between cis-regulatory sites. When appropriate rule sets are used, the data generated by our simulator faithfully reproduces experimentally derived data sets. We anticipate that using simulated gene expression data sets will facilitate the direct comparison of computational strategies to predict gene expression from promoter sequence. The source code is available online and as additional material. The test sets are available as additional material. BioMed Central 2007-07-27 /pmc/articles/PMC2375358/ /pubmed/17662143 http://dx.doi.org/10.1186/1471-2105-8-272 Text en Copyright © 2007 Zeigler et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Zeigler, Robert D
Gertz, Jason
Cohen, Barak A
A cis-regulatory logic simulator
title A cis-regulatory logic simulator
title_full A cis-regulatory logic simulator
title_fullStr A cis-regulatory logic simulator
title_full_unstemmed A cis-regulatory logic simulator
title_short A cis-regulatory logic simulator
title_sort cis-regulatory logic simulator
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2375358/
https://www.ncbi.nlm.nih.gov/pubmed/17662143
http://dx.doi.org/10.1186/1471-2105-8-272
work_keys_str_mv AT zeiglerrobertd acisregulatorylogicsimulator
AT gertzjason acisregulatorylogicsimulator
AT cohenbaraka acisregulatorylogicsimulator
AT zeiglerrobertd cisregulatorylogicsimulator
AT gertzjason cisregulatorylogicsimulator
AT cohenbaraka cisregulatorylogicsimulator