Cargando…

POWRS: Position-Sensitive Motif Discovery

Transcription factors and the short, often degenerate DNA sequences they recognize are central regulators of gene expression, but their regulatory code is challenging to dissect experimentally. Thus, computational approaches have long been used to identify putative regulatory elements from the patte...

Descripción completa

Detalles Bibliográficos
Autores principales: Davis, Ian W., Benninger, Christopher, Benfey, Philip N., Elich, Tedd
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3390389/
https://www.ncbi.nlm.nih.gov/pubmed/22792292
http://dx.doi.org/10.1371/journal.pone.0040373
_version_ 1782237441932918784
author Davis, Ian W.
Benninger, Christopher
Benfey, Philip N.
Elich, Tedd
author_facet Davis, Ian W.
Benninger, Christopher
Benfey, Philip N.
Elich, Tedd
author_sort Davis, Ian W.
collection PubMed
description Transcription factors and the short, often degenerate DNA sequences they recognize are central regulators of gene expression, but their regulatory code is challenging to dissect experimentally. Thus, computational approaches have long been used to identify putative regulatory elements from the patterns in promoter sequences. Here we present a new algorithm “POWRS” (POsition-sensitive WoRd Set) for identifying regulatory sequence motifs, specifically developed to address two common shortcomings of existing algorithms. First, POWRS uses the position-specific enrichment of regulatory elements near transcription start sites to significantly increase sensitivity, while providing new information about the preferred localization of those elements. Second, POWRS forgoes position weight matrices for a discrete motif representation that appears more resistant to over-generalization. We apply this algorithm to discover sequences related to constitutive, high-level gene expression in the model plant Arabidopsis thaliana, and then experimentally validate the importance of those elements by systematically mutating two endogenous promoters and measuring the effect on gene expression levels. This provides a foundation for future efforts to rationally engineer gene expression in plants, a problem of great importance in developing biotech crop varieties. Availability: BSD-licensed Python code at http://grassrootsbio.com/papers/powrs/.
format Online
Article
Text
id pubmed-3390389
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33903892012-07-12 POWRS: Position-Sensitive Motif Discovery Davis, Ian W. Benninger, Christopher Benfey, Philip N. Elich, Tedd PLoS One Research Article Transcription factors and the short, often degenerate DNA sequences they recognize are central regulators of gene expression, but their regulatory code is challenging to dissect experimentally. Thus, computational approaches have long been used to identify putative regulatory elements from the patterns in promoter sequences. Here we present a new algorithm “POWRS” (POsition-sensitive WoRd Set) for identifying regulatory sequence motifs, specifically developed to address two common shortcomings of existing algorithms. First, POWRS uses the position-specific enrichment of regulatory elements near transcription start sites to significantly increase sensitivity, while providing new information about the preferred localization of those elements. Second, POWRS forgoes position weight matrices for a discrete motif representation that appears more resistant to over-generalization. We apply this algorithm to discover sequences related to constitutive, high-level gene expression in the model plant Arabidopsis thaliana, and then experimentally validate the importance of those elements by systematically mutating two endogenous promoters and measuring the effect on gene expression levels. This provides a foundation for future efforts to rationally engineer gene expression in plants, a problem of great importance in developing biotech crop varieties. Availability: BSD-licensed Python code at http://grassrootsbio.com/papers/powrs/. Public Library of Science 2012-07-05 /pmc/articles/PMC3390389/ /pubmed/22792292 http://dx.doi.org/10.1371/journal.pone.0040373 Text en Davis et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Davis, Ian W.
Benninger, Christopher
Benfey, Philip N.
Elich, Tedd
POWRS: Position-Sensitive Motif Discovery
title POWRS: Position-Sensitive Motif Discovery
title_full POWRS: Position-Sensitive Motif Discovery
title_fullStr POWRS: Position-Sensitive Motif Discovery
title_full_unstemmed POWRS: Position-Sensitive Motif Discovery
title_short POWRS: Position-Sensitive Motif Discovery
title_sort powrs: position-sensitive motif discovery
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3390389/
https://www.ncbi.nlm.nih.gov/pubmed/22792292
http://dx.doi.org/10.1371/journal.pone.0040373
work_keys_str_mv AT davisianw powrspositionsensitivemotifdiscovery
AT benningerchristopher powrspositionsensitivemotifdiscovery
AT benfeyphilipn powrspositionsensitivemotifdiscovery
AT elichtedd powrspositionsensitivemotifdiscovery