Cargando…
POWRS: Position-Sensitive Motif Discovery
Transcription factors and the short, often degenerate DNA sequences they recognize are central regulators of gene expression, but their regulatory code is challenging to dissect experimentally. Thus, computational approaches have long been used to identify putative regulatory elements from the patte...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3390389/ https://www.ncbi.nlm.nih.gov/pubmed/22792292 http://dx.doi.org/10.1371/journal.pone.0040373 |
_version_ | 1782237441932918784 |
---|---|
author | Davis, Ian W. Benninger, Christopher Benfey, Philip N. Elich, Tedd |
author_facet | Davis, Ian W. Benninger, Christopher Benfey, Philip N. Elich, Tedd |
author_sort | Davis, Ian W. |
collection | PubMed |
description | Transcription factors and the short, often degenerate DNA sequences they recognize are central regulators of gene expression, but their regulatory code is challenging to dissect experimentally. Thus, computational approaches have long been used to identify putative regulatory elements from the patterns in promoter sequences. Here we present a new algorithm “POWRS” (POsition-sensitive WoRd Set) for identifying regulatory sequence motifs, specifically developed to address two common shortcomings of existing algorithms. First, POWRS uses the position-specific enrichment of regulatory elements near transcription start sites to significantly increase sensitivity, while providing new information about the preferred localization of those elements. Second, POWRS forgoes position weight matrices for a discrete motif representation that appears more resistant to over-generalization. We apply this algorithm to discover sequences related to constitutive, high-level gene expression in the model plant Arabidopsis thaliana, and then experimentally validate the importance of those elements by systematically mutating two endogenous promoters and measuring the effect on gene expression levels. This provides a foundation for future efforts to rationally engineer gene expression in plants, a problem of great importance in developing biotech crop varieties. Availability: BSD-licensed Python code at http://grassrootsbio.com/papers/powrs/. |
format | Online Article Text |
id | pubmed-3390389 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-33903892012-07-12 POWRS: Position-Sensitive Motif Discovery Davis, Ian W. Benninger, Christopher Benfey, Philip N. Elich, Tedd PLoS One Research Article Transcription factors and the short, often degenerate DNA sequences they recognize are central regulators of gene expression, but their regulatory code is challenging to dissect experimentally. Thus, computational approaches have long been used to identify putative regulatory elements from the patterns in promoter sequences. Here we present a new algorithm “POWRS” (POsition-sensitive WoRd Set) for identifying regulatory sequence motifs, specifically developed to address two common shortcomings of existing algorithms. First, POWRS uses the position-specific enrichment of regulatory elements near transcription start sites to significantly increase sensitivity, while providing new information about the preferred localization of those elements. Second, POWRS forgoes position weight matrices for a discrete motif representation that appears more resistant to over-generalization. We apply this algorithm to discover sequences related to constitutive, high-level gene expression in the model plant Arabidopsis thaliana, and then experimentally validate the importance of those elements by systematically mutating two endogenous promoters and measuring the effect on gene expression levels. This provides a foundation for future efforts to rationally engineer gene expression in plants, a problem of great importance in developing biotech crop varieties. Availability: BSD-licensed Python code at http://grassrootsbio.com/papers/powrs/. Public Library of Science 2012-07-05 /pmc/articles/PMC3390389/ /pubmed/22792292 http://dx.doi.org/10.1371/journal.pone.0040373 Text en Davis et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Davis, Ian W. Benninger, Christopher Benfey, Philip N. Elich, Tedd POWRS: Position-Sensitive Motif Discovery |
title | POWRS: Position-Sensitive Motif Discovery |
title_full | POWRS: Position-Sensitive Motif Discovery |
title_fullStr | POWRS: Position-Sensitive Motif Discovery |
title_full_unstemmed | POWRS: Position-Sensitive Motif Discovery |
title_short | POWRS: Position-Sensitive Motif Discovery |
title_sort | powrs: position-sensitive motif discovery |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3390389/ https://www.ncbi.nlm.nih.gov/pubmed/22792292 http://dx.doi.org/10.1371/journal.pone.0040373 |
work_keys_str_mv | AT davisianw powrspositionsensitivemotifdiscovery AT benningerchristopher powrspositionsensitivemotifdiscovery AT benfeyphilipn powrspositionsensitivemotifdiscovery AT elichtedd powrspositionsensitivemotifdiscovery |