Cargando…
Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae
BACKGROUND: Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2770497/ https://www.ncbi.nlm.nih.gov/pubmed/19843335 http://dx.doi.org/10.1186/1471-2229-9-126 |
_version_ | 1782173669764628480 |
---|---|
author | Fauteux, François Strömvik, Martina V |
author_facet | Fauteux, François Strömvik, Martina V |
author_sort | Fauteux, François |
collection | PubMed |
description | BACKGROUND: Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. RESULTS: We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. CONCLUSION: Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination of conserved motifs. The majority of discovered motifs match experimentally characterized cis-regulatory elements. These results provide a good starting point for further experimental analysis of plant seed-specific promoters and our methodology can be used to unravel more transcriptional regulatory mechanisms in plants and other eukaryotes. |
format | Text |
id | pubmed-2770497 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-27704972009-10-30 Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae Fauteux, François Strömvik, Martina V BMC Plant Biol Research Article BACKGROUND: Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. RESULTS: We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. CONCLUSION: Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination of conserved motifs. The majority of discovered motifs match experimentally characterized cis-regulatory elements. These results provide a good starting point for further experimental analysis of plant seed-specific promoters and our methodology can be used to unravel more transcriptional regulatory mechanisms in plants and other eukaryotes. BioMed Central 2009-10-20 /pmc/articles/PMC2770497/ /pubmed/19843335 http://dx.doi.org/10.1186/1471-2229-9-126 Text en Copyright © 2009 Fauteux and Strömvik; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Fauteux, François Strömvik, Martina V Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae |
title | Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae |
title_full | Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae |
title_fullStr | Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae |
title_full_unstemmed | Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae |
title_short | Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae |
title_sort | seed storage protein gene promoters contain conserved dna motifs in brassicaceae, fabaceae and poaceae |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2770497/ https://www.ncbi.nlm.nih.gov/pubmed/19843335 http://dx.doi.org/10.1186/1471-2229-9-126 |
work_keys_str_mv | AT fauteuxfrancois seedstorageproteingenepromoterscontainconserveddnamotifsinbrassicaceaefabaceaeandpoaceae AT stromvikmartinav seedstorageproteingenepromoterscontainconserveddnamotifsinbrassicaceaefabaceaeandpoaceae |