Cargando…

Tuning promoter boundaries improves regulatory motif discovery in nonmodel plants: the peach example

The identification of functional elements encoded in plant genomes is necessary to understand gene regulation. Although much attention has been paid to model species like Arabidopsis (Arabidopsis thaliana), little is known about regulatory motifs in other plants. Here, we describe a bottom-up approa...

Descripción completa

Detalles Bibliográficos
Autores principales: Ksouri, Najla, Castro-Mondragón, Jaime A, Montardit-Tarda, Francesc, van Helden, Jacques, Contreras-Moreira, Bruno, Gogorcena, Yolanda
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8133646/
https://www.ncbi.nlm.nih.gov/pubmed/33744946
http://dx.doi.org/10.1093/plphys/kiaa091
_version_ 1783695099050328064
author Ksouri, Najla
Castro-Mondragón, Jaime A
Montardit-Tarda, Francesc
van Helden, Jacques
Contreras-Moreira, Bruno
Gogorcena, Yolanda
author_facet Ksouri, Najla
Castro-Mondragón, Jaime A
Montardit-Tarda, Francesc
van Helden, Jacques
Contreras-Moreira, Bruno
Gogorcena, Yolanda
author_sort Ksouri, Najla
collection PubMed
description The identification of functional elements encoded in plant genomes is necessary to understand gene regulation. Although much attention has been paid to model species like Arabidopsis (Arabidopsis thaliana), little is known about regulatory motifs in other plants. Here, we describe a bottom-up approach for de novo motif discovery using peach (Prunus persica) as an example. These predictions require pre-computed gene clusters grouped by their expression similarity. After optimizing the boundaries of proximal promoter regions, two motif discovery algorithms from RSAT::Plants (http://plants.rsat.eu) were tested (oligo and dyad analysis). Overall, 18 out of 45 co-expressed modules were enriched in motifs typical of well-known transcription factor (TF) families (bHLH, bZip, BZR, CAMTA, DOF, E2FE, AP2-ERF, Myb-like, NAC, TCP, and WRKY) and a few uncharacterized motifs. Our results indicate that small modules and promoter window of [–500 bp, +200 bp] relative to the transcription start site (TSS) maximize the number of motifs found and reduce low-complexity signals in peach. The distribution of discovered regulatory sites was unbalanced, as they accumulated around the TSS. This approach was benchmarked by testing two different expression-based clustering algorithms (network-based and hierarchical) and, as control, genes grouped for harboring ChIPseq peaks of the same Arabidopsis TF. The method was also verified on maize (Zea mays), a species with a large genome. In summary, this article presents a glimpse of the peach regulatory components at genome scale and provides a general protocol that can be applied to other species. A Docker software container is released to facilitate the reproduction of these analyses.
format Online
Article
Text
id pubmed-8133646
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-81336462021-05-25 Tuning promoter boundaries improves regulatory motif discovery in nonmodel plants: the peach example Ksouri, Najla Castro-Mondragón, Jaime A Montardit-Tarda, Francesc van Helden, Jacques Contreras-Moreira, Bruno Gogorcena, Yolanda Plant Physiol Regular Issue The identification of functional elements encoded in plant genomes is necessary to understand gene regulation. Although much attention has been paid to model species like Arabidopsis (Arabidopsis thaliana), little is known about regulatory motifs in other plants. Here, we describe a bottom-up approach for de novo motif discovery using peach (Prunus persica) as an example. These predictions require pre-computed gene clusters grouped by their expression similarity. After optimizing the boundaries of proximal promoter regions, two motif discovery algorithms from RSAT::Plants (http://plants.rsat.eu) were tested (oligo and dyad analysis). Overall, 18 out of 45 co-expressed modules were enriched in motifs typical of well-known transcription factor (TF) families (bHLH, bZip, BZR, CAMTA, DOF, E2FE, AP2-ERF, Myb-like, NAC, TCP, and WRKY) and a few uncharacterized motifs. Our results indicate that small modules and promoter window of [–500 bp, +200 bp] relative to the transcription start site (TSS) maximize the number of motifs found and reduce low-complexity signals in peach. The distribution of discovered regulatory sites was unbalanced, as they accumulated around the TSS. This approach was benchmarked by testing two different expression-based clustering algorithms (network-based and hierarchical) and, as control, genes grouped for harboring ChIPseq peaks of the same Arabidopsis TF. The method was also verified on maize (Zea mays), a species with a large genome. In summary, this article presents a glimpse of the peach regulatory components at genome scale and provides a general protocol that can be applied to other species. A Docker software container is released to facilitate the reproduction of these analyses. Oxford University Press 2021-01-13 /pmc/articles/PMC8133646/ /pubmed/33744946 http://dx.doi.org/10.1093/plphys/kiaa091 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of American Society of Plant Biologists. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) ), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Regular Issue
Ksouri, Najla
Castro-Mondragón, Jaime A
Montardit-Tarda, Francesc
van Helden, Jacques
Contreras-Moreira, Bruno
Gogorcena, Yolanda
Tuning promoter boundaries improves regulatory motif discovery in nonmodel plants: the peach example
title Tuning promoter boundaries improves regulatory motif discovery in nonmodel plants: the peach example
title_full Tuning promoter boundaries improves regulatory motif discovery in nonmodel plants: the peach example
title_fullStr Tuning promoter boundaries improves regulatory motif discovery in nonmodel plants: the peach example
title_full_unstemmed Tuning promoter boundaries improves regulatory motif discovery in nonmodel plants: the peach example
title_short Tuning promoter boundaries improves regulatory motif discovery in nonmodel plants: the peach example
title_sort tuning promoter boundaries improves regulatory motif discovery in nonmodel plants: the peach example
topic Regular Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8133646/
https://www.ncbi.nlm.nih.gov/pubmed/33744946
http://dx.doi.org/10.1093/plphys/kiaa091
work_keys_str_mv AT ksourinajla tuningpromoterboundariesimprovesregulatorymotifdiscoveryinnonmodelplantsthepeachexample
AT castromondragonjaimea tuningpromoterboundariesimprovesregulatorymotifdiscoveryinnonmodelplantsthepeachexample
AT montardittardafrancesc tuningpromoterboundariesimprovesregulatorymotifdiscoveryinnonmodelplantsthepeachexample
AT vanheldenjacques tuningpromoterboundariesimprovesregulatorymotifdiscoveryinnonmodelplantsthepeachexample
AT contrerasmoreirabruno tuningpromoterboundariesimprovesregulatorymotifdiscoveryinnonmodelplantsthepeachexample
AT gogorcenayolanda tuningpromoterboundariesimprovesregulatorymotifdiscoveryinnonmodelplantsthepeachexample