Cargando…

Evolutionary divergence and limits of conserved non-coding sequence detection in plant genomes

The discovery of regulatory motifs embedded in upstream regions of plants is a particularly challenging bioinformatics task. Previous studies have shown that motifs in plants are short compared with those found in vertebrates. Furthermore, plant genomes have undergone several diversification mechani...

Descripción completa

Detalles Bibliográficos
Autores principales: Reineke, Anna R., Bornberg-Bauer, Erich, Gu, Jenny
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3152334/
https://www.ncbi.nlm.nih.gov/pubmed/21470961
http://dx.doi.org/10.1093/nar/gkr179
_version_ 1782209756204630016
author Reineke, Anna R.
Bornberg-Bauer, Erich
Gu, Jenny
author_facet Reineke, Anna R.
Bornberg-Bauer, Erich
Gu, Jenny
author_sort Reineke, Anna R.
collection PubMed
description The discovery of regulatory motifs embedded in upstream regions of plants is a particularly challenging bioinformatics task. Previous studies have shown that motifs in plants are short compared with those found in vertebrates. Furthermore, plant genomes have undergone several diversification mechanisms such as genome duplication events which impact the evolution of regulatory motifs. In this article, a systematic phylogenomic comparison of upstream regions is conducted to further identify features of the plant regulatory genomes, the component of genomes regulating gene expression, to enable future de novo discoveries. The findings highlight differences in upstream region properties between major plant groups and the effects of divergence times and duplication events. First, clear differences in upstream region evolution can be detected between monocots and dicots, thus suggesting that a separation of these groups should be made when searching for novel regulatory motifs, particularly since universal motifs such as the TATA box are rare. Second, investigating the decay rate of significantly aligned regions suggests that a divergence time of ∼100 mya sets a limit for reliable conserved non-coding sequence (CNS) detection. Insights presented here will set a framework to help identify embedded motifs of functional relevance by understanding the limits of bioinformatics detection for CNSs.
format Online
Article
Text
id pubmed-3152334
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-31523342011-08-08 Evolutionary divergence and limits of conserved non-coding sequence detection in plant genomes Reineke, Anna R. Bornberg-Bauer, Erich Gu, Jenny Nucleic Acids Res Genomics The discovery of regulatory motifs embedded in upstream regions of plants is a particularly challenging bioinformatics task. Previous studies have shown that motifs in plants are short compared with those found in vertebrates. Furthermore, plant genomes have undergone several diversification mechanisms such as genome duplication events which impact the evolution of regulatory motifs. In this article, a systematic phylogenomic comparison of upstream regions is conducted to further identify features of the plant regulatory genomes, the component of genomes regulating gene expression, to enable future de novo discoveries. The findings highlight differences in upstream region properties between major plant groups and the effects of divergence times and duplication events. First, clear differences in upstream region evolution can be detected between monocots and dicots, thus suggesting that a separation of these groups should be made when searching for novel regulatory motifs, particularly since universal motifs such as the TATA box are rare. Second, investigating the decay rate of significantly aligned regions suggests that a divergence time of ∼100 mya sets a limit for reliable conserved non-coding sequence (CNS) detection. Insights presented here will set a framework to help identify embedded motifs of functional relevance by understanding the limits of bioinformatics detection for CNSs. Oxford University Press 2011-08 2011-04-05 /pmc/articles/PMC3152334/ /pubmed/21470961 http://dx.doi.org/10.1093/nar/gkr179 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genomics
Reineke, Anna R.
Bornberg-Bauer, Erich
Gu, Jenny
Evolutionary divergence and limits of conserved non-coding sequence detection in plant genomes
title Evolutionary divergence and limits of conserved non-coding sequence detection in plant genomes
title_full Evolutionary divergence and limits of conserved non-coding sequence detection in plant genomes
title_fullStr Evolutionary divergence and limits of conserved non-coding sequence detection in plant genomes
title_full_unstemmed Evolutionary divergence and limits of conserved non-coding sequence detection in plant genomes
title_short Evolutionary divergence and limits of conserved non-coding sequence detection in plant genomes
title_sort evolutionary divergence and limits of conserved non-coding sequence detection in plant genomes
topic Genomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3152334/
https://www.ncbi.nlm.nih.gov/pubmed/21470961
http://dx.doi.org/10.1093/nar/gkr179
work_keys_str_mv AT reinekeannar evolutionarydivergenceandlimitsofconservednoncodingsequencedetectioninplantgenomes
AT bornbergbauererich evolutionarydivergenceandlimitsofconservednoncodingsequencedetectioninplantgenomes
AT gujenny evolutionarydivergenceandlimitsofconservednoncodingsequencedetectioninplantgenomes