Cargando…

More robust detection of motifs in coexpressed genes by using phylogenetic information

BACKGROUND: Several motif detection algorithms have been developed to discover overrepresented motifs in sets of coexpressed genes. However, in a noisy gene list, the number of genes containing the motif versus the number lacking the motif might not be sufficiently high to allow detection by classic...

Descripción completa

Detalles Bibliográficos
Autores principales: Monsieurs, Pieter, Thijs, Gert, Fadda, Abeer A, De Keersmaecker, Sigrid CJ, Vanderleyden, Jozef, De Moor, Bart, Marchal, Kathleen
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1525208/
https://www.ncbi.nlm.nih.gov/pubmed/16549017
http://dx.doi.org/10.1186/1471-2105-7-160
_version_ 1782128888124538880
author Monsieurs, Pieter
Thijs, Gert
Fadda, Abeer A
De Keersmaecker, Sigrid CJ
Vanderleyden, Jozef
De Moor, Bart
Marchal, Kathleen
author_facet Monsieurs, Pieter
Thijs, Gert
Fadda, Abeer A
De Keersmaecker, Sigrid CJ
Vanderleyden, Jozef
De Moor, Bart
Marchal, Kathleen
author_sort Monsieurs, Pieter
collection PubMed
description BACKGROUND: Several motif detection algorithms have been developed to discover overrepresented motifs in sets of coexpressed genes. However, in a noisy gene list, the number of genes containing the motif versus the number lacking the motif might not be sufficiently high to allow detection by classical motif detection tools. To still recover motifs which are not significantly enriched but still present, we developed a procedure in which we use phylogenetic footprinting to first delineate all potential motifs in each gene. Then we mutually compare all detected motifs and identify the ones that are shared by at least a few genes in the data set as potential candidates. RESULTS: We applied our methodology to a compiled test data set containing known regulatory motifs and to two biological data sets derived from genome wide expression studies. By executing four consecutive steps of 1) identifying conserved regions in orthologous intergenic regions, 2) aligning these conserved regions, 3) clustering the conserved regions containing similar regulatory regions followed by extraction of the regulatory motifs and 4) screening the input intergenic sequences with detected regulatory motif models, our methodology proves to be a powerful tool for detecting regulatory motifs when a low signal to noise ratio is present in the input data set. Comparing our results with two other motif detection algorithms points out the robustness of our algorithm. CONCLUSION: We developed an approach that can reliably identify multiple regulatory motifs lacking a high degree of overrepresentation in a set of coexpressed genes (motifs belonging to sparsely connected hubs in the regulatory network) by exploiting the advantages of using both coexpression and phylogenetic information.
format Text
id pubmed-1525208
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15252082006-08-07 More robust detection of motifs in coexpressed genes by using phylogenetic information Monsieurs, Pieter Thijs, Gert Fadda, Abeer A De Keersmaecker, Sigrid CJ Vanderleyden, Jozef De Moor, Bart Marchal, Kathleen BMC Bioinformatics Methodology Article BACKGROUND: Several motif detection algorithms have been developed to discover overrepresented motifs in sets of coexpressed genes. However, in a noisy gene list, the number of genes containing the motif versus the number lacking the motif might not be sufficiently high to allow detection by classical motif detection tools. To still recover motifs which are not significantly enriched but still present, we developed a procedure in which we use phylogenetic footprinting to first delineate all potential motifs in each gene. Then we mutually compare all detected motifs and identify the ones that are shared by at least a few genes in the data set as potential candidates. RESULTS: We applied our methodology to a compiled test data set containing known regulatory motifs and to two biological data sets derived from genome wide expression studies. By executing four consecutive steps of 1) identifying conserved regions in orthologous intergenic regions, 2) aligning these conserved regions, 3) clustering the conserved regions containing similar regulatory regions followed by extraction of the regulatory motifs and 4) screening the input intergenic sequences with detected regulatory motif models, our methodology proves to be a powerful tool for detecting regulatory motifs when a low signal to noise ratio is present in the input data set. Comparing our results with two other motif detection algorithms points out the robustness of our algorithm. CONCLUSION: We developed an approach that can reliably identify multiple regulatory motifs lacking a high degree of overrepresentation in a set of coexpressed genes (motifs belonging to sparsely connected hubs in the regulatory network) by exploiting the advantages of using both coexpression and phylogenetic information. BioMed Central 2006-03-20 /pmc/articles/PMC1525208/ /pubmed/16549017 http://dx.doi.org/10.1186/1471-2105-7-160 Text en Copyright © 2006 Monsieurs et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Monsieurs, Pieter
Thijs, Gert
Fadda, Abeer A
De Keersmaecker, Sigrid CJ
Vanderleyden, Jozef
De Moor, Bart
Marchal, Kathleen
More robust detection of motifs in coexpressed genes by using phylogenetic information
title More robust detection of motifs in coexpressed genes by using phylogenetic information
title_full More robust detection of motifs in coexpressed genes by using phylogenetic information
title_fullStr More robust detection of motifs in coexpressed genes by using phylogenetic information
title_full_unstemmed More robust detection of motifs in coexpressed genes by using phylogenetic information
title_short More robust detection of motifs in coexpressed genes by using phylogenetic information
title_sort more robust detection of motifs in coexpressed genes by using phylogenetic information
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1525208/
https://www.ncbi.nlm.nih.gov/pubmed/16549017
http://dx.doi.org/10.1186/1471-2105-7-160
work_keys_str_mv AT monsieurspieter morerobustdetectionofmotifsincoexpressedgenesbyusingphylogeneticinformation
AT thijsgert morerobustdetectionofmotifsincoexpressedgenesbyusingphylogeneticinformation
AT faddaabeera morerobustdetectionofmotifsincoexpressedgenesbyusingphylogeneticinformation
AT dekeersmaeckersigridcj morerobustdetectionofmotifsincoexpressedgenesbyusingphylogeneticinformation
AT vanderleydenjozef morerobustdetectionofmotifsincoexpressedgenesbyusingphylogeneticinformation
AT demoorbart morerobustdetectionofmotifsincoexpressedgenesbyusingphylogeneticinformation
AT marchalkathleen morerobustdetectionofmotifsincoexpressedgenesbyusingphylogeneticinformation