Cargando…
More robust detection of motifs in coexpressed genes by using phylogenetic information
BACKGROUND: Several motif detection algorithms have been developed to discover overrepresented motifs in sets of coexpressed genes. However, in a noisy gene list, the number of genes containing the motif versus the number lacking the motif might not be sufficiently high to allow detection by classic...
Autores principales: | , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1525208/ https://www.ncbi.nlm.nih.gov/pubmed/16549017 http://dx.doi.org/10.1186/1471-2105-7-160 |
_version_ | 1782128888124538880 |
---|---|
author | Monsieurs, Pieter Thijs, Gert Fadda, Abeer A De Keersmaecker, Sigrid CJ Vanderleyden, Jozef De Moor, Bart Marchal, Kathleen |
author_facet | Monsieurs, Pieter Thijs, Gert Fadda, Abeer A De Keersmaecker, Sigrid CJ Vanderleyden, Jozef De Moor, Bart Marchal, Kathleen |
author_sort | Monsieurs, Pieter |
collection | PubMed |
description | BACKGROUND: Several motif detection algorithms have been developed to discover overrepresented motifs in sets of coexpressed genes. However, in a noisy gene list, the number of genes containing the motif versus the number lacking the motif might not be sufficiently high to allow detection by classical motif detection tools. To still recover motifs which are not significantly enriched but still present, we developed a procedure in which we use phylogenetic footprinting to first delineate all potential motifs in each gene. Then we mutually compare all detected motifs and identify the ones that are shared by at least a few genes in the data set as potential candidates. RESULTS: We applied our methodology to a compiled test data set containing known regulatory motifs and to two biological data sets derived from genome wide expression studies. By executing four consecutive steps of 1) identifying conserved regions in orthologous intergenic regions, 2) aligning these conserved regions, 3) clustering the conserved regions containing similar regulatory regions followed by extraction of the regulatory motifs and 4) screening the input intergenic sequences with detected regulatory motif models, our methodology proves to be a powerful tool for detecting regulatory motifs when a low signal to noise ratio is present in the input data set. Comparing our results with two other motif detection algorithms points out the robustness of our algorithm. CONCLUSION: We developed an approach that can reliably identify multiple regulatory motifs lacking a high degree of overrepresentation in a set of coexpressed genes (motifs belonging to sparsely connected hubs in the regulatory network) by exploiting the advantages of using both coexpression and phylogenetic information. |
format | Text |
id | pubmed-1525208 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-15252082006-08-07 More robust detection of motifs in coexpressed genes by using phylogenetic information Monsieurs, Pieter Thijs, Gert Fadda, Abeer A De Keersmaecker, Sigrid CJ Vanderleyden, Jozef De Moor, Bart Marchal, Kathleen BMC Bioinformatics Methodology Article BACKGROUND: Several motif detection algorithms have been developed to discover overrepresented motifs in sets of coexpressed genes. However, in a noisy gene list, the number of genes containing the motif versus the number lacking the motif might not be sufficiently high to allow detection by classical motif detection tools. To still recover motifs which are not significantly enriched but still present, we developed a procedure in which we use phylogenetic footprinting to first delineate all potential motifs in each gene. Then we mutually compare all detected motifs and identify the ones that are shared by at least a few genes in the data set as potential candidates. RESULTS: We applied our methodology to a compiled test data set containing known regulatory motifs and to two biological data sets derived from genome wide expression studies. By executing four consecutive steps of 1) identifying conserved regions in orthologous intergenic regions, 2) aligning these conserved regions, 3) clustering the conserved regions containing similar regulatory regions followed by extraction of the regulatory motifs and 4) screening the input intergenic sequences with detected regulatory motif models, our methodology proves to be a powerful tool for detecting regulatory motifs when a low signal to noise ratio is present in the input data set. Comparing our results with two other motif detection algorithms points out the robustness of our algorithm. CONCLUSION: We developed an approach that can reliably identify multiple regulatory motifs lacking a high degree of overrepresentation in a set of coexpressed genes (motifs belonging to sparsely connected hubs in the regulatory network) by exploiting the advantages of using both coexpression and phylogenetic information. BioMed Central 2006-03-20 /pmc/articles/PMC1525208/ /pubmed/16549017 http://dx.doi.org/10.1186/1471-2105-7-160 Text en Copyright © 2006 Monsieurs et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Monsieurs, Pieter Thijs, Gert Fadda, Abeer A De Keersmaecker, Sigrid CJ Vanderleyden, Jozef De Moor, Bart Marchal, Kathleen More robust detection of motifs in coexpressed genes by using phylogenetic information |
title | More robust detection of motifs in coexpressed genes by using phylogenetic information |
title_full | More robust detection of motifs in coexpressed genes by using phylogenetic information |
title_fullStr | More robust detection of motifs in coexpressed genes by using phylogenetic information |
title_full_unstemmed | More robust detection of motifs in coexpressed genes by using phylogenetic information |
title_short | More robust detection of motifs in coexpressed genes by using phylogenetic information |
title_sort | more robust detection of motifs in coexpressed genes by using phylogenetic information |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1525208/ https://www.ncbi.nlm.nih.gov/pubmed/16549017 http://dx.doi.org/10.1186/1471-2105-7-160 |
work_keys_str_mv | AT monsieurspieter morerobustdetectionofmotifsincoexpressedgenesbyusingphylogeneticinformation AT thijsgert morerobustdetectionofmotifsincoexpressedgenesbyusingphylogeneticinformation AT faddaabeera morerobustdetectionofmotifsincoexpressedgenesbyusingphylogeneticinformation AT dekeersmaeckersigridcj morerobustdetectionofmotifsincoexpressedgenesbyusingphylogeneticinformation AT vanderleydenjozef morerobustdetectionofmotifsincoexpressedgenesbyusingphylogeneticinformation AT demoorbart morerobustdetectionofmotifsincoexpressedgenesbyusingphylogeneticinformation AT marchalkathleen morerobustdetectionofmotifsincoexpressedgenesbyusingphylogeneticinformation |