Cargando…

Predicting functional transcription factor binding through alignment-free and affinity-based analysis of orthologous promoter sequences

Motivation: The identification of transcription factor (TF) binding sites and the regulatory circuitry that they define is currently an area of intense research. Data from whole-genome chromatin immunoprecipitation (ChIP–chip), whole-genome expression microarrays, and sequencing of multiple closely...

Descripción completa

Detalles Bibliográficos
Autores principales: Ward, Lucas D., Bussemaker, Harmen J.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718632/
https://www.ncbi.nlm.nih.gov/pubmed/18586710
http://dx.doi.org/10.1093/bioinformatics/btn154
_version_ 1782170003837026304
author Ward, Lucas D.
Bussemaker, Harmen J.
author_facet Ward, Lucas D.
Bussemaker, Harmen J.
author_sort Ward, Lucas D.
collection PubMed
description Motivation: The identification of transcription factor (TF) binding sites and the regulatory circuitry that they define is currently an area of intense research. Data from whole-genome chromatin immunoprecipitation (ChIP–chip), whole-genome expression microarrays, and sequencing of multiple closely related genomes have all proven useful. By and large, existing methods treat the interpretation of functional data as a classification problem (between bound and unbound DNA), and the analysis of comparative data as a problem of local alignment (to recover phylogenetic footprints of presumably functional elements). Both of these approaches suffer from the inability to model and detect low-affinity binding sites, which have recently been shown to be abundant and functional. Results: We have developed a method that discovers functional regulatory targets of TFs by predicting the total affinity of each promoter for those factors and then comparing that affinity across orthologous promoters in closely related species. At each promoter, we consider the minimum affinity among orthologs to be the fraction of the affinity that is functional. Because we calculate the affinity of the entire promoter, our method is independent of local alignment. By comparing with functional annotation information and gene expression data in Saccharomyces cerevisiae, we have validated that this biophysically motivated use of evolutionary conservation gives rise to dramatic improvement in prediction of regulatory connectivity and factor–factor interactions compared to the use of a single genome. We propose novel biological functions for several yeast TFs, including the factors Snt2 and Stb4, for which no function has been reported. Our affinity-based approach towards comparative genomics may allow a more quantitative analysis of the principles governing the evolution of non-coding DNA. Availability: The MatrixREDUCE software package is available from http://www.bussemakerlab.org/software/MatrixREDUCE Contact: Harmen.Bussemaker@columbia.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2718632
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-27186322009-07-31 Predicting functional transcription factor binding through alignment-free and affinity-based analysis of orthologous promoter sequences Ward, Lucas D. Bussemaker, Harmen J. Bioinformatics Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto Motivation: The identification of transcription factor (TF) binding sites and the regulatory circuitry that they define is currently an area of intense research. Data from whole-genome chromatin immunoprecipitation (ChIP–chip), whole-genome expression microarrays, and sequencing of multiple closely related genomes have all proven useful. By and large, existing methods treat the interpretation of functional data as a classification problem (between bound and unbound DNA), and the analysis of comparative data as a problem of local alignment (to recover phylogenetic footprints of presumably functional elements). Both of these approaches suffer from the inability to model and detect low-affinity binding sites, which have recently been shown to be abundant and functional. Results: We have developed a method that discovers functional regulatory targets of TFs by predicting the total affinity of each promoter for those factors and then comparing that affinity across orthologous promoters in closely related species. At each promoter, we consider the minimum affinity among orthologs to be the fraction of the affinity that is functional. Because we calculate the affinity of the entire promoter, our method is independent of local alignment. By comparing with functional annotation information and gene expression data in Saccharomyces cerevisiae, we have validated that this biophysically motivated use of evolutionary conservation gives rise to dramatic improvement in prediction of regulatory connectivity and factor–factor interactions compared to the use of a single genome. We propose novel biological functions for several yeast TFs, including the factors Snt2 and Stb4, for which no function has been reported. Our affinity-based approach towards comparative genomics may allow a more quantitative analysis of the principles governing the evolution of non-coding DNA. Availability: The MatrixREDUCE software package is available from http://www.bussemakerlab.org/software/MatrixREDUCE Contact: Harmen.Bussemaker@columbia.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2008-07-01 /pmc/articles/PMC2718632/ /pubmed/18586710 http://dx.doi.org/10.1093/bioinformatics/btn154 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto
Ward, Lucas D.
Bussemaker, Harmen J.
Predicting functional transcription factor binding through alignment-free and affinity-based analysis of orthologous promoter sequences
title Predicting functional transcription factor binding through alignment-free and affinity-based analysis of orthologous promoter sequences
title_full Predicting functional transcription factor binding through alignment-free and affinity-based analysis of orthologous promoter sequences
title_fullStr Predicting functional transcription factor binding through alignment-free and affinity-based analysis of orthologous promoter sequences
title_full_unstemmed Predicting functional transcription factor binding through alignment-free and affinity-based analysis of orthologous promoter sequences
title_short Predicting functional transcription factor binding through alignment-free and affinity-based analysis of orthologous promoter sequences
title_sort predicting functional transcription factor binding through alignment-free and affinity-based analysis of orthologous promoter sequences
topic Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2718632/
https://www.ncbi.nlm.nih.gov/pubmed/18586710
http://dx.doi.org/10.1093/bioinformatics/btn154
work_keys_str_mv AT wardlucasd predictingfunctionaltranscriptionfactorbindingthroughalignmentfreeandaffinitybasedanalysisoforthologouspromotersequences
AT bussemakerharmenj predictingfunctionaltranscriptionfactorbindingthroughalignmentfreeandaffinitybasedanalysisoforthologouspromotersequences