Cargando…

Predicting the binding preference of transcription factors to individual DNA k-mers

Motivation: Recognition of specific DNA sequences is a central mechanism by which transcription factors (TFs) control gene expression. Many TF-binding preferences, however, are unknown or poorly characterized, in part due to the difficulty associated with determining their specificity experimentally...

Descripción completa

Detalles Bibliográficos
Autores principales: Alleyne, Trevis M., Peña-Castillo, Lourdes, Badis, Gwenael, Talukder, Shaheynoor, Berger, Michael F., Gehrke, Andrew R., Philippakis, Anthony A., Bulyk, Martha L., Morris, Quaid D., Hughes, Timothy R.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2666811/
https://www.ncbi.nlm.nih.gov/pubmed/19088121
http://dx.doi.org/10.1093/bioinformatics/btn645
_version_ 1782166083398008832
author Alleyne, Trevis M.
Peña-Castillo, Lourdes
Badis, Gwenael
Talukder, Shaheynoor
Berger, Michael F.
Gehrke, Andrew R.
Philippakis, Anthony A.
Bulyk, Martha L.
Morris, Quaid D.
Hughes, Timothy R.
author_facet Alleyne, Trevis M.
Peña-Castillo, Lourdes
Badis, Gwenael
Talukder, Shaheynoor
Berger, Michael F.
Gehrke, Andrew R.
Philippakis, Anthony A.
Bulyk, Martha L.
Morris, Quaid D.
Hughes, Timothy R.
author_sort Alleyne, Trevis M.
collection PubMed
description Motivation: Recognition of specific DNA sequences is a central mechanism by which transcription factors (TFs) control gene expression. Many TF-binding preferences, however, are unknown or poorly characterized, in part due to the difficulty associated with determining their specificity experimentally, and an incomplete understanding of the mechanisms governing sequence specificity. New techniques that estimate the affinity of TFs to all possible k-mers provide a new opportunity to study DNA–protein interaction mechanisms, and may facilitate inference of binding preferences for members of a given TF family when such information is available for other family members. Results: We employed a new dataset consisting of the relative preferences of mouse homeodomains for all eight-base DNA sequences in order to ask how well we can predict the binding profiles of homeodomains when only their protein sequences are given. We evaluated a panel of standard statistical inference techniques, as well as variations of the protein features considered. Nearest neighbour among functionally important residues emerged among the most effective methods. Our results underscore the complexity of TF–DNA recognition, and suggest a rational approach for future analyses of TF families. Contact: t.hughes@utorotno.ca Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2666811
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-26668112009-04-29 Predicting the binding preference of transcription factors to individual DNA k-mers Alleyne, Trevis M. Peña-Castillo, Lourdes Badis, Gwenael Talukder, Shaheynoor Berger, Michael F. Gehrke, Andrew R. Philippakis, Anthony A. Bulyk, Martha L. Morris, Quaid D. Hughes, Timothy R. Bioinformatics Original Papers Motivation: Recognition of specific DNA sequences is a central mechanism by which transcription factors (TFs) control gene expression. Many TF-binding preferences, however, are unknown or poorly characterized, in part due to the difficulty associated with determining their specificity experimentally, and an incomplete understanding of the mechanisms governing sequence specificity. New techniques that estimate the affinity of TFs to all possible k-mers provide a new opportunity to study DNA–protein interaction mechanisms, and may facilitate inference of binding preferences for members of a given TF family when such information is available for other family members. Results: We employed a new dataset consisting of the relative preferences of mouse homeodomains for all eight-base DNA sequences in order to ask how well we can predict the binding profiles of homeodomains when only their protein sequences are given. We evaluated a panel of standard statistical inference techniques, as well as variations of the protein features considered. Nearest neighbour among functionally important residues emerged among the most effective methods. Our results underscore the complexity of TF–DNA recognition, and suggest a rational approach for future analyses of TF families. Contact: t.hughes@utorotno.ca Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2009-04-15 2008-12-16 /pmc/articles/PMC2666811/ /pubmed/19088121 http://dx.doi.org/10.1093/bioinformatics/btn645 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Alleyne, Trevis M.
Peña-Castillo, Lourdes
Badis, Gwenael
Talukder, Shaheynoor
Berger, Michael F.
Gehrke, Andrew R.
Philippakis, Anthony A.
Bulyk, Martha L.
Morris, Quaid D.
Hughes, Timothy R.
Predicting the binding preference of transcription factors to individual DNA k-mers
title Predicting the binding preference of transcription factors to individual DNA k-mers
title_full Predicting the binding preference of transcription factors to individual DNA k-mers
title_fullStr Predicting the binding preference of transcription factors to individual DNA k-mers
title_full_unstemmed Predicting the binding preference of transcription factors to individual DNA k-mers
title_short Predicting the binding preference of transcription factors to individual DNA k-mers
title_sort predicting the binding preference of transcription factors to individual dna k-mers
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2666811/
https://www.ncbi.nlm.nih.gov/pubmed/19088121
http://dx.doi.org/10.1093/bioinformatics/btn645
work_keys_str_mv AT alleynetrevism predictingthebindingpreferenceoftranscriptionfactorstoindividualdnakmers
AT penacastillolourdes predictingthebindingpreferenceoftranscriptionfactorstoindividualdnakmers
AT badisgwenael predictingthebindingpreferenceoftranscriptionfactorstoindividualdnakmers
AT talukdershaheynoor predictingthebindingpreferenceoftranscriptionfactorstoindividualdnakmers
AT bergermichaelf predictingthebindingpreferenceoftranscriptionfactorstoindividualdnakmers
AT gehrkeandrewr predictingthebindingpreferenceoftranscriptionfactorstoindividualdnakmers
AT philippakisanthonya predictingthebindingpreferenceoftranscriptionfactorstoindividualdnakmers
AT bulykmarthal predictingthebindingpreferenceoftranscriptionfactorstoindividualdnakmers
AT morrisquaidd predictingthebindingpreferenceoftranscriptionfactorstoindividualdnakmers
AT hughestimothyr predictingthebindingpreferenceoftranscriptionfactorstoindividualdnakmers