Cargando…

Positional clustering improves computational binding site detection and identifies novel cis-regulatory sites in mammalian GABA(A) receptor subunit genes

Understanding transcription factor (TF) mediated control of gene expression remains a major challenge at the interface of computational and experimental biology. Computational techniques predicting TF-binding site specificity are frequently unreliable. On the other hand, comprehensive experimental v...

Descripción completa

Detalles Bibliográficos
Autores principales: Reddy, Timothy E., Shakhnovich, Boris E., Roberts, Daniel S., Russek, Shelley J., DeLisi, Charles
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1807961/
https://www.ncbi.nlm.nih.gov/pubmed/17204484
http://dx.doi.org/10.1093/nar/gkl1062
_version_ 1782132504867635200
author Reddy, Timothy E.
Shakhnovich, Boris E.
Roberts, Daniel S.
Russek, Shelley J.
DeLisi, Charles
author_facet Reddy, Timothy E.
Shakhnovich, Boris E.
Roberts, Daniel S.
Russek, Shelley J.
DeLisi, Charles
author_sort Reddy, Timothy E.
collection PubMed
description Understanding transcription factor (TF) mediated control of gene expression remains a major challenge at the interface of computational and experimental biology. Computational techniques predicting TF-binding site specificity are frequently unreliable. On the other hand, comprehensive experimental validation is difficult and time consuming. We introduce a simple strategy that dramatically improves robustness and accuracy of computational binding site prediction. First, we evaluate the rate of recurrence of computational TFBS predictions by commonly used sampling procedures. We find that the vast majority of results are biologically meaningless. However clustering results based on nucleotide position improves predictive power. Additionally, we find that positional clustering increases robustness to long or imperfectly selected input sequences. Positional clustering can also be used as a mechanism to integrate results from multiple sampling approaches for improvements in accuracy over each one alone. Finally, we predict and validate regulatory sequences partially responsible for transcriptional control of the mammalian type A γ-aminobutyric acid receptor (GABA(A)R) subunit genes. Positional clustering is useful for improving computational binding site predictions, with potential application to improving our understanding of mammalian gene expression. In particular, predicted regulatory mechanisms in the mammalian GABA(A)R subunit gene family may open new avenues of research towards understanding this pharmacologically important neurotransmitter receptor system.
format Text
id pubmed-1807961
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-18079612007-03-02 Positional clustering improves computational binding site detection and identifies novel cis-regulatory sites in mammalian GABA(A) receptor subunit genes Reddy, Timothy E. Shakhnovich, Boris E. Roberts, Daniel S. Russek, Shelley J. DeLisi, Charles Nucleic Acids Res Methods Online Understanding transcription factor (TF) mediated control of gene expression remains a major challenge at the interface of computational and experimental biology. Computational techniques predicting TF-binding site specificity are frequently unreliable. On the other hand, comprehensive experimental validation is difficult and time consuming. We introduce a simple strategy that dramatically improves robustness and accuracy of computational binding site prediction. First, we evaluate the rate of recurrence of computational TFBS predictions by commonly used sampling procedures. We find that the vast majority of results are biologically meaningless. However clustering results based on nucleotide position improves predictive power. Additionally, we find that positional clustering increases robustness to long or imperfectly selected input sequences. Positional clustering can also be used as a mechanism to integrate results from multiple sampling approaches for improvements in accuracy over each one alone. Finally, we predict and validate regulatory sequences partially responsible for transcriptional control of the mammalian type A γ-aminobutyric acid receptor (GABA(A)R) subunit genes. Positional clustering is useful for improving computational binding site predictions, with potential application to improving our understanding of mammalian gene expression. In particular, predicted regulatory mechanisms in the mammalian GABA(A)R subunit gene family may open new avenues of research towards understanding this pharmacologically important neurotransmitter receptor system. Oxford University Press 2007-02 2007-01-03 /pmc/articles/PMC1807961/ /pubmed/17204484 http://dx.doi.org/10.1093/nar/gkl1062 Text en © 2006 The Author(s). This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Reddy, Timothy E.
Shakhnovich, Boris E.
Roberts, Daniel S.
Russek, Shelley J.
DeLisi, Charles
Positional clustering improves computational binding site detection and identifies novel cis-regulatory sites in mammalian GABA(A) receptor subunit genes
title Positional clustering improves computational binding site detection and identifies novel cis-regulatory sites in mammalian GABA(A) receptor subunit genes
title_full Positional clustering improves computational binding site detection and identifies novel cis-regulatory sites in mammalian GABA(A) receptor subunit genes
title_fullStr Positional clustering improves computational binding site detection and identifies novel cis-regulatory sites in mammalian GABA(A) receptor subunit genes
title_full_unstemmed Positional clustering improves computational binding site detection and identifies novel cis-regulatory sites in mammalian GABA(A) receptor subunit genes
title_short Positional clustering improves computational binding site detection and identifies novel cis-regulatory sites in mammalian GABA(A) receptor subunit genes
title_sort positional clustering improves computational binding site detection and identifies novel cis-regulatory sites in mammalian gaba(a) receptor subunit genes
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1807961/
https://www.ncbi.nlm.nih.gov/pubmed/17204484
http://dx.doi.org/10.1093/nar/gkl1062
work_keys_str_mv AT reddytimothye positionalclusteringimprovescomputationalbindingsitedetectionandidentifiesnovelcisregulatorysitesinmammaliangabaareceptorsubunitgenes
AT shakhnovichborise positionalclusteringimprovescomputationalbindingsitedetectionandidentifiesnovelcisregulatorysitesinmammaliangabaareceptorsubunitgenes
AT robertsdaniels positionalclusteringimprovescomputationalbindingsitedetectionandidentifiesnovelcisregulatorysitesinmammaliangabaareceptorsubunitgenes
AT russekshelleyj positionalclusteringimprovescomputationalbindingsitedetectionandidentifiesnovelcisregulatorysitesinmammaliangabaareceptorsubunitgenes
AT delisicharles positionalclusteringimprovescomputationalbindingsitedetectionandidentifiesnovelcisregulatorysitesinmammaliangabaareceptorsubunitgenes