Cargando…

Novel computational analysis of protein binding array data identifies direct targets of Nkx2.2 in the pancreas

BACKGROUND: The creation of a complete genome-wide map of transcription factor binding sites is essential for understanding gene regulatory networks in vivo. However, current prediction methods generally rely on statistical models that imperfectly model transcription factor binding. Generation of ne...

Descripción completa

Detalles Bibliográficos
Autores principales: Hill, Jonathon T, Anderson, Keith R, Mastracci, Teresa L, Kaestner, Klaus H, Sussel, Lori
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3050729/
https://www.ncbi.nlm.nih.gov/pubmed/21352540
http://dx.doi.org/10.1186/1471-2105-12-62
_version_ 1782199378682839040
author Hill, Jonathon T
Anderson, Keith R
Mastracci, Teresa L
Kaestner, Klaus H
Sussel, Lori
author_facet Hill, Jonathon T
Anderson, Keith R
Mastracci, Teresa L
Kaestner, Klaus H
Sussel, Lori
author_sort Hill, Jonathon T
collection PubMed
description BACKGROUND: The creation of a complete genome-wide map of transcription factor binding sites is essential for understanding gene regulatory networks in vivo. However, current prediction methods generally rely on statistical models that imperfectly model transcription factor binding. Generation of new prediction methods that are based on protein binding data, but do not rely on these models may improve prediction sensitivity and specificity. RESULTS: We propose a method for predicting transcription factor binding sites in the genome by directly mapping data generated from protein binding microarrays (PBM) to the genome and calculating a moving average of several overlapping octamers. Using this unique algorithm, we predicted binding sites for the essential pancreatic islet transcription factor Nkx2.2 in the mouse genome and confirmed >90% of the tested sites by EMSA and ChIP. Scores generated from this method more accurately predicted relative binding affinity than PWM based methods. We have also identified an alternative core sequence recognized by the Nkx2.2 homeodomain. Furthermore, we have shown that this method correctly identified binding sites in the promoters of two critical pancreatic islet β-cell genes, NeuroD1 and insulin2, that were not predicted by traditional methods. Finally, we show evidence that the algorithm can also be applied to predict binding sites for the nuclear receptor Hnf4α. CONCLUSIONS: PBM-mapping is an accurate method for predicting Nkx2.2 binding sites and may be widely applicable for the creation of genome-wide maps of transcription factor binding sites.
format Text
id pubmed-3050729
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30507292011-03-09 Novel computational analysis of protein binding array data identifies direct targets of Nkx2.2 in the pancreas Hill, Jonathon T Anderson, Keith R Mastracci, Teresa L Kaestner, Klaus H Sussel, Lori BMC Bioinformatics Research Article BACKGROUND: The creation of a complete genome-wide map of transcription factor binding sites is essential for understanding gene regulatory networks in vivo. However, current prediction methods generally rely on statistical models that imperfectly model transcription factor binding. Generation of new prediction methods that are based on protein binding data, but do not rely on these models may improve prediction sensitivity and specificity. RESULTS: We propose a method for predicting transcription factor binding sites in the genome by directly mapping data generated from protein binding microarrays (PBM) to the genome and calculating a moving average of several overlapping octamers. Using this unique algorithm, we predicted binding sites for the essential pancreatic islet transcription factor Nkx2.2 in the mouse genome and confirmed >90% of the tested sites by EMSA and ChIP. Scores generated from this method more accurately predicted relative binding affinity than PWM based methods. We have also identified an alternative core sequence recognized by the Nkx2.2 homeodomain. Furthermore, we have shown that this method correctly identified binding sites in the promoters of two critical pancreatic islet β-cell genes, NeuroD1 and insulin2, that were not predicted by traditional methods. Finally, we show evidence that the algorithm can also be applied to predict binding sites for the nuclear receptor Hnf4α. CONCLUSIONS: PBM-mapping is an accurate method for predicting Nkx2.2 binding sites and may be widely applicable for the creation of genome-wide maps of transcription factor binding sites. BioMed Central 2011-02-25 /pmc/articles/PMC3050729/ /pubmed/21352540 http://dx.doi.org/10.1186/1471-2105-12-62 Text en Copyright ©2011 Hill et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Hill, Jonathon T
Anderson, Keith R
Mastracci, Teresa L
Kaestner, Klaus H
Sussel, Lori
Novel computational analysis of protein binding array data identifies direct targets of Nkx2.2 in the pancreas
title Novel computational analysis of protein binding array data identifies direct targets of Nkx2.2 in the pancreas
title_full Novel computational analysis of protein binding array data identifies direct targets of Nkx2.2 in the pancreas
title_fullStr Novel computational analysis of protein binding array data identifies direct targets of Nkx2.2 in the pancreas
title_full_unstemmed Novel computational analysis of protein binding array data identifies direct targets of Nkx2.2 in the pancreas
title_short Novel computational analysis of protein binding array data identifies direct targets of Nkx2.2 in the pancreas
title_sort novel computational analysis of protein binding array data identifies direct targets of nkx2.2 in the pancreas
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3050729/
https://www.ncbi.nlm.nih.gov/pubmed/21352540
http://dx.doi.org/10.1186/1471-2105-12-62
work_keys_str_mv AT hilljonathont novelcomputationalanalysisofproteinbindingarraydataidentifiesdirecttargetsofnkx22inthepancreas
AT andersonkeithr novelcomputationalanalysisofproteinbindingarraydataidentifiesdirecttargetsofnkx22inthepancreas
AT mastracciteresal novelcomputationalanalysisofproteinbindingarraydataidentifiesdirecttargetsofnkx22inthepancreas
AT kaestnerklaush novelcomputationalanalysisofproteinbindingarraydataidentifiesdirecttargetsofnkx22inthepancreas
AT sussellori novelcomputationalanalysisofproteinbindingarraydataidentifiesdirecttargetsofnkx22inthepancreas