Cargando…

Optimizing the GATA-3 position weight matrix to improve the identification of novel binding sites

BACKGROUND: The identifying of binding sites for transcription factors is a key component of gene regulatory network analysis. This is often done using position-weight matrices (PWMs). Because of the importance of in silico mapping of tentative binding sites, we previously developed an approach for...

Descripción completa

Detalles Bibliográficos
Autores principales: Nandi, Soumyadeep, Ioshikhes, Ilya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3481455/
https://www.ncbi.nlm.nih.gov/pubmed/22913572
http://dx.doi.org/10.1186/1471-2164-13-416
_version_ 1782247742531174400
author Nandi, Soumyadeep
Ioshikhes, Ilya
author_facet Nandi, Soumyadeep
Ioshikhes, Ilya
author_sort Nandi, Soumyadeep
collection PubMed
description BACKGROUND: The identifying of binding sites for transcription factors is a key component of gene regulatory network analysis. This is often done using position-weight matrices (PWMs). Because of the importance of in silico mapping of tentative binding sites, we previously developed an approach for PWM optimization that substantially improves the accuracy of such mapping. RESULTS: The present work implements the optimization algorithm applied to the existing PWM for GATA-3 transcription factor and builds a new di-nucleotide PWM. The existing available PWM is based on experimental data adopted from Jaspar. The optimized PWM substantially improves the sensitivity and specificity of the TF mapping compared to the conventional applications. The refined PWM also facilitates in silico identification of novel binding sites that are supported by experimental data. We also describe uncommon positioning of binding motifs for several T-cell lineage specific factors in human promoters. CONCLUSION: Our proposed di-nucleotide PWM approach outperforms the conventional mono-nucleotide PWM approach with respect to GATA-3. Therefore our new di-nucleotide PWM provides new insight into plausible transcriptional regulatory interactions in human promoters.
format Online
Article
Text
id pubmed-3481455
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34814552012-11-02 Optimizing the GATA-3 position weight matrix to improve the identification of novel binding sites Nandi, Soumyadeep Ioshikhes, Ilya BMC Genomics Research Article BACKGROUND: The identifying of binding sites for transcription factors is a key component of gene regulatory network analysis. This is often done using position-weight matrices (PWMs). Because of the importance of in silico mapping of tentative binding sites, we previously developed an approach for PWM optimization that substantially improves the accuracy of such mapping. RESULTS: The present work implements the optimization algorithm applied to the existing PWM for GATA-3 transcription factor and builds a new di-nucleotide PWM. The existing available PWM is based on experimental data adopted from Jaspar. The optimized PWM substantially improves the sensitivity and specificity of the TF mapping compared to the conventional applications. The refined PWM also facilitates in silico identification of novel binding sites that are supported by experimental data. We also describe uncommon positioning of binding motifs for several T-cell lineage specific factors in human promoters. CONCLUSION: Our proposed di-nucleotide PWM approach outperforms the conventional mono-nucleotide PWM approach with respect to GATA-3. Therefore our new di-nucleotide PWM provides new insight into plausible transcriptional regulatory interactions in human promoters. BioMed Central 2012-08-22 /pmc/articles/PMC3481455/ /pubmed/22913572 http://dx.doi.org/10.1186/1471-2164-13-416 Text en Copyright ©2012 Nandi and Ioshikhes; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Nandi, Soumyadeep
Ioshikhes, Ilya
Optimizing the GATA-3 position weight matrix to improve the identification of novel binding sites
title Optimizing the GATA-3 position weight matrix to improve the identification of novel binding sites
title_full Optimizing the GATA-3 position weight matrix to improve the identification of novel binding sites
title_fullStr Optimizing the GATA-3 position weight matrix to improve the identification of novel binding sites
title_full_unstemmed Optimizing the GATA-3 position weight matrix to improve the identification of novel binding sites
title_short Optimizing the GATA-3 position weight matrix to improve the identification of novel binding sites
title_sort optimizing the gata-3 position weight matrix to improve the identification of novel binding sites
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3481455/
https://www.ncbi.nlm.nih.gov/pubmed/22913572
http://dx.doi.org/10.1186/1471-2164-13-416
work_keys_str_mv AT nandisoumyadeep optimizingthegata3positionweightmatrixtoimprovetheidentificationofnovelbindingsites
AT ioshikhesilya optimizingthegata3positionweightmatrixtoimprovetheidentificationofnovelbindingsites