Cargando…

DISPARE: DIScriminative PAttern REfinement for Position Weight Matrices

BACKGROUND: The accurate determination of transcription factor binding affinities is an important problem in biology and key to understanding the gene regulation process. Position weight matrices are commonly used to represent the binding properties of transcription factor binding sites but suffer f...

Descripción completa

Detalles Bibliográficos
Autores principales: da Piedade, Isabelle, Tang, Man-Hung Eric, Elemento, Olivier
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2788558/
https://www.ncbi.nlm.nih.gov/pubmed/19941641
http://dx.doi.org/10.1186/1471-2105-10-388
_version_ 1782174994245091328
author da Piedade, Isabelle
Tang, Man-Hung Eric
Elemento, Olivier
author_facet da Piedade, Isabelle
Tang, Man-Hung Eric
Elemento, Olivier
author_sort da Piedade, Isabelle
collection PubMed
description BACKGROUND: The accurate determination of transcription factor binding affinities is an important problem in biology and key to understanding the gene regulation process. Position weight matrices are commonly used to represent the binding properties of transcription factor binding sites but suffer from low information content and a large number of false matches in the genome. We describe a novel algorithm for the refinement of position weight matrices representing transcription factor binding sites based on experimental data, including ChIP-chip analyses. We present an iterative weight matrix optimization method that is more accurate in distinguishing true transcription factor binding sites from a negative control set. The initial position weight matrix comes from JASPAR, TRANSFAC or other sources. The main new features are the discriminative nature of the method and matrix width and length optimization. RESULTS: The algorithm was applied to the increasing collection of known transcription factor binding sites obtained from ChIP-chip experiments. The results show that our algorithm significantly improves the sensitivity and specificity of matrix models for identifying transcription factor binding sites. CONCLUSION: When the transcription factor is known, it is more appropriate to use a discriminative approach such as the one presented here to derive its transcription factor-DNA binding properties starting with a matrix, as opposed to performing de novo motif discovery. Generating more accurate position weight matrices will ultimately contribute to a better understanding of eukaryotic transcriptional regulation, and could potentially offer a better alternative to ab initio motif discovery.
format Text
id pubmed-2788558
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27885582009-12-04 DISPARE: DIScriminative PAttern REfinement for Position Weight Matrices da Piedade, Isabelle Tang, Man-Hung Eric Elemento, Olivier BMC Bioinformatics Methodology article BACKGROUND: The accurate determination of transcription factor binding affinities is an important problem in biology and key to understanding the gene regulation process. Position weight matrices are commonly used to represent the binding properties of transcription factor binding sites but suffer from low information content and a large number of false matches in the genome. We describe a novel algorithm for the refinement of position weight matrices representing transcription factor binding sites based on experimental data, including ChIP-chip analyses. We present an iterative weight matrix optimization method that is more accurate in distinguishing true transcription factor binding sites from a negative control set. The initial position weight matrix comes from JASPAR, TRANSFAC or other sources. The main new features are the discriminative nature of the method and matrix width and length optimization. RESULTS: The algorithm was applied to the increasing collection of known transcription factor binding sites obtained from ChIP-chip experiments. The results show that our algorithm significantly improves the sensitivity and specificity of matrix models for identifying transcription factor binding sites. CONCLUSION: When the transcription factor is known, it is more appropriate to use a discriminative approach such as the one presented here to derive its transcription factor-DNA binding properties starting with a matrix, as opposed to performing de novo motif discovery. Generating more accurate position weight matrices will ultimately contribute to a better understanding of eukaryotic transcriptional regulation, and could potentially offer a better alternative to ab initio motif discovery. BioMed Central 2009-11-26 /pmc/articles/PMC2788558/ /pubmed/19941641 http://dx.doi.org/10.1186/1471-2105-10-388 Text en Copyright ©2009 da Piedade et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology article
da Piedade, Isabelle
Tang, Man-Hung Eric
Elemento, Olivier
DISPARE: DIScriminative PAttern REfinement for Position Weight Matrices
title DISPARE: DIScriminative PAttern REfinement for Position Weight Matrices
title_full DISPARE: DIScriminative PAttern REfinement for Position Weight Matrices
title_fullStr DISPARE: DIScriminative PAttern REfinement for Position Weight Matrices
title_full_unstemmed DISPARE: DIScriminative PAttern REfinement for Position Weight Matrices
title_short DISPARE: DIScriminative PAttern REfinement for Position Weight Matrices
title_sort dispare: discriminative pattern refinement for position weight matrices
topic Methodology article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2788558/
https://www.ncbi.nlm.nih.gov/pubmed/19941641
http://dx.doi.org/10.1186/1471-2105-10-388
work_keys_str_mv AT dapiedadeisabelle disparediscriminativepatternrefinementforpositionweightmatrices
AT tangmanhungeric disparediscriminativepatternrefinementforpositionweightmatrices
AT elementoolivier disparediscriminativepatternrefinementforpositionweightmatrices