Cargando…

A motif-independent metric for DNA sequence specificity

BACKGROUND: Genome-wide mapping of protein-DNA interactions has been widely used to investigate biological functions of the genome. An important question is to what extent such interactions are regulated at the DNA sequence level. However, current investigation is hampered by the lack of computation...

Descripción completa

Detalles Bibliográficos
Autores principales: Pinello, Luca, Lo Bosco, Giosuè, Hanlon, Bret, Yuan, Guo-Cheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3267244/
https://www.ncbi.nlm.nih.gov/pubmed/22017798
http://dx.doi.org/10.1186/1471-2105-12-408
_version_ 1782222267167539200
author Pinello, Luca
Lo Bosco, Giosuè
Hanlon, Bret
Yuan, Guo-Cheng
author_facet Pinello, Luca
Lo Bosco, Giosuè
Hanlon, Bret
Yuan, Guo-Cheng
author_sort Pinello, Luca
collection PubMed
description BACKGROUND: Genome-wide mapping of protein-DNA interactions has been widely used to investigate biological functions of the genome. An important question is to what extent such interactions are regulated at the DNA sequence level. However, current investigation is hampered by the lack of computational methods for systematic evaluating sequence specificity. RESULTS: We present a simple, unbiased quantitative measure for DNA sequence specificity called the Motif Independent Measure (MIM). By analyzing both simulated and real experimental data, we found that the MIM measure can be used to detect sequence specificity independent of presence of transcription factor (TF) binding motifs. We also found that the level of specificity associated with H3K4me1 target sequences is highly cell-type specific and highest in embryonic stem (ES) cells. We predicted H3K4me1 target sequences by using the N- score model and found that the prediction accuracy is indeed high in ES cells.The software to compute the MIM is freely available at: https://github.com/lucapinello/mim. CONCLUSIONS: Our method provides a unified framework for quantifying DNA sequence specificity and serves as a guide for development of sequence-based prediction models.
format Online
Article
Text
id pubmed-3267244
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32672442012-01-27 A motif-independent metric for DNA sequence specificity Pinello, Luca Lo Bosco, Giosuè Hanlon, Bret Yuan, Guo-Cheng BMC Bioinformatics Methodology Article BACKGROUND: Genome-wide mapping of protein-DNA interactions has been widely used to investigate biological functions of the genome. An important question is to what extent such interactions are regulated at the DNA sequence level. However, current investigation is hampered by the lack of computational methods for systematic evaluating sequence specificity. RESULTS: We present a simple, unbiased quantitative measure for DNA sequence specificity called the Motif Independent Measure (MIM). By analyzing both simulated and real experimental data, we found that the MIM measure can be used to detect sequence specificity independent of presence of transcription factor (TF) binding motifs. We also found that the level of specificity associated with H3K4me1 target sequences is highly cell-type specific and highest in embryonic stem (ES) cells. We predicted H3K4me1 target sequences by using the N- score model and found that the prediction accuracy is indeed high in ES cells.The software to compute the MIM is freely available at: https://github.com/lucapinello/mim. CONCLUSIONS: Our method provides a unified framework for quantifying DNA sequence specificity and serves as a guide for development of sequence-based prediction models. BioMed Central 2011-10-21 /pmc/articles/PMC3267244/ /pubmed/22017798 http://dx.doi.org/10.1186/1471-2105-12-408 Text en Copyright ©2011 Pinello et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Pinello, Luca
Lo Bosco, Giosuè
Hanlon, Bret
Yuan, Guo-Cheng
A motif-independent metric for DNA sequence specificity
title A motif-independent metric for DNA sequence specificity
title_full A motif-independent metric for DNA sequence specificity
title_fullStr A motif-independent metric for DNA sequence specificity
title_full_unstemmed A motif-independent metric for DNA sequence specificity
title_short A motif-independent metric for DNA sequence specificity
title_sort motif-independent metric for dna sequence specificity
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3267244/
https://www.ncbi.nlm.nih.gov/pubmed/22017798
http://dx.doi.org/10.1186/1471-2105-12-408
work_keys_str_mv AT pinelloluca amotifindependentmetricfordnasequencespecificity
AT loboscogiosue amotifindependentmetricfordnasequencespecificity
AT hanlonbret amotifindependentmetricfordnasequencespecificity
AT yuanguocheng amotifindependentmetricfordnasequencespecificity
AT pinelloluca motifindependentmetricfordnasequencespecificity
AT loboscogiosue motifindependentmetricfordnasequencespecificity
AT hanlonbret motifindependentmetricfordnasequencespecificity
AT yuanguocheng motifindependentmetricfordnasequencespecificity