Cargando…
A motif-independent metric for DNA sequence specificity
BACKGROUND: Genome-wide mapping of protein-DNA interactions has been widely used to investigate biological functions of the genome. An important question is to what extent such interactions are regulated at the DNA sequence level. However, current investigation is hampered by the lack of computation...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3267244/ https://www.ncbi.nlm.nih.gov/pubmed/22017798 http://dx.doi.org/10.1186/1471-2105-12-408 |
_version_ | 1782222267167539200 |
---|---|
author | Pinello, Luca Lo Bosco, Giosuè Hanlon, Bret Yuan, Guo-Cheng |
author_facet | Pinello, Luca Lo Bosco, Giosuè Hanlon, Bret Yuan, Guo-Cheng |
author_sort | Pinello, Luca |
collection | PubMed |
description | BACKGROUND: Genome-wide mapping of protein-DNA interactions has been widely used to investigate biological functions of the genome. An important question is to what extent such interactions are regulated at the DNA sequence level. However, current investigation is hampered by the lack of computational methods for systematic evaluating sequence specificity. RESULTS: We present a simple, unbiased quantitative measure for DNA sequence specificity called the Motif Independent Measure (MIM). By analyzing both simulated and real experimental data, we found that the MIM measure can be used to detect sequence specificity independent of presence of transcription factor (TF) binding motifs. We also found that the level of specificity associated with H3K4me1 target sequences is highly cell-type specific and highest in embryonic stem (ES) cells. We predicted H3K4me1 target sequences by using the N- score model and found that the prediction accuracy is indeed high in ES cells.The software to compute the MIM is freely available at: https://github.com/lucapinello/mim. CONCLUSIONS: Our method provides a unified framework for quantifying DNA sequence specificity and serves as a guide for development of sequence-based prediction models. |
format | Online Article Text |
id | pubmed-3267244 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32672442012-01-27 A motif-independent metric for DNA sequence specificity Pinello, Luca Lo Bosco, Giosuè Hanlon, Bret Yuan, Guo-Cheng BMC Bioinformatics Methodology Article BACKGROUND: Genome-wide mapping of protein-DNA interactions has been widely used to investigate biological functions of the genome. An important question is to what extent such interactions are regulated at the DNA sequence level. However, current investigation is hampered by the lack of computational methods for systematic evaluating sequence specificity. RESULTS: We present a simple, unbiased quantitative measure for DNA sequence specificity called the Motif Independent Measure (MIM). By analyzing both simulated and real experimental data, we found that the MIM measure can be used to detect sequence specificity independent of presence of transcription factor (TF) binding motifs. We also found that the level of specificity associated with H3K4me1 target sequences is highly cell-type specific and highest in embryonic stem (ES) cells. We predicted H3K4me1 target sequences by using the N- score model and found that the prediction accuracy is indeed high in ES cells.The software to compute the MIM is freely available at: https://github.com/lucapinello/mim. CONCLUSIONS: Our method provides a unified framework for quantifying DNA sequence specificity and serves as a guide for development of sequence-based prediction models. BioMed Central 2011-10-21 /pmc/articles/PMC3267244/ /pubmed/22017798 http://dx.doi.org/10.1186/1471-2105-12-408 Text en Copyright ©2011 Pinello et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Pinello, Luca Lo Bosco, Giosuè Hanlon, Bret Yuan, Guo-Cheng A motif-independent metric for DNA sequence specificity |
title | A motif-independent metric for DNA sequence specificity |
title_full | A motif-independent metric for DNA sequence specificity |
title_fullStr | A motif-independent metric for DNA sequence specificity |
title_full_unstemmed | A motif-independent metric for DNA sequence specificity |
title_short | A motif-independent metric for DNA sequence specificity |
title_sort | motif-independent metric for dna sequence specificity |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3267244/ https://www.ncbi.nlm.nih.gov/pubmed/22017798 http://dx.doi.org/10.1186/1471-2105-12-408 |
work_keys_str_mv | AT pinelloluca amotifindependentmetricfordnasequencespecificity AT loboscogiosue amotifindependentmetricfordnasequencespecificity AT hanlonbret amotifindependentmetricfordnasequencespecificity AT yuanguocheng amotifindependentmetricfordnasequencespecificity AT pinelloluca motifindependentmetricfordnasequencespecificity AT loboscogiosue motifindependentmetricfordnasequencespecificity AT hanlonbret motifindependentmetricfordnasequencespecificity AT yuanguocheng motifindependentmetricfordnasequencespecificity |