Cargando…
Leveraging cross-link modification events in CLIP-seq for motif discovery
High-throughput protein–RNA interaction data generated by CLIP-seq has provided an unprecedented depth of access to the activities of RNA-binding proteins (RBPs), the key players in co- and post-transcriptional regulation of gene expression. Motif discovery forms part of the necessary follow-up data...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4288180/ https://www.ncbi.nlm.nih.gov/pubmed/25505146 http://dx.doi.org/10.1093/nar/gku1288 |
_version_ | 1782351924263124992 |
---|---|
author | Bahrami-Samani, Emad Penalva, Luiz O.F. Smith, Andrew D. Uren, Philip J. |
author_facet | Bahrami-Samani, Emad Penalva, Luiz O.F. Smith, Andrew D. Uren, Philip J. |
author_sort | Bahrami-Samani, Emad |
collection | PubMed |
description | High-throughput protein–RNA interaction data generated by CLIP-seq has provided an unprecedented depth of access to the activities of RNA-binding proteins (RBPs), the key players in co- and post-transcriptional regulation of gene expression. Motif discovery forms part of the necessary follow-up data analysis for CLIP-seq, both to refine the exact locations of RBP binding sites, and to characterize them. The specific properties of RBP binding sites, and the CLIP-seq methods, provide additional information not usually present in the classic motif discovery problem: the binding site structure, and cross-linking induced events in reads. We show that CLIP-seq data contains clear secondary structure signals, as well as technology- and RBP-specific cross-link signals. We introduce Zagros, a motif discovery algorithm specifically designed to leverage this information and explore its impact on the quality of recovered motifs. Our results indicate that using both secondary structure and cross-link modifications can greatly improve motif discovery on CLIP-seq data. Further, the motifs we recover provide insight into the balance between sequence- and structure-specificity struck by RBP binding. |
format | Online Article Text |
id | pubmed-4288180 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-42881802015-02-19 Leveraging cross-link modification events in CLIP-seq for motif discovery Bahrami-Samani, Emad Penalva, Luiz O.F. Smith, Andrew D. Uren, Philip J. Nucleic Acids Res Computational Biology High-throughput protein–RNA interaction data generated by CLIP-seq has provided an unprecedented depth of access to the activities of RNA-binding proteins (RBPs), the key players in co- and post-transcriptional regulation of gene expression. Motif discovery forms part of the necessary follow-up data analysis for CLIP-seq, both to refine the exact locations of RBP binding sites, and to characterize them. The specific properties of RBP binding sites, and the CLIP-seq methods, provide additional information not usually present in the classic motif discovery problem: the binding site structure, and cross-linking induced events in reads. We show that CLIP-seq data contains clear secondary structure signals, as well as technology- and RBP-specific cross-link signals. We introduce Zagros, a motif discovery algorithm specifically designed to leverage this information and explore its impact on the quality of recovered motifs. Our results indicate that using both secondary structure and cross-link modifications can greatly improve motif discovery on CLIP-seq data. Further, the motifs we recover provide insight into the balance between sequence- and structure-specificity struck by RBP binding. Oxford University Press 2015-01-09 2014-12-10 /pmc/articles/PMC4288180/ /pubmed/25505146 http://dx.doi.org/10.1093/nar/gku1288 Text en © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Computational Biology Bahrami-Samani, Emad Penalva, Luiz O.F. Smith, Andrew D. Uren, Philip J. Leveraging cross-link modification events in CLIP-seq for motif discovery |
title | Leveraging cross-link modification events in CLIP-seq for motif discovery |
title_full | Leveraging cross-link modification events in CLIP-seq for motif discovery |
title_fullStr | Leveraging cross-link modification events in CLIP-seq for motif discovery |
title_full_unstemmed | Leveraging cross-link modification events in CLIP-seq for motif discovery |
title_short | Leveraging cross-link modification events in CLIP-seq for motif discovery |
title_sort | leveraging cross-link modification events in clip-seq for motif discovery |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4288180/ https://www.ncbi.nlm.nih.gov/pubmed/25505146 http://dx.doi.org/10.1093/nar/gku1288 |
work_keys_str_mv | AT bahramisamaniemad leveragingcrosslinkmodificationeventsinclipseqformotifdiscovery AT penalvaluizof leveragingcrosslinkmodificationeventsinclipseqformotifdiscovery AT smithandrewd leveragingcrosslinkmodificationeventsinclipseqformotifdiscovery AT urenphilipj leveragingcrosslinkmodificationeventsinclipseqformotifdiscovery |