Cargando…

Fast-Find: A novel computational approach to analyzing combinatorial motifs

BACKGROUND: Many vital biological processes, including transcription and splicing, require a combination of short, degenerate sequence patterns, or motifs, adjacent to defined sequence features. Although these motifs occur frequently by chance, they only have biological meaning within a specific con...

Descripción completa

Detalles Bibliográficos
Autores principales: Hamady, Micah, Peden, Erin, Knight, Rob, Singh, Ravinder
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1360682/
https://www.ncbi.nlm.nih.gov/pubmed/16393334
http://dx.doi.org/10.1186/1471-2105-7-1
_version_ 1782126707427246080
author Hamady, Micah
Peden, Erin
Knight, Rob
Singh, Ravinder
author_facet Hamady, Micah
Peden, Erin
Knight, Rob
Singh, Ravinder
author_sort Hamady, Micah
collection PubMed
description BACKGROUND: Many vital biological processes, including transcription and splicing, require a combination of short, degenerate sequence patterns, or motifs, adjacent to defined sequence features. Although these motifs occur frequently by chance, they only have biological meaning within a specific context. Identifying transcripts that contain meaningful combinations of patterns is thus an important problem, which existing tools address poorly. RESULTS: Here we present a new approach, Fast-FIND (Fast-Fully Indexed Nucleotide Database), that uses a relational database to support rapid indexed searches for arbitrary combinations of patterns defined either by sequence or composition. Fast-FIND is easy to implement, takes less than a second to search the entire Drosophila genome sequence for arbitrary patterns adjacent to sites of alternative polyadenylation, and is sufficiently fast to allow sensitivity analysis on the patterns. We have applied this approach to identify transcripts that contain combinations of sequence motifs for RNA-binding proteins that may regulate alternative polyadenylation. CONCLUSION: Fast-FIND provides an efficient way to identify transcripts that are potentially regulated via alternative polyadenylation. We have used it to generate hypotheses about interactions between specific polyadenylation factors, which we will test experimentally.
format Text
id pubmed-1360682
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-13606822006-02-04 Fast-Find: A novel computational approach to analyzing combinatorial motifs Hamady, Micah Peden, Erin Knight, Rob Singh, Ravinder BMC Bioinformatics Software BACKGROUND: Many vital biological processes, including transcription and splicing, require a combination of short, degenerate sequence patterns, or motifs, adjacent to defined sequence features. Although these motifs occur frequently by chance, they only have biological meaning within a specific context. Identifying transcripts that contain meaningful combinations of patterns is thus an important problem, which existing tools address poorly. RESULTS: Here we present a new approach, Fast-FIND (Fast-Fully Indexed Nucleotide Database), that uses a relational database to support rapid indexed searches for arbitrary combinations of patterns defined either by sequence or composition. Fast-FIND is easy to implement, takes less than a second to search the entire Drosophila genome sequence for arbitrary patterns adjacent to sites of alternative polyadenylation, and is sufficiently fast to allow sensitivity analysis on the patterns. We have applied this approach to identify transcripts that contain combinations of sequence motifs for RNA-binding proteins that may regulate alternative polyadenylation. CONCLUSION: Fast-FIND provides an efficient way to identify transcripts that are potentially regulated via alternative polyadenylation. We have used it to generate hypotheses about interactions between specific polyadenylation factors, which we will test experimentally. BioMed Central 2006-01-04 /pmc/articles/PMC1360682/ /pubmed/16393334 http://dx.doi.org/10.1186/1471-2105-7-1 Text en Copyright © 2006 Hamady et al; licensee BioMed Central Ltd.
spellingShingle Software
Hamady, Micah
Peden, Erin
Knight, Rob
Singh, Ravinder
Fast-Find: A novel computational approach to analyzing combinatorial motifs
title Fast-Find: A novel computational approach to analyzing combinatorial motifs
title_full Fast-Find: A novel computational approach to analyzing combinatorial motifs
title_fullStr Fast-Find: A novel computational approach to analyzing combinatorial motifs
title_full_unstemmed Fast-Find: A novel computational approach to analyzing combinatorial motifs
title_short Fast-Find: A novel computational approach to analyzing combinatorial motifs
title_sort fast-find: a novel computational approach to analyzing combinatorial motifs
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1360682/
https://www.ncbi.nlm.nih.gov/pubmed/16393334
http://dx.doi.org/10.1186/1471-2105-7-1
work_keys_str_mv AT hamadymicah fastfindanovelcomputationalapproachtoanalyzingcombinatorialmotifs
AT pedenerin fastfindanovelcomputationalapproachtoanalyzingcombinatorialmotifs
AT knightrob fastfindanovelcomputationalapproachtoanalyzingcombinatorialmotifs
AT singhravinder fastfindanovelcomputationalapproachtoanalyzingcombinatorialmotifs