tacg – a grep for DNA

Bibliographic Details
Main author: Mangalam, Harry J
Format: Text
Language: English
Published: BioMed Central 2002
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC99049/
https://www.ncbi.nlm.nih.gov/pubmed/11882250
http://dx.doi.org/10.1186/1471-2105-3-8
Description
Summary: BACKGROUND: Pattern matching is the core of bioinformatics; it is used in database searching, restriction enzyme mapping, and finding open reading frames. It is done repeatedly over increasingly long sequences, so code must be efficient and insensitive to sequence length. Patterns of interest include simple motifs with IUPAC degeneracies, regular expressions, patterns allowing mismatches, and probability matrices. RESULTS: I describe a small application that allows searching for all of the above pattern types individually and that further allows these atomic motifs to be assembled into logical rules for more sophisticated analysis. CONCLUSION: tacg is small, portable, faster and more capable than most alternatives, relatively easy to modify, and freely available in source code.
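
To make two of the pattern types named in the summary concrete (motifs with IUPAC degeneracies and patterns allowing mismatches), here is a minimal Python sketch. It is not tacg's code and says nothing about how tacg implements its search; the function names `iupac_to_regex`, `find_motif`, and `find_with_mismatches` are hypothetical and chosen only for illustration. The only external fact it relies on is the standard IUPAC nucleotide code table.

```python
import re

# Standard IUPAC nucleotide codes mapped to the bases each one stands for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}


def iupac_to_regex(motif):
    """Translate a degenerate motif into a regex (hypothetical helper)."""
    parts = []
    for code in motif.upper():
        bases = IUPAC[code]
        # Unambiguous codes stay literal; degenerate codes become a class.
        parts.append(bases if len(bases) == 1 else "[" + bases + "]")
    return "".join(parts)


def find_motif(sequence, motif):
    """Return 0-based start positions of exact (degenerate) matches."""
    # A lookahead matches zero characters, so overlapping hits are reported.
    pattern = re.compile("(?=" + iupac_to_regex(motif) + ")")
    return [m.start() for m in pattern.finditer(sequence.upper())]


def find_with_mismatches(sequence, motif, max_mismatches):
    """Return start positions where at most max_mismatches bases differ."""
    sequence = sequence.upper()
    motif = motif.upper()
    hits = []
    # Naive sliding window: compare every window against the motif.
    for start in range(len(sequence) - len(motif) + 1):
        window = sequence[start:start + len(motif)]
        mismatches = sum(
            1 for base, code in zip(window, motif) if base not in IUPAC[code]
        )
        if mismatches <= max_mismatches:
            hits.append(start)
    return hits
```

For example, `find_motif("AAGTCGACTT", "GTYRAC")` reports the site at position 2, because `Y` accepts `C` and `R` accepts `G`. The sliding-window mismatch search is the simplest possible approach and its cost grows with sequence length times motif length, so it is meant only to show what the pattern type means, not how an efficient tool would do it.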