Cargando…

Shetti, a simple tool to parse, manipulate and search large datasets of sequences

Parsing and manipulating long and/or multiple protein or gene sequences can be a challenging process for experimental biologists and microbiologists lacking prior knowledge of bioinformatics and programming. Here we present a simple, easy, user-friendly and versatile tool to parse, manipulate and se...

Descripción completa

Detalles Bibliográficos
Autor principal: Sobhy, Haitham
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Microbiology Society 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5320677/
https://www.ncbi.nlm.nih.gov/pubmed/28348820
http://dx.doi.org/10.1099/mgen.0.000035
Descripción
Sumario:Parsing and manipulating long and/or multiple protein or gene sequences can be a challenging process for experimental biologists and microbiologists lacking prior knowledge of bioinformatics and programming. Here we present a simple, easy, user-friendly and versatile tool to parse, manipulate and search within large datasets of long and multiple protein or gene sequences. The Shetti tool can be used to search for a sequence, species, protein/gene or pattern/motif. Moreover, it can also be used to construct a universal consensus or molecular signatures for proteins based on their physical characteristics. Shetti is an efficient and fast tool that can deal with large sets of long sequences efficiently. Shetti parses UniProt Knowledgebase and NCBI GenBank flat files and visualizes them as a table.