Shetti, a simple tool to parse, manipulate and search large datasets of sequences

Parsing and manipulating long and/or multiple protein or gene sequences can be a challenging process for experimental biologists and microbiologists lacking prior knowledge of bioinformatics and programming. Here we present a simple, easy, user-friendly and versatile tool to parse, manipulate and se...

Bibliographic Details
Main Author: Sobhy, Haitham
Format: Online Article Text
Language: English
Published: Microbiology Society 2015
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5320677/
https://www.ncbi.nlm.nih.gov/pubmed/28348820
http://dx.doi.org/10.1099/mgen.0.000035
_version_ 1782509582395899904
author Sobhy, Haitham
author_facet Sobhy, Haitham
author_sort Sobhy, Haitham
collection PubMed
description Parsing and manipulating long and/or multiple protein or gene sequences can be a challenging process for experimental biologists and microbiologists lacking prior knowledge of bioinformatics and programming. Here we present a simple, easy, user-friendly and versatile tool to parse, manipulate and search within large datasets of long and multiple protein or gene sequences. The Shetti tool can be used to search for a sequence, species, protein/gene or pattern/motif. Moreover, it can also be used to construct a universal consensus or molecular signatures for proteins based on their physical characteristics. Shetti is an efficient and fast tool that can deal with large sets of long sequences efficiently. Shetti parses UniProt Knowledgebase and NCBI GenBank flat files and visualizes them as a table.
format Online
Article
Text
id pubmed-5320677
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Microbiology Society
record_format MEDLINE/PubMed
spelling pubmed-53206772017-03-27 Shetti, a simple tool to parse, manipulate and search large datasets of sequences Sobhy, Haitham Microb Genom Methods Paper Parsing and manipulating long and/or multiple protein or gene sequences can be a challenging process for experimental biologists and microbiologists lacking prior knowledge of bioinformatics and programming. Here we present a simple, easy, user-friendly and versatile tool to parse, manipulate and search within large datasets of long and multiple protein or gene sequences. The Shetti tool can be used to search for a sequence, species, protein/gene or pattern/motif. Moreover, it can also be used to construct a universal consensus or molecular signatures for proteins based on their physical characteristics. Shetti is an efficient and fast tool that can deal with large sets of long sequences efficiently. Shetti parses UniProt Knowledgebase and NCBI GenBank flat files and visualizes them as a table. Microbiology Society 2015-11-06 /pmc/articles/PMC5320677/ /pubmed/28348820 http://dx.doi.org/10.1099/mgen.0.000035 Text en © 2015 The Authors http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Methods Paper
Sobhy, Haitham
Shetti, a simple tool to parse, manipulate and search large datasets of sequences
title Shetti, a simple tool to parse, manipulate and search large datasets of sequences
title_sort shetti, a simple tool to parse, manipulate and search large datasets of sequences
topic Methods Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5320677/
https://www.ncbi.nlm.nih.gov/pubmed/28348820
http://dx.doi.org/10.1099/mgen.0.000035
work_keys_str_mv AT sobhyhaitham shettiasimpletooltoparsemanipulateandsearchlargedatasetsofsequences