Cargando…

JUZBOX: A web server for extracting biomedical words from the protein sequence

The recognition of gene/protein names in literature is one of the pivotal steps in the processing of biological literatures for information extraction or data mining. We have compiled a lexicon of biomedical words (conserved patterns/ potential motifs) which has the combination of only 20 alphabets...

Descripción completa

Detalles Bibliográficos
Autores principales: Bobby, Paul, Balaji, Seetharaman, Sathyanath, Variath, Eapen, Santhosh J
Formato: Texto
Lenguaje:English
Publicado: Biomedical Informatics Publishing Group 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2859571/
https://www.ncbi.nlm.nih.gov/pubmed/20461154
_version_ 1782180519669137408
author Bobby, Paul
Balaji, Seetharaman
Sathyanath, Variath
Eapen, Santhosh J
author_facet Bobby, Paul
Balaji, Seetharaman
Sathyanath, Variath
Eapen, Santhosh J
author_sort Bobby, Paul
collection PubMed
description The recognition of gene/protein names in literature is one of the pivotal steps in the processing of biological literatures for information extraction or data mining. We have compiled a lexicon of biomedical words (conserved patterns/ potential motifs) which has the combination of only 20 alphabets of amino acids. The remaining 6 letters of the English alphabets (B, J, O, U, X, Z) are treated as invalid amino acid characters (to our context), We have jumbled the 6 letters for the sake of usage and convenience and termed as ’JUZBOX‘ and these characters were filtered in the biomedical lexicon. Undoubtedly, the generation of biomedical words from protein sequence using JUZBOX have applications specific for functional annotation. AVAILABILITY: JUZBOX is available freely at http://www.spices.res.in/juzbox
format Text
id pubmed-2859571
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Biomedical Informatics Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-28595712010-05-11 JUZBOX: A web server for extracting biomedical words from the protein sequence Bobby, Paul Balaji, Seetharaman Sathyanath, Variath Eapen, Santhosh J Bioinformation Web Server The recognition of gene/protein names in literature is one of the pivotal steps in the processing of biological literatures for information extraction or data mining. We have compiled a lexicon of biomedical words (conserved patterns/ potential motifs) which has the combination of only 20 alphabets of amino acids. The remaining 6 letters of the English alphabets (B, J, O, U, X, Z) are treated as invalid amino acid characters (to our context), We have jumbled the 6 letters for the sake of usage and convenience and termed as ’JUZBOX‘ and these characters were filtered in the biomedical lexicon. Undoubtedly, the generation of biomedical words from protein sequence using JUZBOX have applications specific for functional annotation. AVAILABILITY: JUZBOX is available freely at http://www.spices.res.in/juzbox Biomedical Informatics Publishing Group 2009-11-17 /pmc/articles/PMC2859571/ /pubmed/20461154 Text en © 2009 Biomedical Informatics Publishing Group This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited.
spellingShingle Web Server
Bobby, Paul
Balaji, Seetharaman
Sathyanath, Variath
Eapen, Santhosh J
JUZBOX: A web server for extracting biomedical words from the protein sequence
title JUZBOX: A web server for extracting biomedical words from the protein sequence
title_full JUZBOX: A web server for extracting biomedical words from the protein sequence
title_fullStr JUZBOX: A web server for extracting biomedical words from the protein sequence
title_full_unstemmed JUZBOX: A web server for extracting biomedical words from the protein sequence
title_short JUZBOX: A web server for extracting biomedical words from the protein sequence
title_sort juzbox: a web server for extracting biomedical words from the protein sequence
topic Web Server
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2859571/
https://www.ncbi.nlm.nih.gov/pubmed/20461154
work_keys_str_mv AT bobbypaul juzboxawebserverforextractingbiomedicalwordsfromtheproteinsequence
AT balajiseetharaman juzboxawebserverforextractingbiomedicalwordsfromtheproteinsequence
AT sathyanathvariath juzboxawebserverforextractingbiomedicalwordsfromtheproteinsequence
AT eapensanthoshj juzboxawebserverforextractingbiomedicalwordsfromtheproteinsequence