Cargando…
JUZBOX: A web server for extracting biomedical words from the protein sequence
The recognition of gene/protein names in literature is one of the pivotal steps in the processing of biological literatures for information extraction or data mining. We have compiled a lexicon of biomedical words (conserved patterns/ potential motifs) which has the combination of only 20 alphabets...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Biomedical Informatics Publishing Group
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2859571/ https://www.ncbi.nlm.nih.gov/pubmed/20461154 |
_version_ | 1782180519669137408 |
---|---|
author | Bobby, Paul Balaji, Seetharaman Sathyanath, Variath Eapen, Santhosh J |
author_facet | Bobby, Paul Balaji, Seetharaman Sathyanath, Variath Eapen, Santhosh J |
author_sort | Bobby, Paul |
collection | PubMed |
description | The recognition of gene/protein names in literature is one of the pivotal steps in the processing of biological literatures for information extraction or data mining. We have compiled a lexicon of biomedical words (conserved patterns/ potential motifs) which has the combination of only 20 alphabets of amino acids. The remaining 6 letters of the English alphabets (B, J, O, U, X, Z) are treated as invalid amino acid characters (to our context), We have jumbled the 6 letters for the sake of usage and convenience and termed as ’JUZBOX‘ and these characters were filtered in the biomedical lexicon. Undoubtedly, the generation of biomedical words from protein sequence using JUZBOX have applications specific for functional annotation. AVAILABILITY: JUZBOX is available freely at http://www.spices.res.in/juzbox |
format | Text |
id | pubmed-2859571 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Biomedical Informatics Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-28595712010-05-11 JUZBOX: A web server for extracting biomedical words from the protein sequence Bobby, Paul Balaji, Seetharaman Sathyanath, Variath Eapen, Santhosh J Bioinformation Web Server The recognition of gene/protein names in literature is one of the pivotal steps in the processing of biological literatures for information extraction or data mining. We have compiled a lexicon of biomedical words (conserved patterns/ potential motifs) which has the combination of only 20 alphabets of amino acids. The remaining 6 letters of the English alphabets (B, J, O, U, X, Z) are treated as invalid amino acid characters (to our context), We have jumbled the 6 letters for the sake of usage and convenience and termed as ’JUZBOX‘ and these characters were filtered in the biomedical lexicon. Undoubtedly, the generation of biomedical words from protein sequence using JUZBOX have applications specific for functional annotation. AVAILABILITY: JUZBOX is available freely at http://www.spices.res.in/juzbox Biomedical Informatics Publishing Group 2009-11-17 /pmc/articles/PMC2859571/ /pubmed/20461154 Text en © 2009 Biomedical Informatics Publishing Group This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited. |
spellingShingle | Web Server Bobby, Paul Balaji, Seetharaman Sathyanath, Variath Eapen, Santhosh J JUZBOX: A web server for extracting biomedical words from the protein sequence |
title | JUZBOX: A web server for extracting biomedical words from the protein sequence |
title_full | JUZBOX: A web server for extracting biomedical words from the protein sequence |
title_fullStr | JUZBOX: A web server for extracting biomedical words from the protein sequence |
title_full_unstemmed | JUZBOX: A web server for extracting biomedical words from the protein sequence |
title_short | JUZBOX: A web server for extracting biomedical words from the protein sequence |
title_sort | juzbox: a web server for extracting biomedical words from the protein sequence |
topic | Web Server |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2859571/ https://www.ncbi.nlm.nih.gov/pubmed/20461154 |
work_keys_str_mv | AT bobbypaul juzboxawebserverforextractingbiomedicalwordsfromtheproteinsequence AT balajiseetharaman juzboxawebserverforextractingbiomedicalwordsfromtheproteinsequence AT sathyanathvariath juzboxawebserverforextractingbiomedicalwordsfromtheproteinsequence AT eapensanthoshj juzboxawebserverforextractingbiomedicalwordsfromtheproteinsequence |