JUZBOX: A web server for extracting biomedical words from the protein sequence

Bobby, P and Balaji, S and Sathyanath, V and Eapen, Santhosh J (2009) JUZBOX: A web server for extracting biomedical words from the protein sequence. Bioinformation, An open access forum, 4 (5). pp. 179-181. ISSN 0973-2063

[img] PDF
12[1].pdf - Published Version
Restricted to Registered users only

Download (93kB) | Request a copy


The recognition of gene/protein names in literature is one of the pivotal steps in the processing of biological literatures for information extraction or data mining. We have compiled a lexicon of biomedical words (conserved patterns/ potential motifs) which has the combination of only 20 alphabets of amino acids. The remaining 6 letters of the English alphabets (B, J, O, U, X, Z) are treated as invalid amino acid characters (to our context), We have jumbled the 6 letters for the sake of usage and convenience and termed as ‘JUZBOX’ and these characters were filtered in the biomedical lexicon. Undoubtedly, the generation of biomedical words from protein sequence using JUZBOX have applications specific for functional annotation.

Item Type: Article
Uncontrolled Keywords: JUZBOX; biomedical words; lexicon
Subjects: Engineering > MIT Manipal > Biotechnology
Depositing User: MIT Library
Date Deposited: 15 Sep 2014 07:44
Last Modified: 15 Sep 2014 07:44
URI: http://eprints.manipal.edu/id/eprint/140623

Actions (login required)

View Item View Item