Hierarchical Recognition System for Machine Printed Kannada Characters

Acharya, Dinesh U and Subbareddy , NV and Makkithaya, Krishnamoorthi (2008) Hierarchical Recognition System for Machine Printed Kannada Characters. International Journal of Computer Science and Network Security, 8 (11). pp. 44-53.

[img] PDF
ijcsns200_8-11_44.pdf - Published Version
Restricted to Registered users only

Download (452kB) | Request a copy
Official URL: http://search.ijcsns.org/02_search/02_search_03.ph...


Extensive research has been done on optical character recognition in the last few decades. Most of the efforts were made to develop OCR systems for foreign languages like English, Japanese, Roman and Arabic characters. Many commercial OCR systems for these foreign languages are available in the market. In the context of Indian languages, majority of work is reported on Hindi and Bangla. And very few reports are available on South Indian languages. This paper describes a character recognition system that can handle machine printed text documents in Kannada, which is the official language of the South Indian state of Karnataka. Initially, the scanned image is preprocessed to remove noise. Lines, words and character components are segmented using two-stage segmentation technique. Classification of the character components is done in two stages. In the first stage, the character components are grouped into small subsets by a feature based tree classifier. In the second stage, characters in each group are recognized using a nearest neighbor classifier. We adopted this hybrid approach instead of using only a tree classifier because it is nearly impossible to find a set of stroke features that are simple to compute, robust and reliable to detect, and are sufficient to classify a large number of basic and complex shaped compound characters. The system is tested with the data set containing 8400 characters of different font and size. On average, the system recognizes characters with an accuracy of about 92.68%.

Item Type: Article
Additional Information: Copyright © IJCSNS
Uncontrolled Keywords: Character recognition;Structural features;Direction code; Binary decision tree;k-Nearest Neighbor;Multi-stage classifier
Subjects: Engineering > MIT Manipal > Computer Science and Engineering
Engineering > MIT Manipal > MCA
Depositing User: MIT Library
Date Deposited: 26 Apr 2011 06:42
Last Modified: 09 Jun 2011 06:19
URI: http://eprints.manipal.edu/id/eprint/7

Actions (login required)

View Item View Item