Script Identification System for South Indian Triplets

Aithal, Prakash K and Rajesh, G and Siddalingaswamy, PC and Acharya, Dinesh U (2012) Script Identification System for South Indian Triplets. In: Proceedings of National Conference on Advances in Computer Science & Information Security, 6-7 January 2012, Manipal Institute of Technology, Manipal, India. (Submitted)


India is a multilingual, multi-script country whose states follow a three-language formula: a document may be printed in English, Hindi and the official language of the state. For example, in Kerala, a document may contain text lines in English, Hindi and Malayalam script. For optical character recognition (OCR) of such a multilingual document, the script of each text line must be identified before the line is fed to the OCR engine for that script. This paper presents a simple and efficient script identification technique for triplets consisting of any one of the South Indian languages, Hindi and English, applied to text lines from a printed document. The proposed system uses the horizontal projection profile to distinguish the three scripts, and features are extracted from the horizontal projection profile of each text line. The knowledge base of the system is developed from 80 document images containing about 1800 text lines. For a new text line, the necessary features are extracted from its horizontal projection profile and compared with the stored knowledge base to classify the script. The proposed system is tested on 80 different document images containing about 300 text lines of each script, and an overall classification rate of 100% is achieved.
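
To make the central idea concrete, the following is a minimal sketch of how a horizontal projection profile can be computed for one binarized text line. It is not the authors' implementation: the abstract does not specify which features or rules the system uses, so the function names and the two example features (relative position of the densest row and peak-to-mean ratio) are illustrative assumptions. The toy input loosely mimics a line with a strong horizontal stroke near the top, as a Devanagari headline would produce.

```python
def horizontal_projection_profile(line_image):
    """Count foreground (ink) pixels in each row of a binarized text line.

    line_image is a list of rows, each a list of 0 (background) or 1 (ink).
    """
    return [sum(row) for row in line_image]


def peak_features(profile):
    """Return two illustrative features of a profile (assumed, not from the paper):
    the relative vertical position of the densest row (0.0 = top, 1.0 = bottom)
    and the ratio of the peak value to the mean value.
    """
    if not profile or max(profile) == 0:
        raise ValueError("text line contains no foreground pixels")
    peak = max(profile)
    peak_row = profile.index(peak)
    relative_position = peak_row / (len(profile) - 1) if len(profile) > 1 else 0.0
    mean = sum(profile) / len(profile)
    return relative_position, peak / mean


# Toy 6x8 text line: row 1 is a full-width stroke, rows 2-4 hold vertical strokes.
toy_line = [
    [0, 0, 0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 1, 1, 1],
    [0, 1, 0, 0, 1, 0, 1, 0],
    [0, 1, 0, 0, 1, 0, 1, 0],
    [0, 1, 1, 0, 1, 0, 1, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
]

profile = horizontal_projection_profile(toy_line)
position, ratio = peak_features(profile)
print(profile, position, ratio)
```

In a rule-based setting, features of this kind would be compared against thresholds or stored reference values derived from training lines; the specific rules used in the paper are not given in this record.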

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: Script Identification; Tri-Script; South Indian Scripts; Projection Profile; Rule-Based.
Subjects: Engineering > MIT Manipal > Computer Science and Engineering
Depositing User: MIT Library
Date Deposited: 22 Feb 2012 11:28
Last Modified: 22 Feb 2012 11:28
