Kannada Morpheme Segmentation using Machine Learning

Angle, Sachi and Rao, Ashwath B and Muralikrishna, S N (2018) Kannada Morpheme Segmentation using Machine Learning. In: International Conference on Contemporary Engineering and Technology, 10/03/2018, Prince Shri Venkateshwara Padmavathy Engineering C.

[img] PDF
1100.pdf - Published Version
Restricted to Registered users only

Download (245kB) | Request a copy

Abstract

This paper addresses and targets morpheme segmentation of Kannada words using supervised classification. We have used manually annotated Kannada treebank corpus, which is recently developed by us. Kannada bears resemblance to other Dravidian languages in morphological structure. It is an agglutinative language, hence its words have complex morphological form with each word comprising of a root and an optional set of suffixes. These suffixes carry additional meaning, apart from the root word in a context. This paper discusses the extraction of morphemes of a word by using Support Vector Machines for Classification. Additional features representing the properties of the Kannada words were extracted and the different letters were classified into labels that result in the morphological segmentation of the word. Various methods for evaluation were considered and an accuracy of 85.97% was achieved

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: Morphology, Kannada, Machine Learning
Subjects: Engineering > MIT Manipal > Computer Science and Engineering
Depositing User: MIT Library
Date Deposited: 07 Aug 2018 08:53
Last Modified: 07 Aug 2018 08:54
URI: http://eprints.manipal.edu/id/eprint/151710

Actions (login required)

View Item View Item