A comparison of features for POS tagging in Kannada

Atmakuri, Shriya and Shahi, Bhavya and Rao, Ashwath B and Muralikrishna, S N (2018) A comparison of features for POS tagging in Kannada. International Journal of Engineering & Technology, 7 (4). pp. 2418-2421. ISSN 2394-627X

Official URL: http://www.sciencepubco.com/index.php/IJET


This paper proposes a part-of-speech (POS) tagging system for the South Indian language Kannada based on supervised machine learning. POS tagging is an important step in Natural Language Processing, with applications such as word sense disambiguation and natural language understanding. After an extensive review of methods used for POS tagging, Conditional Random Fields (CRFs) were chosen as the algorithm. CRFs are sequence models used for tasks such as POS tagging and named entity recognition, and they serve as an alternative to Hidden Markov Models. Three very large corpora are used, the feature set is varied for each, and the results are compared to determine the best method for the task.
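
As an illustration of what varying a feature set for a CRF tagger can look like, the following is a minimal sketch and not the authors' implementation. The function names (`word_features`, `sentence_features`), the switches `use_affixes` and `use_context`, and the specific features (affixes up to three characters, neighbouring words) are all assumptions chosen for illustration; the abstract does not list the features actually compared. The output is one feature dictionary per token, a format that CRF toolkits commonly accept.

```python
from typing import Dict, List


def word_features(sentence: List[str], i: int,
                  use_affixes: bool = True,
                  use_context: bool = True) -> Dict[str, object]:
    """Build a feature dict for token i of a tokenized sentence.

    Hypothetical feature set: the switches let two configurations
    be compared, in the spirit of varying the features per corpus.
    """
    word = sentence[i]
    features: Dict[str, object] = {
        "bias": 1.0,
        "word": word,
        "is_digit": word.isdigit(),
        "length": len(word),
    }
    if use_affixes:
        # Affixes are sliced by Unicode code point. In Kannada script a
        # visible character can span several code points, so this is
        # only an approximation of a morphological suffix or prefix.
        for n in (1, 2, 3):
            if len(word) > n:
                features[f"prefix{n}"] = word[:n]
                features[f"suffix{n}"] = word[-n:]
    if use_context:
        # Neighbouring words, with sentinel values at sentence boundaries.
        features["prev_word"] = sentence[i - 1] if i > 0 else "<BOS>"
        features["next_word"] = (
            sentence[i + 1] if i < len(sentence) - 1 else "<EOS>"
        )
    return features


def sentence_features(sentence: List[str], **options) -> List[Dict[str, object]]:
    """Feature dicts for every token of one sentence."""
    return [word_features(sentence, i, **options) for i in range(len(sentence))]


# Example sentence (an assumption, not taken from the paper's corpora).
example = ["ನಾನು", "ಮನೆಗೆ", "ಹೋಗುತ್ತೇನೆ"]
full = sentence_features(example)
lexical_only = sentence_features(example, use_affixes=False, use_context=False)
```

Training one CRF on `full`-style features and another on `lexical_only`-style features, then comparing tagging accuracy on held-out data, would be one way to run the kind of feature comparison the abstract describes.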

Item Type: Article
Uncontrolled Keywords: Conditional Random Field; Indian languages; Kannada; Natural Language Processing; POS tagging
Subjects: Engineering > MIT Manipal > Computer Science and Engineering
Depositing User: MIT Library
Date Deposited: 09 Oct 2018 06:36
Last Modified: 09 Oct 2018 06:36
URI: http://eprints.manipal.edu/id/eprint/152070
