Text based Machine Learning Using Discriminative Classifiers

Chatterjee, Rangom and Acharya, Vasundhara and Krishna Prakasha, K and Arjunan, Vijaya R (2019) Text based Machine Learning Using Discriminative Classifiers. Journal of Advanced Research in Dynamical and Control Systems, 11 (7). pp. 32-41. ISSN 1943-023X

[img] PDF
7306.pdf - Published Version
Restricted to Registered users only

Download (328kB) | Request a copy

Abstract

Ever since the invention of computer, a curiosity exists to see if it can be made to learn. If humans could understand how to program them and learn to improve automatically with experience, the impact would be dramatic. A successful understanding of how to make computers learn would open up many new uses of computers and new levels of competence and customization. In this paper, two applications of Machine Learning are explored. In the first one, linear regression to understand the correlation of the feature columns with the output and make predictions based on the “line of best fit” is given. In the second one, discriminative classifiers for analyzing and segregating text-based data is proposed. On applying regression analysis on advertising data, it is observed that TV advertising has the strongest linear correlation with sales. In the later section, text-based machine learning is employed using the scikit-learn library of Python. Multiple contemporary classifiers are applied on a set of SMS’s to perform spam detection. The performance of the classifiers is evaluated using suitable accuracy metrics. The results show that the Naïve Bayes algorithm is much faster than other algorithms such as Logistic Regression. Using a Bayesian probabilistic approach, a spam ratio is attached to all the tokens in the input set. The proposed work proves to be helpful in the field of advertising and spam detection systems

Item Type: Article
Uncontrolled Keywords: Information Retrieval, Machine Learning, Predictive Models, Probabilistic Algorithms, Supervised Learning.
Subjects: Engineering > MIT Manipal > Computer Science and Engineering
Engineering > MIT Manipal > Information and Communication Technology
Depositing User: MIT Library
Date Deposited: 19 Sep 2019 06:26
Last Modified: 19 Sep 2019 06:26
URI: http://eprints.manipal.edu/id/eprint/154580

Actions (login required)

View Item View Item