N-Gram Assisted Youtube Spam Comment Detection

Aiyar, Shreyas and Shetty, Nisha P (2018) N-Gram Assisted Youtube Spam Comment Detection. In: International Conference on Computational Intelligence and Data Science, 07/04/2018, Gurugram.

Abstract

This paper proposes a novel methodology for detecting intrusive comments, or spam, on the video-sharing website Youtube. We define spam comments as those that have a promotional intent or that are deemed contextually irrelevant to a given video. Over the years, the prospect of monetisation through advertising on popular social media channels has attracted an increasingly large number of users. This has in turn led to a growth in malicious users who have begun to develop automated bots capable of large-scale, orchestrated deployment of spam messages across multiple channels simultaneously. The presence of these comments significantly hurts the reputation of a channel and the experience of normal users. Youtube itself has tackled this issue with very limited methods that revolve around blocking comments that contain links. Such methods have proven to be extremely ineffective, as spammers have found ways to bypass such heuristics. Standard machine learning classification algorithms have proven to be somewhat effective, but there is still room for better accuracy with new approaches. In this work, we attempt to detect such comments by applying conventional machine learning algorithms such as Random Forest, Support Vector Machine, and Naive Bayes, along with certain custom heuristics such as N-Grams, which have proven to be very effective in detecting and subsequently combating spam comments.
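
As an illustration of the general idea only, the following is a minimal sketch of how word and character n-grams might feed a Naive Bayes classifier. It is not the authors' implementation: the abstract does not specify the paper's preprocessing, feature set, or dataset, so the choice of n values, the function and class names, and the toy comments below are all assumptions made for this example. Random Forest and Support Vector Machine, which the abstract also names, are not shown.

```python
from collections import Counter
import math


def word_ngrams(text, n):
    """Return the word n-grams of a lowercased, whitespace-split text."""
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def char_ngrams(text, n):
    """Return the character n-grams of a lowercased text."""
    s = text.lower()
    return [s[i:i + n] for i in range(len(s) - n + 1)]


def features(text):
    # Assumed feature set for illustration: word unigrams, word bigrams,
    # and character trigrams. The paper's actual choice is not stated here.
    return word_ngrams(text, 1) + word_ngrams(text, 2) + char_ngrams(text, 3)


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.counts = {}                   # label -> Counter of feature counts
        self.doc_counts = Counter(labels)  # label -> number of training comments
        self.vocab = set()
        for text, label in zip(texts, labels):
            feats = features(text)
            self.counts.setdefault(label, Counter()).update(feats)
            self.vocab.update(feats)
        return self

    def predict(self, text):
        total_docs = sum(self.doc_counts.values())
        best_label, best_score = None, -math.inf
        for label, counter in self.counts.items():
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(self.doc_counts[label] / total_docs)
            denom = sum(counter.values()) + len(self.vocab)
            for f in features(text):
                score += math.log((counter[f] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Invented toy comments, not taken from the paper's data.
train_texts = [
    "check out my channel and subscribe",
    "free gift cards click the link",
    "this song is beautiful",
    "great video love the guitar solo",
]
train_labels = ["spam", "spam", "ham", "ham"]

model = NaiveBayes().fit(train_texts, train_labels)
print(model.predict("subscribe to my channel"))
```

Character n-grams are included in the sketch because they can still match when a spammer alters spacing or spelling, which is one plausible reason n-gram features help against comments written to evade simple link-blocking rules.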

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: Spam; Youtube; N-Gram; Naive-Bayes; Random Forest; Support Vector Machine; Word Gram; Character Gram
Subjects: Engineering > MIT Manipal > Information and Communication Technology
Depositing User: MIT Library
Date Deposited: 16 Jul 2018 06:56
Last Modified: 16 Jul 2018 06:56
URI: http://eprints.manipal.edu/id/eprint/151592
