Active learning image spam hunter

Yan Gao*, Alok Choudhary

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

Image spam is annoying email users around the world. Most previous work for image spam detection focuses on supervised learning approaches. However, it is costly to get enough trustworthy labels for learning, especially for an adversarial problem where spammers constantly modify patterns to evade the classifier. To address this issue, we employ the principle of active learning where the learner guides the user to label as few images as possible while maximizing the classification accuracy. Active learning is more suited for online image spam filtering since it dramatically reduces the labeling costs with negligible overhead while maintaining high recognition performance. We present and compare two active learning algorithms, based on an SVM and a Gaussian process classifier respectively. To the best of our knowledge, we are the first to apply active learning for the task of spam image filtering. Experimental results demonstrate that our active learning based approaches quickly achieve >99% high detection rate and <0.5% low false positive rate with small number of images being labeled.

Original languageEnglish (US)
Title of host publicationAdvances in Visual Computing - 5th International Symposium, ISVC 2009, Proceedings
Pages293-302
Number of pages10
EditionPART 2
DOIs
StatePublished - 2009
Event5th International Symposium on Advances in Visual Computing, ISVC 2009 - Las Vegas, NV, United States
Duration: Nov 30 2009Dec 2 2009

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 2
Volume5876 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other5th International Symposium on Advances in Visual Computing, ISVC 2009
Country/TerritoryUnited States
CityLas Vegas, NV
Period11/30/0912/2/09

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Active learning image spam hunter'. Together they form a unique fingerprint.

Cite this