Semi-Supervised Support Vector Machines and Application to Spam Filtering
In: ECML 2006 Discovery Challenge, 22 September 2006, Berlin, Germany.
After introducing the semi-supervised support vector machine (aka TSVM for "transductive SVM"), a few popular training strategies are briefly presented. Then the assumptions underlying semi-supervised learning are reviewed. Finally, two modern TSVM optimization techniques are applied to the spam filtering data sets of the workshop; it is shown that they can achieve excellent results, if the problem of the data being non-iid can be handled properly.