PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Learning Kernel Based Halfspaces with the 0-1 Loss
Shai Shalev-Shwartz, Ohad Shamir and Karthik Sridharan
SIAM Journal on Computing 2011.


We describe and analyze a new algorithm for agnostically learning kernel-based halfspaces with respect to the 0-1 loss function. Unlike most previous formulations which rely on surrogate convex loss functions (e.g. hinge-loss in SVM and log-loss in logistic regression), we provide nite time/sample guarantees with respect to the more natural 0-1 loss function. The proposed algorithm can learn kernel-based halfspaces in worst-case time poly(exp(L log(L=))), for any distribution, where L is a Lipschitz constant (which can be thought of as the reciprocal of the margin), and the learned classier is worse than the optimal halfspace by at most . We also prove a hardness result, showing that under a certain cryptographic assumption, no algorithm can learn kernel-based halfspaces in time polynomial in L

PDF - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Article
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Theory & Algorithms
ID Code:8907
Deposited By:Shai Shalev-Shwartz
Deposited On:21 February 2012