PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

A Kernel Statistical Test of Independence
A. Gretton, K. Fukumizu, C.H. Teo, L. Song, B. Schölkopf and A.J. Smola
In: Advances in Neural Information Processing Systems 20: Proceedings of the 2007 Conference (2008) MIT Press , Cambridge, MA, USA , pp. 585-592.


Whereas kernel measures of independence have been widely applied in machine learning (notably in kernel ICA), there is as yet no method to determine whether they have detected statistically significant dependence. We provide a novel test of the independence hypothesis for one particular kernel independence measure, the Hilbert-Schmidt independence criterion (HSIC). The resulting test costs O(m^2), where m is the sample size. We demonstrate that this test outperforms established contingency table-based tests. Finally, we show the HSIC test also applies to text (and to structured data more generally), for which no other independence test presently exists.

PDF - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Book Section
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Computational, Information-Theoretic Learning with Statistics
Learning/Statistics & Optimisation
Brain Computer Interfaces
Theory & Algorithms
ID Code:4335
Deposited By:Bernhard Schölkopf
Deposited On:13 March 2009