SGD-QN: Careful Quasi-Newton Stochastic Gradient Descent
Antoine Bordes, Leon Bottou and Patrick Gallinari
Journal of Machine Learning Research
The SGD-QN algorithm is a stochastic gradient descent algorithm that makes careful use of second-order information and splits the parameter update into independently scheduled components. Thanks to this design, SGD-QN iterates nearly as fast as a first-order stochastic gradient descent but requires less iterations to achieve the same accuracy. This algorithm won the “Wild Track” of the first PASCAL Large Scale Learning Challenge (Sonnenburg et al., 2008).