PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

EPrints submitted by Csaba Szepesvari

Number of EPrints submitted by this user: 9

Model-based and Model-free Reinforcement Learning for Visual Servoing
Amir massoud Farahmand, Azad Shademan, Martin Jagersand and Csaba Szepesvari
In: ICRA 2009.

A Convergent O(n) Algorithm for Off-policy Temporal-difference Learning with Linear Function Approximation
Richard S. Sutton, Csaba Szepesvari and Hamid R. Maei
In: NIPS-21, 8-11 Dec 2008, Vancouver, BC, Canada.

Regularized Policy Iteration
Amir massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvari and Shie Mannor
In: NIPS-21, 8-11 Dec 2008, Vancouver, BC, Canada.

Regularized Fitted Q-iteration: Application to Planning
Amir massoud Farahmand, Mohammad Ghavamzadeh, Shie Mannor and Csaba Szepesvari
In: EWRL 2008, 30 June - 3 July 2008, Lille, France.

Active learning in group-structured environments
Gabor Bartok, Sandra Zilles and Csaba Szepesvari
In: ALT-08, 13-16 Oct 2008, Budapest, Hungary.

Empirical Bernstein Stopping
Vladimir Mnih, Csaba Szepesvari and Jean-Yves Audibert
In: ICML 2008, 5-9 July 2008, Helsinki, Finland.

Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstractions
Alejandro Isaza, Csaba Szepesvari, Vadim Bulitko and Russ Greiner
In: UAI-08, 9-12 July 2008, Helsinki, Finland.

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping
Richard S. Sutton, Csaba Szepesvari, Alborz Geramifard and Michael Bowling
In: UAI-08, 9-12 July 2008, Helsinki, Finland.

Exploration-exploitation trade-off using variance estimates in multi-armed bandits
Jean-Yves Audibert, Remi Munos and Csaba Szepesvari
Theoretical Computer Science 2009.