EPrints submitted by Csaba Szepesvari
Number of EPrints submitted by this user: 9
Model-based and Model-free Reinforcement Learning for Visual Servoing
Amir-massoud Farahmand, Azad Shademan, Martin Jagersand and Csaba Szepesvari
In: ICRA (2009).
A Convergent O(n) Algorithm for Off-policy Temporal-difference Learning with Linear Function Approximation
Richard S. Sutton, Csaba Szepesvari and Hamid R. Maei
In: NIPS-21, 8-11 Dec 2008, Vancouver, BC, Canada.
Regularized Policy Iteration
Amir-massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvari and Shie Mannor
In: NIPS-21, 8-11 Dec 2008, Vancouver, BC, Canada.
Regularized Fitted Q-iteration: Application to Planning
Amir-massoud Farahmand, Mohammad Ghavamzadeh, Shie Mannor and Csaba Szepesvari
In: EWRL 2008, 30 June - 3 July 2008, Lille, France.
Active learning in group-structured environments
Gabor Bartok, Sandra Zilles and Csaba Szepesvari
In: ALT-08, 13-16 Oct 2008, Budapest, Hungary.
Empirical Bernstein Stopping
Vladimir Mnih, Csaba Szepesvari and Jean-Yves Audibert
In: ICML 2008, 5-9 July 2008, Helsinki, Finland.
Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstractions
Alejandro Isaza, Csaba Szepesvari, Vadim Bulitko and Russ Greiner
In: UAI-08, 9-12 July 2008, Helsinki, Finland.
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping
Richard S. Sutton, Csaba Szepesvari, Alborz Geramifard and Michael Bowling
In: UAI-08, 9-12 July 2008, Helsinki, Finland.
Exploration-exploitation trade-off using variance estimates in multi-armed bandits
Jean-Yves Audibert, Remi Munos and Csaba Szepesvari
Theoretical Computer Science, 2009.