EPrints submitted by Csaba Szepesvari
Click here to see user's record. Number of EPrints submitted by this user: 18
Models of active learning in group-structured state spaces
G Bartók, Csaba Szepesvari and S Zilles
Information and Computation
Volume 208,
Number 4,
,
2010.
Convergent temporal-difference learning with arbitrary smooth function approximation
H Maei, Csaba Szepesvari, S Bhathnagar, D Silver, D Precup and R Sutton
In: NIPS-22(2009).
Training parsers by inverse reinforcement learning
G Neu and Csaba Szepesvari
Journal of Machine Learning
Volume 77,
,
2009.
ISSN http://www.sztaki.hu/~szcsaba/papers/MLJ-SISP-09.pdf
Learning When to Stop Thinking and Do Something
B Póczos, Y Abbasi-Yadkori, Csaba Szepesvari, R Greiner and N Sturtevant
In: ICML-09(2009).
Fast gradient-descent methods for temporal-difference learning with linear function approximation
R. S. Sutton, H. R. Maei, D Precup, S Bhatnagar, D Silver, Csaba Szepesvari and E Wiewiora
In: ICML-09(2009).
Learning exercise policies for american options
Y Li, Csaba Szepesvari and D Schuurmans
In: AISTAT-09, 16-18 Apr 2009, Clearwater Beach, Florida, USA.
Regularized fitted q-iteration for planning in continuous-space markovian decision problems
A. m. Farahmand, M. Ghavamzadeh, Csaba Szepesvari and S. Mannor
In: ACC-09, St. Louis, Missouri, USA(2009).
Apprenticeship learning using inverse reinforcement learning and gradient methods
G Neu and Csaba Szepesvari
In: UAI-07(2007).
Continuous time associative bandit problems
A György, L Kocsis, I Szabó and Csaba Szepesvari
In: IJCAI-07(2007).
Bandit based monte-carlo planning
L Kocsis and Csaba Szepesvari
In: 17th European Conference on Machine Learning(2006).
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
A Antos, Csaba Szepesvari and R Munos
In: COLT-06(2006).
Universal parameter optimisation in games based on SPSA
L Kocsis and Csaba Szepesvari
Machine Learning Journal
Volume 63,
,
2006.
Finite time bounds for fitted value iteration
R Munos and Csaba Szepesvari
Journal of Machine Learning Research
Volume 9,
,
2008.
RSPSA: enhanced parameter optimisation in games
L Kocsis, Csaba Szepesvari and M. H. M. Winands
In: 11th Advances in Computer Games Conference(2006).
A general projection property for distribution families
Y Yu, Y Li, Csaba Szepesvari and D Schuurmans
In: NIPS-22(2009).
Learning to segment from a few well-selected training images
A Farhangfar, R Greiner and Csaba Szepesvari
In: ICML-09(2009).
Local importance sampling: a novel technique to enhance particle filtering
P Torma and Csaba Szepesvari
Journal of Multimedia
Volume 1,
,
2006.
Reinforcement learning algorithms for MDPs
Csaba Szepesvari
Technical Report TR09-13
2009.
|