PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

EPrints submitted by Csaba Szepesvari

Number of EPrints submitted by this user: 18

Models of active learning in group-structured state spaces
G Bartók, Csaba Szepesvari and S Zilles
Information and Computation Volume 208, Number 4, pp. 364-384, 2010.

Convergent temporal-difference learning with arbitrary smooth function approximation
H Maei, Csaba Szepesvari, S Bhatnagar, D Silver, D Precup and R Sutton
In: NIPS-22(2009).

Training parsers by inverse reinforcement learning
G Neu and Csaba Szepesvari
Machine Learning Journal Volume 77, pp. 303-337, 2009. http://www.sztaki.hu/~szcsaba/papers/MLJ-SISP-09.pdf

Learning When to Stop Thinking and Do Something
B Póczos, Y Abbasi-Yadkori, Csaba Szepesvari, R Greiner and N Sturtevant
In: ICML-09(2009).

Fast gradient-descent methods for temporal-difference learning with linear function approximation
R. S. Sutton, H. R. Maei, D Precup, S Bhatnagar, D Silver, Csaba Szepesvari and E Wiewiora
In: ICML-09(2009).

Learning exercise policies for American options
Y Li, Csaba Szepesvari and D Schuurmans
In: AISTATS-09, 16-18 Apr 2009, Clearwater Beach, Florida, USA.

Regularized fitted Q-iteration for planning in continuous-space Markovian decision problems
A. M. Farahmand, M. Ghavamzadeh, Csaba Szepesvari and S. Mannor
In: ACC-09, St. Louis, Missouri, USA(2009).

Apprenticeship learning using inverse reinforcement learning and gradient methods
G Neu and Csaba Szepesvari
In: UAI-07(2007).

Continuous time associative bandit problems
A György, L Kocsis, I Szabó and Csaba Szepesvari
In: IJCAI-07(2007).

Bandit based Monte-Carlo planning
L Kocsis and Csaba Szepesvari
In: 17th European Conference on Machine Learning(2006).

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
A Antos, Csaba Szepesvari and R Munos
In: COLT-06(2006).

Universal parameter optimisation in games based on SPSA
L Kocsis and Csaba Szepesvari
Machine Learning Journal Volume 63, pp. 249-286, 2006.

Finite time bounds for fitted value iteration
R Munos and Csaba Szepesvari
Journal of Machine Learning Research Volume 9, pp. 815-857, 2008.

RSPSA: enhanced parameter optimisation in games
L Kocsis, Csaba Szepesvari and M. H. M. Winands
In: 11th Advances in Computer Games Conference(2006).

A general projection property for distribution families
Y Yu, Y Li, Csaba Szepesvari and D Schuurmans
In: NIPS-22(2009).

Learning to segment from a few well-selected training images
A Farhangfar, R Greiner and Csaba Szepesvari
In: ICML-09(2009).

Local importance sampling: a novel technique to enhance particle filtering
P Torma and Csaba Szepesvari
Journal of Multimedia Volume 1, pp. 32-43, 2006.

Reinforcement learning algorithms for MDPs
Csaba Szepesvari
Technical Report TR09-13 2009.