PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Batch Reinforcement Learning Methods for Point to Point movements
Gerhard Neumann, Michael Pfeiffer and Helmut Hauser
(2006) Technical Report. Graz University of Technology, Graz, Austria.


In this report we investigate several batch-mode reinforcement learning (BRL) algorithms for continuous control problems. Interest in BRL is growing in the research community because it has several attractive properties: training data is used more efficiently, learning can take place completely offline from randomly generated episodes, and supervised batch-mode regression algorithms such as regression trees or batch-mode neural network training (which are known to have better convergence properties) can be used. We investigate Experience Replay, which was one of the first batch-mode algorithms, as well as Monte Carlo Learning, Fitted Q-Iteration, and some modifications of these algorithms. The results are compared across different function approximation schemes: Regression Forests, Model Trees (Forests), Local Regression, Neural Networks, LWPR, and RBF networks. The comparison is carried out on the Point to Point Movement task, which captures the basic characteristics of moving the center of mass of a humanoid robot.
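To make the offline, regression-based setting concrete, the following is a minimal sketch of Fitted Q-Iteration and is not taken from the report. It assumes a toy 1-D point-to-point task (position in [-1, 1], three displacement actions, reward equal to the negative distance to a target at 0) and uses a per-bin mean as a stand-in for the regression methods named above. All names, constants, and the task itself are hypothetical.

```python
import random

# Illustrative assumptions only; none of these values come from the report.
ACTIONS = (-0.1, 0.0, 0.1)   # hypothetical 1-D displacement commands
N_BINS = 20                  # number of state bins over [-1, 1]


def to_bin(x):
    """Map a position in [-1, 1] to a bin index in [0, N_BINS - 1]."""
    return min(N_BINS - 1, max(0, int((x + 1.0) / 2.0 * N_BINS)))


def random_transitions(n, rng, target=0.0):
    """Generate (s, a, r, s_next) tuples with a uniformly random policy."""
    data = []
    for _ in range(n):
        s = rng.uniform(-1.0, 1.0)
        a = rng.randrange(len(ACTIONS))
        s_next = min(1.0, max(-1.0, s + ACTIONS[a]))
        r = -abs(s_next - target)  # reward: negative distance to the target
        data.append((s, a, r, s_next))
    return data


def fit_bin_means(samples):
    """Batch regression step: mean target per (state bin, action) cell."""
    sums = [[0.0] * len(ACTIONS) for _ in range(N_BINS)]
    counts = [[0] * len(ACTIONS) for _ in range(N_BINS)]
    for s, a, y in samples:
        b = to_bin(s)
        sums[b][a] += y
        counts[b][a] += 1
    return [[sums[b][a] / counts[b][a] if counts[b][a] else 0.0
             for a in range(len(ACTIONS))] for b in range(N_BINS)]


def fitted_q_iteration(data, gamma=0.9, n_iters=50):
    """Repeatedly regress Q onto r + gamma * max_a' Q(s_next, a')."""
    q = [[0.0] * len(ACTIONS) for _ in range(N_BINS)]
    for _ in range(n_iters):
        targets = [(s, a, r + gamma * max(q[to_bin(s_next)]))
                   for s, a, r, s_next in data]
        q = fit_bin_means(targets)  # refit on the whole batch each iteration
    return q


def greedy_action(q, x):
    """Return the displacement with the highest estimated Q-value at x."""
    row = q[to_bin(x)]
    return ACTIONS[row.index(max(row))]


if __name__ == "__main__":
    batch = random_transitions(5000, random.Random(0))
    q_table = fitted_q_iteration(batch)
    for x in (-0.75, 0.75):
        print(x, greedy_action(q_table, x))
```

The batch is collected once by a random policy and then reused unchanged in every iteration, which is the data-efficiency property described above. Replacing `fit_bin_means` with another batch regressor would give a different variant of the same scheme.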

EPrint Type: Monograph (Technical Report)
Subjects: Learning/Statistics & Optimisation
ID Code: 2616
Deposited By: Gerhard Neumann
Deposited On: 22 November 2006