PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

EPrints submitted by Rémi Munos

Number of EPrints submitted by this user: 14

Algorithms for Infinitely Many-Armed Bandits
Yizao Wang, Rémi Munos and Jean-Yves Audibert
In: Neural Information Processing Systems, Vancouver (2008).

Optimistic Planning for Deterministic Systems
Jean Francois Hren and Rémi Munos
In: European Workshop on Reinforcement Learning, Lille (2008).

Adaptive play in Texas Hold’em Poker
Raphael Maitrepierre, Jérémie Mary and Rémi Munos
In: European Conference on Artificial Intelligence (2008).

Particle Filter-based Policy Gradient in POMDPs
Pierre-Arnaud Coquelin and Rémi Munos
In: Neural Information Processing Systems (2008).

Compressed Least-Squares Regression
Odalric-Ambrym Maillard and Rémi Munos
NIPS 2009.

Pure Exploration in Multi-armed Bandits Problems
Sébastien Bubeck, Rémi Munos and Gilles Stoltz
ALT 2009.

Hybrid Stochastic-Adversarial On-line Learning
Alessandro Lazaric and Rémi Munos
COLT 2009.

Sensitivity analysis in HMMs with application to likelihood maximization
Pierre-Arnaud Coquelin, Romain Deguest and Rémi Munos
NIPS 2009.

Scrambled objects for least-squares regression
Odalric-Ambrym Maillard and Rémi Munos
In: NIPS 2010, December 2010, Vancouver.

Error propagation for approximate policy and value iteration
Amir Massoud Farahmand, Rémi Munos and Csaba Szepesvari
In: NIPS 2010, December 2010, Vancouver.

Online learning in adversarial Lipschitz environments
Odalric-Ambrym Maillard and Rémi Munos
In: ECML 2010, Barcelona (2010).

Open loop optimistic planning
Sébastien Bubeck and Rémi Munos
In: COLT 2010, Israel (2010).

Best arm identification in multi-armed bandits
Jean-Yves Audibert, Sébastien Bubeck and Rémi Munos
In: COLT 2010, Israel (2010).

Approximate dynamic programming
Rémi Munos
In: Markov Decision Processes in Artificial Intelligence (2010), ISTE Ltd and John Wiley & Sons Inc, pp. 67-98.