EPrints submitted by Rémi Munos
Click here to see user's record. Number of EPrints submitted by this user: 14
Algorithms for Infinitely Many-Armed Bandits
Yizao Wang, Rémi Munos and Jean-Yves Audibert
In: Neural Information Processig Systems, Vancouver(2008).
Optimistic Planning for Deterministic Systems
Jean Francois Hren and Rémi Munos
In: European Workshop on Reinforcement Learning, Lille(2008).
Adaptive play in Texas Hold’em Poker.
Raphael Maitrepierre, Jérémie Mary and Rémi Munos
In: European Conference on Artificial Intelligence(2008).
Particle Filter-based Policy Gradient in POMDPs
Pierre-Arnaud Coquelin and Rémi Munos
In: Neural Information Processing Systems(2008).
Compressed Least-Squares Regression
Odalric-Ambrym Maillard and Rémi Munos
NIPS 2009
2009.
Pure Exploration in Multi-armed Bandits Problems
Sébastien Bubeck, Rémi Munos and Gilles Stoltz
ALT 2009
2009.
Hybrid Stochastic-Adversarial On-line Learning
Alessandro Lazaric and Rémi Munos
COLT 2009
2009.
Sensitivity analysis in HMMs with application to likelihood maximization
Coquelin Pierre-Arnaud, Deguest Romain and Rémi Munos
NIPS 2009
2009.
Scrambled objects for least-squares regression
Odalric-Ambrym Maillard and Rémi Munos
In: NIPS 2010, december 2010, Vancouver.
Error propagation for approximate policy and value iteration
Amir Massoud Farahmand, Rémi Munos and Csaba Szepesvari
In: NIPS 2010, december 2010, Vancouver.
Online learning in adversarial lipschitz environments
Odalric-Ambrym Maillard and Rémi Munos
In: ECML 2010, Barcelone(2010).
Open loop optimistic planning
Sébastien Bubeck and Rémi Munos
In: COLT 2010, Israel(2010).
Best arm identification in multi-armed bandits
Jean-Yves Audibert, Sébastien Bubeck and Rémi Munos
In: COLT 2010, Israel(2010).
Approximate dynamic programming
Rémi Munos
In:
Markov Decision Processes in Artificial Intelligence
(2010)
ISTE Ltd and John Wiley & Sons Inc
, .
|