PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

EPrints submitted by Odalric-Ambrym Maillard

Click here to see user's record.

Number of EPrints submitted by this user: 15

Compressed Least-squares regression
Odalric-Ambrym Maillard and Rémi Munos
In: ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 22 (2010) MIT Press , pp. 1213-1221.

Complexity versus Agreement for Many Views
Odalric-Ambrym Maillard and Nicolas Vayatis
In: Algorithmic Learning Theory, 20th International Conference Lecture Notes in Computer Science , 20 (5809). (2009) Springer , Porto, Portugal , pp. 232-246. ISBN 978-3-642-04413-7

LSTD with Random Projections
Mohammad Ghavamzadeh, Alessandro Lazaric, Odalric-Ambrym Maillard and Rémi Munos
Advances in Neural Information Processing Systems Volume 23, pp. 721-729, 2010.

Scrambled objects for least-squares regression
Odalric-Ambrym Maillard and Rémi Munos
Advances in Neural Information Processing Systems Volume 23, pp. 1549-1557, 2010.

Finite-Sample Analysis of Bellman Residual Minimization
Odalric-Ambrym Maillard, Rémi Munos, Alessandro Lazaric and Mohammad Ghavamzadeh
JMLR: Workshop and Conference Proceedings Volume 2nd Asian Conference on Machine Learning, Number 13, pp. 299-314, 2010.

Online Learning in Adversarial Lipschitz Environments
Odalric-Ambrym Maillard and Rémi Munos
European conference on Machine learning and knowledge discovery in databases Volume 2, Number 2010, pp. 305-320, 2010.

Adaptive bandits: Towards the best history-dependent strategy
Odalric-Ambrym Maillard and Rémi Munos
JMLR Workshop and Conference Proceedings Volume Volume 15: AISTATS 2011, pp. 570-578, 2011.

Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences.
Odalric-Ambrym Maillard, Rémi Munos and Gilles Stoltz
JMLR Workshop and Conference Proceedings Volume Volume 19: COLT 2011, pp. 497-514, 2011.

Sparse recovery with Brownian sensing
Alexandra Carpentier, Odalric-Ambrym Maillard and Rémi Munos
Advances in Neural Information Processing Systems Number 24, pp. 1782-1790, 2011.

Selecting the State-Representation in Reinforcement Learning
Odalric-Ambrym Maillard, Daniil Ryabko and Rémi Munos
Advances in Neural Information Processing Systems Number 24, pp. 2627-2635, 2011.

Apprentissage Séquentiel : Bandits, Statistique et Renforcement.
Odalric-Ambrym Maillard
(2011) PhD thesis, Université Lille 1.

Linear regression with random projections.
Odalric-Ambrym Maillard and Rémi Munos
Journal of Machine Learning Research Volume 13, pp. 2735-2772, 2012.

Online allocation and homogeneous partitioning for piecewise constant mean-approximation.
Alexandra Carpentier and Odalric-Ambrym Maillard
Neural Information Processing Systems Number 25, 2012.

Hierarchical Optimistic Region Selection driven by Curiosity.
Odalric-Ambrym Maillard
Neural Information Processing Systems Number 25, 2012.

Optimal regret bounds for selecting the state representation in reinforcement learning.
Odalric-Ambrym Maillard, Phuong Nguyen, Ronald Ortner and Daniil Ryabko
International conference on machine learning 2012.