EPrints submitted by Odalric-Ambrym Maillard
Number of EPrints submitted by this user: 15
Compressed Least-squares regression
Odalric-Ambrym Maillard and Rémi Munos
In:
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 22
(2010)
MIT Press
Complexity versus Agreement for Many Views
Odalric-Ambrym Maillard and Nicolas Vayatis
In:
Algorithmic Learning Theory, 20th International Conference
Lecture Notes in Computer Science
, 20
(5809).
(2009)
Springer
, Porto, Portugal
ISBN 978-3-642-04413-7
LSTD with Random Projections
Mohammad Ghavamzadeh, Alessandro Lazaric, Odalric-Ambrym Maillard and Rémi Munos
Advances in Neural Information Processing Systems
Volume 23,
2010.
Scrambled objects for least-squares regression
Odalric-Ambrym Maillard and Rémi Munos
Advances in Neural Information Processing Systems
Volume 23,
2010.
Finite-Sample Analysis of Bellman Residual Minimization
Odalric-Ambrym Maillard, Rémi Munos, Alessandro Lazaric and Mohammad Ghavamzadeh
JMLR: Workshop and Conference Proceedings
Volume 2nd Asian Conference on Machine Learning,
Number 13,
2010.
Online Learning in Adversarial Lipschitz Environments
Odalric-Ambrym Maillard and Rémi Munos
European conference on Machine learning and knowledge discovery in databases
Volume 2,
Number 2010,
2010.
Adaptive bandits: Towards the best history-dependent strategy
Odalric-Ambrym Maillard and Rémi Munos
JMLR Workshop and Conference Proceedings
Volume 15: AISTATS 2011,
2011.
Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences.
Odalric-Ambrym Maillard, Rémi Munos and Gilles Stoltz
JMLR Workshop and Conference Proceedings
Volume 19: COLT 2011,
2011.
Sparse recovery with Brownian sensing
Alexandra Carpentier, Odalric-Ambrym Maillard and Rémi Munos
Advances in Neural Information Processing Systems
Number 24,
2011.
Selecting the State-Representation in Reinforcement Learning
Odalric-Ambrym Maillard, Daniil Ryabko and Rémi Munos
Advances in Neural Information Processing Systems
Number 24,
2011.
Apprentissage Séquentiel : Bandits, Statistique et Renforcement.
Odalric-Ambrym Maillard
(2011)
PhD thesis, Université Lille 1.
Linear regression with random projections.
Odalric-Ambrym Maillard and Rémi Munos
Journal of Machine Learning Research
Volume 13,
2012.
Online allocation and homogeneous partitioning for piecewise constant mean-approximation.
Alexandra Carpentier and Odalric-Ambrym Maillard
Neural Information Processing Systems
Number 25,
2012.
Hierarchical Optimistic Region Selection driven by Curiosity.
Odalric-Ambrym Maillard
Neural Information Processing Systems
Number 25,
2012.
Optimal regret bounds for selecting the state representation in reinforcement learning.
Odalric-Ambrym Maillard, Phuong Nguyen, Ronald Ortner and Daniil Ryabko
International conference on machine learning
2012.