EPrints submitted by Alessandro Lazaric
Click here to see user's record. Number of EPrints submitted by this user: 13
Hybrid Stochastic-Adversarial On-line Learning
Alessandro Lazaric and Rémi Munos
In: Twenty-Second Annual Conference on Learning Theory (COLT-2009), 18-20 Jun 2009, Montreal, Canada.
Bayesian Multi-Task Reinforcement Learning
Alessandro Lazaric and Mohammad Ghavamzadeh
In: Twenty-Seventh International Conference on Machine Learning (ICML-2010), 21-24 June 2010, Haifa, Israel.
Finite-Sample Analysis of LSTD
Alessandro Lazaric, Mohammad Ghavamzadeh and Rémi Munos
In: Twenty-Seventh International Conference on Machine Learning (ICML-2010), 21-24 June 2010, Haifa, Israel.
Analysis of a Classification-based Policy Iteration Algorithm
Alessandro Lazaric, Mohammad Ghavamzadeh and Rémi Munos
In: Twenty-Seventh International Conference on Machine Learning (ICML-2010), 21-24 June 2010, Haifa, Israel.
Finite-Sample Analysis of Bellman Residual Minimization
Odalric-Ambrym Maillard, Rémi Munos, Alessandro Lazaric and Mohammad Ghavamzadeh
In: Second Asian Conference on Machine Learning (ACML-2010), 8-10 November 2010, Tokyo, Japan.
Rollout Allocation Strategies for Classification-based Policy Iteration
Victor Gabillon, Alessandro Lazaric and Mohammad Ghavamzadeh
In: ICML 2010 - Workshop Reinforcement Learning and Search in Very Large Spaces, 21-24 June 2010, Haifa, Israel.
LSTD with Random Projections
Mohammad Ghavamzadeh, Alessandro Lazaric, Rémi Munos and Odalric-Ambrym Maillard
In: Twenty-Fourth Annual Conference on Advances in Neural Information Processing Systems (NIPS-2010), 6-9 December 2010, Vancouver, Canada.
Learning with stochastic inputs and adversarial outputs
Alessandro Lazaric and Rémi Munos
Journal of Computer and System Sciences (JCSS)
Volume 78,
Number 5,
,
2012.
Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence
Victor Gabillon, Mohammad Ghavamzadeh and Alessandro Lazaric
In: Twenty-Sixth Annual Conference on Neural Information Processing Systems (NIPS'12), Lake Tahoe, Nevada, USA(2012).
Conservative and Greedy Approaches to Classification-based Policy Iteration
Mohammad Ghavamzadeh and Alessandro Lazaric
In: 26th Conference on Artificial Intelligence (AAAI'12)(2012).
Risk Averse Multi-Arm Bandits
Amir Sani, Alessandro Lazaric and Rémi Munos
In: Twenty-Sixth Annual Conference on Neural Information Processing Systems (NIPS'12)(2012).
A Dantzig Selector Approach to Temporal Difference Learning
Matthieu Geist, Bruno Scherrer, Alessandro Lazaric and Mohammad Ghavamzadeh
In: 29th International Conference on Machine Learning (ICML)(2012).
A Truthful Learning Mechanism for Multi-Slot Sponsored Search Auctions with Externalities (Extended Abstract)
Nicola Gatti, Alessandro Lazaric and Francesco Trovo
In: 13th ACM Conference on Electronic Commerce (EC'12)(2012).
|