PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

EPrints submitted by Alessandro Lazaric

Click here to see user's record.

Number of EPrints submitted by this user: 13

Hybrid Stochastic-Adversarial On-line Learning
Alessandro Lazaric and Rémi Munos
In: Twenty-Second Annual Conference on Learning Theory (COLT-2009), 18-20 Jun 2009, Montreal, Canada.

Bayesian Multi-Task Reinforcement Learning
Alessandro Lazaric and Mohammad Ghavamzadeh
In: Twenty-Seventh International Conference on Machine Learning (ICML-2010), 21-24 June 2010, Haifa, Israel.

Finite-Sample Analysis of LSTD
Alessandro Lazaric, Mohammad Ghavamzadeh and Rémi Munos
In: Twenty-Seventh International Conference on Machine Learning (ICML-2010), 21-24 June 2010, Haifa, Israel.

Analysis of a Classification-based Policy Iteration Algorithm
Alessandro Lazaric, Mohammad Ghavamzadeh and Rémi Munos
In: Twenty-Seventh International Conference on Machine Learning (ICML-2010), 21-24 June 2010, Haifa, Israel.

Finite-Sample Analysis of Bellman Residual Minimization
Odalric-Ambrym Maillard, Rémi Munos, Alessandro Lazaric and Mohammad Ghavamzadeh
In: Second Asian Conference on Machine Learning (ACML-2010), 8-10 November 2010, Tokyo, Japan.

Rollout Allocation Strategies for Classification-based Policy Iteration
Victor Gabillon, Alessandro Lazaric and Mohammad Ghavamzadeh
In: ICML 2010 - Workshop Reinforcement Learning and Search in Very Large Spaces, 21-24 June 2010, Haifa, Israel.

LSTD with Random Projections
Mohammad Ghavamzadeh, Alessandro Lazaric, Rémi Munos and Odalric-Ambrym Maillard
In: Twenty-Fourth Annual Conference on Advances in Neural Information Processing Systems (NIPS-2010), 6-9 December 2010, Vancouver, Canada.

Learning with stochastic inputs and adversarial outputs
Alessandro Lazaric and Rémi Munos
Journal of Computer and System Sciences (JCSS) Volume 78, Number 5, pp. 1516-1537, 2012.

Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence
Victor Gabillon, Mohammad Ghavamzadeh and Alessandro Lazaric
In: Twenty-Sixth Annual Conference on Neural Information Processing Systems (NIPS'12), Lake Tahoe, Nevada, USA(2012).

Conservative and Greedy Approaches to Classification-based Policy Iteration
Mohammad Ghavamzadeh and Alessandro Lazaric
In: 26th Conference on Artificial Intelligence (AAAI'12)(2012).

Risk Averse Multi-Arm Bandits
Amir Sani, Alessandro Lazaric and Rémi Munos
In: Twenty-Sixth Annual Conference on Neural Information Processing Systems (NIPS'12)(2012).

A Dantzig Selector Approach to Temporal Difference Learning
Matthieu Geist, Bruno Scherrer, Alessandro Lazaric and Mohammad Ghavamzadeh
In: 29th International Conference on Machine Learning (ICML)(2012).

A Truthful Learning Mechanism for Multi-Slot Sponsored Search Auctions with Externalities (Extended Abstract)
Nicola Gatti, Alessandro Lazaric and Francesco Trovo
In: 13th ACM Conference on Electronic Commerce (EC'12)(2012).