PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Exploration exploitation in Go: UCT for Monte-Carlo Go
Sylvain Gelly and Yizao Wang
In: On-line Trading of Exploration and Exploitation, 8 December 2006, Whistler, BC, Canada.

Abstract

Algorithm UCB1 for multi-armed bandit problem has already been extended to Algorithm UCT which works for minimax tree search. We have developed a Monte-Carlo program, MoGo, which is the first computer Go program using UCT. We explain our modifications of UCT for Go application, among which efficient memory management, parametrization, ordering of non-visited nodes and parallelization. MoGo is now a top-level Computer-Go program on 9 × 9 Go board.

PDF - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Conference or Workshop Item (Poster)
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Computational, Information-Theoretic Learning with Statistics
Learning/Statistics & Optimisation
Theory & Algorithms
ID Code:2713
Deposited By:Sylvain Gelly
Deposited On:22 November 2006