[Computer-go] Multi-armed bandit problem theory
pasky at ucw.cz
Wed Oct 26 05:51:02 PDT 2011
On Wed, Oct 26, 2011 at 01:56:03PM +0200, "Ingo Althöfer" wrote:
> Not a direct answer, but some bit of information:
> Bandit theory was started in the early 1950s by Herbert Robbins
> (the same Robbins from the 1985 paper). However, he did
> not prove best possible bounds in the seminal paper.
Yes, I actually have a copy of that paper, but it doesn't seem likely to
help me better understand the later results.
Petr "Pasky" Baudis