[Computer-go] Multi-armed bandit problem theory

Petr Baudis pasky at ucw.cz
Wed Oct 26 05:51:02 PDT 2011


On Wed, Oct 26, 2011 at 01:56:03PM +0200, "Ingo Althöfer" wrote:
> Not a direct answer, but some bit of information:
> Bandit theory started in the early 1950s with Herbert Robbins
> (the same Robbins from the 1985 paper). However, he did
> not prove the best possible bounds in that seminal paper.

Yes, I actually have a copy of that paper, but it doesn't seem likely
to help me better understand the later results.

				Petr "Pasky" Baudis


