[Computer-go] FYI KL-UCB

Łukasz Lew lukasz.lew at gmail.com
Mon Jul 22 23:57:35 PDT 2013


On Tue, Jul 23, 2013 at 8:50 AM, Hideki Kato <hideki_katoh at ybb.ne.jp> wrote:

> Thanks Lukasz,
>
> For introducing such an interesting paper.
>
> I have a quesion, though.  The second algorithm in Figures 1, 2 and 3
> is termed UCB2 but is apparently called MOSS in Sections 5 (and 1).  Do
> you know which algorithm is actually used in the numerical
> experiments?
>

I don't know, but you might mail the author.


>
> BTW, I guess for MC Go programs, possibly the least "risky" algorithm be
> the best in practice, isn't it?
>

I won't speculate. Only experiments can tell.


>
> Hideki
>
>  ukasz Lew: <
> CAPXT8E4pMwmvkiiTuyHHpBVavgeUPGQLNnODyJoAmFGo0uOo_g at mail.gmail.com>:
> >KL-UCB algorithm
> >http://arxiv.org/pdf/1102.2490v4.pdf
> >
> >"Thus, KL-UCB is optimal for Bernoulli distributions and strictly
> dominates
> >a-UCB for any
> >bounded reward distributions."
> >http://www.princeton.edu/~sbubeck/SurveyBCB12.pdf (page 18)
> --
> Hideki Kato <mailto:hideki_katoh at ybb.ne.jp>
> _______________________________________________
> Computer-go mailing list
> Computer-go at dvandva.org
> http://dvandva.org/cgi-bin/mailman/listinfo/computer-go
>



-- 
Łukasz
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://computer-go.org/pipermail/computer-go/attachments/20130723/16615f2a/attachment.html>


More information about the Computer-go mailing list