[computer-go] On question about Libego110

Łukasz Lew lukasz.lew at gmail.com
Thu Jan 10 05:47:54 PST 2008


The code of any version is easy to get:
http://www.mimuw.edu.pl/~lew/hg/libego/?tags

The file you are talking about is here:
http://www.mimuw.edu.pl/~lew/hg/libego/?file/dfcd0a6db96e/uct.cpp


If you take a look at line 151 you see: (bias should be renamed to
number_of_visits)

    explore_coeff  = log (bias) * explore_rate;

where bias is equivalent to this->bias i.e. number of visits in current node.
2 lines lower is a loop over all children where we compute:

   float child_urgency = child->ucb <pl> (explore_coeff);

where the body of ucb is:

    return
        (pl == player::black ? value : -value) +
        sqrt (explore_coeff / bias);

and "bias" variable refers to the child's bias.

If you have any more questions, just ask.
Łukasz

On Jan 9, 2008 7:49 PM,  <compgo123 at aol.com> wrote:
> I'm looking at the code of Libgo110. I have a question. In the file uct.cpp
> and the definition of class note_t, the explore_coeff is calculated from
> log(node->bias). But in the paper 'Modification of UCT with Patterns in
> Monte-Carlo Go' table 1 line 10 -17, the explore_coeff is calculated from
> log(nb), where nb is the summation of node->bias for all the child nodes.
> Whyis the difference? Or did I read the code wrong?
>
>  Thanks for any explanation.
>
>  DL
>  ________________________________
>  More new features than ever. Check out the new AOL Mail!
>
> _______________________________________________
> computer-go mailing list
> computer-go at computer-go.org
> http://www.computer-go.org/mailman/listinfo/computer-go/
>


More information about the computer-go mailing list