Re: [computer-go] Is Rémi correct?

Weston Markham weston.markham at gmail.com
Wed Feb 6 13:49:44 PST 2008


I know that other people have mentioned this sort of thing already,
but the result of level 8 being better than level 10 matches my own
experience with slightly older versions of gnugo.  As I recall, 8 was
the best, 9 a little worse, and 10 worse again.  Increasing the level
seems to improve play after that, but it dramatically increases the
time.

Weston

On Feb 6, 2008 12:48 PM, Don Dailey <drdailey at cox.net> wrote:
> Here is an update from the new 1000 game test using gnugo at level 8
> instead of 10.
>
> Rank Name           Elo    +    - games score oppo. draws
>    1 Gnugo-3.7.11  1800   34   30  2186   97%  1137    0%
>    2 Mogo_03       1507   48   56   186   16%  1800    0%
>    3 Mogo_02       1202   43   51  1000    3%  1800    0%
>    4 Mogo_01       1003   70   96  1000    1%  1800    0%
>
> The test, at this point, seems to indicate that gnugo at level 8 is
> stronger than at level 10 because mogo is not doing as well as in the
> previous test.    It will be more meaningful when we get to levels close
> to gnugo's strength.
>
> - Don
>
>
> > As promised, to answer Rémi, I did a study with mogo vs Gnu at various
> > levels.   There is NO self play involved; Gnugo-3.7.11 is the only
> > opponent for progressively higher rated versions of Mogo.
> >
> > Here are the raw results so far:
> >
> > Rank Name           Elo    +    - games score oppo. draws
> >    1 Mogo_10       2319   72   60   500   95%  1800    0%
> >    2 Mogo_11       2284   94   74   259   94%  1800    0%
> >    3 Mogo_09       2234   57   49   500   92%  1800    0%
> >    4 Mogo_08       2124   43   39   500   87%  1800    0%
> >    5 Mogo_07       2016   35   33   500   78%  1800    0%
> >    6 Mogo_06       1961   32   30   500   72%  1800    0%
> >    7 Mogo_05       1814   28   28   500   52%  1800    0%
> >    8 Gnugo-3.7.11  1800   13   13  5259   44%  1823    0%
> >    9 Mogo_04       1711   29   29   500   37%  1800    0%
> >   10 Mogo_03       1534   35   38   500   18%  1800    0%
> >   11 Mogo_02       1281   60   72   500    5%  1800    0%
> >   12 Mogo_01       1004  115  178   500    1%  1800    0%
> >
> >
> > The issue is whether self-play results distort the rating of programs.
> > In this case, we are only testing whether it distorts the ratings of
> > Mogo since no other programs were tested.
> >
> > In the following table,  I played up to 500 games between Gnugo and Mogo
> > at various levels.   The levels are the exact levels that correspond to
> > the big scalability study.      In the middle column I listed the
> > ratings as computed by bayeselo in games against  ONLY Gnugo and set the
> > default rating of Gnugo to 1800, just as in the study.
> >
> > Unfortunately,  I used level 10 in the gnugo only games but in the big
> > study we use level 8.   It's my understanding there is little difference
> > between these 2 but we can probably assume Mogo might be a little better
> > than indicated relative to the big scalability study.
> >
> > It looks like there indeed is a lot of distortion at the low end of the
> > scale.  Mogo seems much stronger at low levels than the larger
> > scalability study indicated.
> >
> > At the higher levels,  we also get a mismatch,  where Mogo's rating
> > doesn't seem as high when playing only Gnugo.   This is as Rémi
> > claims.
> >
> > One thing to note is that at higher levels it's more difficult to get an
> > accurate rating.  Mogo_10 is winning 95% of its games against Gnugo,
> > and an extra win or loss every few games can make a lot of difference.
> > However I am inclined to believe this is real since it seems to hold for
> > several upper levels.   At level 7 it's only 42 Elo, but at levels
> > beyond this it's over 100 Elo.
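
To illustrate the point about accuracy at high win rates, here is a
minimal sketch using the plain logistic Elo formula.  This is an
assumption for illustration only: bayeselo fits its own model, so its
numbers will not match exactly.  The win counts are also assumed, taken
from the rounded percentages in the table (95% and 52% of 500 games).
The function names are hypothetical.

```python
import math

def elo_diff(score):
    """Elo difference implied by a score fraction under the logistic model."""
    return 400.0 * math.log10(score / (1.0 - score))

def rating_shift(wins, games, extra_wins):
    """How much the implied Elo difference moves if extra_wins more games are won."""
    before = elo_diff(wins / games)
    after = elo_diff((wins + extra_wins) / games)
    return after - before

# Assumed win counts: rounded score percentages times 500 games.
for name, wins in [("Mogo_05", 260), ("Mogo_10", 475)]:
    implied = elo_diff(wins / 500)
    shift = rating_shift(wins, 500, 5)
    print(f"{name}: implied diff {implied:.0f}, shift from 5 extra wins {shift:.1f}")
```

Under these assumptions, five extra wins out of 500 move the implied
difference several times further at a 95% score than at a 52% score,
which is why the upper levels carry wider error bars.
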
> >
> > I've never doubted that there is some intransitivity between programs, but
> > I am a little surprised that it is this much.  Even if the comparison is
> > slightly unfair due to Mogo playing a stronger version of Gnugo in this
> > study,  it still seems like it must be at least 100 Elo.
> >
> >
> > vers  vs Gnu  Study
> > ----  ------  -----
> >   01    1004    688
> >   02    1281   1093
> >   03    1534   1331
> >   04    1711   1554
> >   05    1814   1751
> >   06    1961   1971
> >   07    2016   2058
> >   08    2124   2270
> >   09    2234   2347
> >   10    2319   2470
> >
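
For anyone who wants the gaps spelled out, the short sketch below
subtracts the "Study" column from the "vs Gnu" column of the table
above.  The numbers are copied from that table; the variable and
function names are just for illustration.

```python
# (version, rating vs Gnugo only, rating in the scalability study)
ratings = [
    ("01", 1004, 688),
    ("02", 1281, 1093),
    ("03", 1534, 1331),
    ("04", 1711, 1554),
    ("05", 1814, 1751),
    ("06", 1961, 1971),
    ("07", 2016, 2058),
    ("08", 2124, 2270),
    ("09", 2234, 2347),
    ("10", 2319, 2470),
]

def gaps(rows):
    """Return (version, vs-Gnugo rating minus study rating) for each row."""
    return [(version, gnu - study) for version, gnu, study in rows]

for version, gap in gaps(ratings):
    print(f"Mogo_{version}: {gap:+d}")
```

A positive gap means Mogo rates higher against Gnugo alone than in the
study (the low levels); a negative gap means it rates lower (level 6
and above).
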
> >
> > My suggestion to improve this situation is to play a few thousand games
> > against a well-rated Gnugo and set up mogo as a second anchor.
> >
> > - Don
> >
> >
> >
> > _______________________________________________
> > computer-go mailing list
> > computer-go at computer-go.org
> > http://www.computer-go.org/mailman/listinfo/computer-go/
> >
> >

