[computer-go] The physics of Go playing strength.

Tom Cooper main at astrolabe.plus.com
Sun Apr 8 02:01:56 PDT 2007


The discussion here http://senseis.xmp.net/?EloRating suggests that 
the difference between beginners and top players in go is about 3000 
ELO on a 19x19 board.  This difference is very dependent on the board 
size.  I can
think of a naive argument that this difference should scale linearly 
with the (linear) size of the board, that is as the square-root of 
the area of the board.

At 08:56 08/04/2007, you wrote:

>According these results the slope is considerable greater than in 
>chess. In the classical experiment of Ken Thompons searching 1 ply 
>deeper is worth about 200 Elo. 1 ply corresponds to 5-6 times 
>longer/faster. In 9x9 already a factor of 2 gives the same 
>improvement. This is really remarkable. Another explanation would 
>be, that 100 Elo have in Go a different meaning than in chess.
>It is often argued that the distance between week and stronger 
>player is much greater in Go than in Chess. In chess the distance 
>between an average club player and top humans is about 1000 Elo.
>Maybe in Go its 2000 Elo?? In chess the green level-11 version would 
>have world-champion level. Is it just enough to make a 2 million 
>playouts version to beat the top-Dans in 9x9?  Is it that easy?
>Just build a special purpose chip like ChipTest aka Deep Blue. Or 
>implement it on a cluster. Or just wait a few years on do it on the 
>PC. Or a playstation.
>
>Chrilly
>
>
>
>Is there any notion of the Elo rating of a professional Go player. 
>In chess terms the
>----- Original Message ----- From: "Don Dailey" <drd at mit.edu>
>To: "computer-go" <computer-go at computer-go.org>
>Sent: Sunday, April 08, 2007 3:05 AM
>Subject: [computer-go] The physics of Go playing strength.
>
>
>>A few weeks ago I announced that I was doing a long term
>>scalability study with computer go on 9x9 boards.
>>
>>I have constructed a graph of the results so far:
>>
>>  http://greencheeks.homelinux.org:8015/~drd/public/study.jpg
>>
>>Although I am still collecting data, I feel that I have
>>enough samples to report some results - although I will
>>continue to collect samples for a while.
>>
>>This study is designed to measure the improvement in
>>strength that can be expected with each doubling of computer
>>resources.
>>
>>I'm actually testing 2 programs - both of them UCT style go
>>programs, but one of those programs does uniformly random
>>play-outs and the other much stronger one is similar to
>>Mogo, as documented in one of their papers.
>>
>>Dave Hillis coined the terminolgoy I will be using, light
>>play-outs vs heavy play-outs.
>>
>>For the study I'm using 12 versions of each program.  The
>>weakest version starts with 1024 play-outs in order to
>>produce a move.  The next version doubles this to 2048
>>play-outs, and so on until the 12th version which does 2
>>million (2,097,152) playouts.  This is a substantial study
>>which has taken weeks so far to get to this point.
>>
>>Many of the faster programs have played close to 250 games,
>>but the highest levels have only played about 80 games so
>>far.
>>
>>The scheduling algorithm is very similar to the one used by
>>CGOS.  An attempt is made not to waste a lot of time playing
>>seriously mis-matched opponents.
>>
>>The games were rated and the results graphed.  You can see
>>the result of the graph here (which I also included near the
>>top of this message):
>>
>>  http://greencheeks.homelinux.org:8015/~drd/public/study.jpg
>>
>>The x-axis is the number of doublings starting with 1024
>>play-outs and the y-axis is the ELO rating.
>>
>>The public domain program GnuGo version 3.7.9 was assigned
>>the rating 2000 as a reference point.  On CGOS, this program
>>has acheived 1801, so in CGOS terms all the ratings are
>>about 200 points optimistic.
>>
>>Feel free to interpret the data any way you please, but here
>>are my own observations:
>>
>>  1.  Scalability is almost linear with each doubling.
>>
>>  2.  But there appears to be a very gradual fall-off with
>>      time - which is what one would expect (ELO
>>      improvements cannot be infinite so they must be
>>      approaching some limit.)
>>
>>  3.  The heavy-playout version scales at least as well,
>>      if not better, than the light play-out version.
>>
>>      (You can see the rating gap between them gradually
>>      increase with the number of play-outs.)
>>
>>  4.  The curve is still steep at 2 million play-outs, this
>>      is convincing empirical evidence that there are a few
>>      hundred ELO points worth of improvement possible
>>      beyond this.
>>
>>  5.  GnuGo 3.7.9 is not competive with the higher levels of
>>      Lazarus.  However, what the study doesn't show is that
>>      Lazarus needs 2X more thinking time to play equal to
>>      GnuGo 3.7.9.
>>
>>
>>This graph explains why I feel that absolute playing
>>strength is a poor conceptual model of how humans or
>>computers play go.  If Lazarus was running on the old Z-80
>>processors of a few decades ago, it would be veiewed as an
>>incredibly weak program, but running on a supercomputer it's
>>a very strong program.  But in either case it's the SAME
>>program.  The difference is NOT the amount of work each
>>system is capable of, it's just that one takes longer to
>>accomplish a given amount of work.  It's much like the
>>relationships between power, work, force, time etc.  in
>>physics.
>>
>>Based on this type of analysis and the physics analogy,
>>GnuGo 3.7.9 is a stronger program than Lazarus (even at 9x9
>>go).  Lazarus requires about 2X more time to equalize.  So
>>Lazarus plays with less "force" (if you use the physics
>>analogy) and needs more TIME to get the same amount of work
>>done.
>>
>>ELO is treated numerically as if it were "work" in physics
>>because when it's measured by playing games, both players
>>get the same amount of time.  The time factor cancels out
>>but it cause us to ignore that it's part of the equation.
>>
>>On CGOS, Lazarus and FatMan are the same program, but one
>>does much more work and they have ELO ratings that differ by
>>almost 300 ELO points.   Even though they are the same
>>program you will look on CGOS and believe Lazarus is much
>>stronger because you have not considered the physics of Go
>>playing strength.
>>
>>- Don
>>
>>
>>_______________________________________________
>>computer-go mailing list
>>computer-go at computer-go.org
>>http://www.computer-go.org/mailman/listinfo/computer-go/
>
>_______________________________________________
>computer-go mailing list
>computer-go at computer-go.org
>http://www.computer-go.org/mailman/listinfo/computer-go/
>
>--
>This email has been verified as Virus free
>Virus Protection and more available at http://www.plus.net



More information about the computer-go mailing list