Stefan Pohl Computer Chess

private website for chessengine-tests


Latest Website-News (2017/02/19): Testrun of asmFish 170211 finished. No progress, again. Next testrun: Stockfish 170214. Result not before Saturday.

 

Endless long thinking-time tournament updated.

 

Stay tuned.


Stockfish testing

 

Playing conditions:

 

Hardware: i7-2630QM 2.0GHz Notebook, Windows 10 64bit, 4GB RAM

Fritzmark: singlecore: 3.97 / 1905 (all engines running on one core, only), average meganodes/s displayed by LittleBlitzerGUI: Houdini: 2.0 mn/s, Stockfish: 1.7 mn/s

Hash: 128MB per engine

GUI: LittleBlitzerGUI (draw at 120 moves, resign at 450cp (for 4 moves))

Tablebases: None

Openings: 10moves_SALC_500.epd (download the file at the "Download & Links"-section)

Ponder, Large Memory Pages & learning: Off

Thinking time: 70''+700ms per game/engine (average game-duration: 3.5 minutes)(standardized to the hardware-speed and the thinking time of the excellent FGRL Bullet-ratinglist). One 7000 games-testrun takes about 6 days (running on only 3 of 4 cores). The version-numbers of the Stockfish-development engines are the release-date, written backwards (year,month,day))(example: 141028 = October, 28, 2014), downloaded at chess.ultimaiq.net I always use the latest version of one day, if more than one version per day is released. And I use the version "for modern computers". (At the moment, the compiles for modern windows machines on abrok.eu are around 8% slower, so I dont use them anymore)

 

Each Stockfish-version plays 1000 games against Komodo 10.3, Houdini 5, Shredder 13, Fizbo 1.9, Gull 3, Fire 4, and Critter 1.6a.

 

Latest update: 2017/02/19: asmFish 170211

 

Download the individual statistics here

 

     Program                    Elo    +    -   Games   Score   Av.Op.  Draws

   1 BrainFish 161009 numa    : 3469    9    9  7000    85.0 %   3153   24.7 %
   2 BrainFish 161128 x64     : 3464    8    8  7000    81.3 %   3189   30.5 %
   3 asmFish 161207 x64       : 3427    7    7  7000    78.1 %   3189   33.6 %
   4 asmFish 170211 x64       : 3427    7    7  7000    76.9 %   3201   33.3 % (new)
   5 asmFish 161217 x64       : 3426    7    7  7000    78.0 %   3189   32.9 %
   6 asmFish 170202 x64       : 3426    7    7  7000    76.9 %   3201   33.6 %
   7 asmFish 170109 x64       : 3425    7    7  7000    76.8 %   3201   33.9 %
   8 asmFish 161004 x64       : 3422    8    8  7000    81.5 %   3153   29.2 %
   9 asmFish 170122 x64       : 3422    7    7  7000    76.5 %   3201   33.6 %
  10 Stockfish 161212 x64     : 3409    7    7  7000    76.4 %   3189   35.8 %
  11 Stockfish 170105 x64     : 3403    7    7  7000    74.7 %   3201   36.0 %
  12 Stockfish 170113 x64     : 3403    7    7  7000    74.6 %   3201   35.7 %
  13 Stockfish 170129 x64     : 3400    7    7  7000    74.4 %   3201   36.6 %
  14 Stockfish 161120 x64     : 3396    7    7  7000    75.1 %   3189   35.7 %
  15 Stockfish 161127 x64     : 3394    7    7  7000    74.9 %   3189   36.2 %
  16 Stockfish 8 161101       : 3390    5    5 11000    74.3 %   3192   36.1 %
  17 Houdini 5 x64            : 3367    4    4 22000    54.1 %   3330   46.9 %
  18 Komodo 10.3 x64          : 3332    4    4 15000    53.8 %   3298   40.1 %
  19 Komodo 10.1 x64          : 3324    6    6  8000    65.0 %   3204   33.3 %
  20 Komodo 10.2 x64          : 3316    4    4 15000    54.2 %   3280   38.8 %
  21 Houdini 4 x64            : 3195    5    5 11000    44.7 %   3241   34.7 %
  22 Shredder 13 x64          : 3183    4    4 23000    31.4 %   3332   35.2 %
  23 Fizbo 1.9 x64            : 3168    5    5 14000    30.7 %   3324   29.2 %
  24 Gull 3 x64               : 3129    4    4 27000    25.7 %   3334   32.5 %
  25 Fire 4 x64               : 3119    4    4 27000    24.7 %   3334   32.5 %
  26 Critter 1.6a x64         : 3113    3    3 27000    24.1 %   3334   29.6 %
  27 Equinox 3.3 x64          : 3095    4    4 19000    24.5 %   3314   31.7 %
  28 Mars 3.41 x64            : 3093    6    6 10000    30.7 %   3257   34.5 %

Below you find a diagram of the progress of Stockfish in my tests since the end of 2016

And below that diagram, the older diagrams.

 

You can save the diagrams (as a JPG-picture (in originial size)) on your PC with mouseclick (right button) and then choose "save image"...

The Elo-ratings of older Stockfish dev-versions in the Ordo-calculation can be a little different to the Elo-"dots" in the diagram, because the results/games of new Stockfish dev-versions - when getting part of the Ordo-calculation - can change the Elo-ratings of the opponent engines and that can change the Elo-ratings of older Stockfish dev-versions (in the Ordo-calculation / ratinglist, but not in the diagram, where all Elo-"dots" are the rating of one Stockfish dev-version at the moment, when the testrun of that Stockfish dev-version was finished).


Sie sind Besucher Nr.