Stefan Pohl Computer Chess

private website for chessengine-tests


Latest Website-News (2017/01/17): Testrun of asmFish 170109 finished. No progress in the last month...Next testrun Stockfish 170113. Result not before Monday.

 

Long thinking-time tournament updated. 726 games played after restart.

 

Stay tuned.


Stockfish testing

 

Playing conditions:

 

Hardware: i7-2630QM 2.0GHz Notebook, Windows 10 64bit, 4GB RAM

Fritzmark: singlecore: 3.97 / 1905 (all engines running on one core, only), average meganodes/s displayed by LittleBlitzerGUI: Houdini: 2.0 mn/s, Stockfish: 1.7 mn/s

Hash: 128MB per engine

GUI: LittleBlitzerGUI (draw at 120 moves, resign at 450cp (for 4 moves))

Tablebases: None

Openings: 10moves_SALC_500.epd (download the file at the "Download & Links"-section)

Ponder, Large Memory Pages & learning: Off

Thinking time: 70''+700ms per game/engine (average game-duration: 3.5 minutes)(standardized to the hardware-speed and the thinking time of the excellent FGRL Bullet-ratinglist). One 7000 games-testrun takes about 6 days (running on only 3 of 4 cores). The version-numbers of the Stockfish-development engines are the release-date, written backwards (year,month,day))(example: 141028 = October, 28, 2014), downloaded at chess.ultimaiq.net I always use the latest version of one day, if more than one version per day is released. And I use the version "for modern computers". (At the moment, the compiles for modern windows machines on abrok.eu are around 8% slower, so I dont use them anymore)

 

Each Stockfish-version plays 1000 games against Komodo 10.3, Houdini 5, Shredder 13, Fizbo 1.9, Gull 3, Fire 4, and Critter 1.6a.

 

Latest update: 2017/01/17: asmFish 170109

 

Download the individual statistics here

 

     Program                    Elo    +    -   Games   Score   Av.Op.  Draws

   1 BrainFish 161009 numa    : 3467    8    8  7000    85.0 %   3151   24.7 %
   2 BrainFish 161128 x64     : 3464    7    7  7000    81.3 %   3188   30.5 %
   3 asmFish 161207 x64       : 3427    7    7  7000    78.1 %   3188   33.6 %
   4 asmFish 161217 x64       : 3426    7    7  7000    78.0 %   3188   32.9 %
   5 asmFish 170109 x64       : 3426    7    7  7000    76.8 %   3201   33.9 % (new)
   6 asmFish 161004 x64       : 3421    8    8  7000    81.5 %   3151   29.2 %
   7 Stockfish 161212 x64     : 3408    7    7  7000    76.4 %   3188   35.8 %
   8 Stockfish 170105 x64     : 3404    7    7  7000    74.7 %   3201   36.0 %
   9 Stockfish 161120 x64     : 3395    7    7  7000    75.1 %   3188   35.7 %
  10 Stockfish 161127 x64     : 3394    7    7  7000    74.9 %   3188   36.2 %
  11 Stockfish 8 161101       : 3390    5    5 11000    74.3 %   3191   36.1 %
  12 Houdini 5 x64            : 3368    4    4 17000    57.5 %   3305   45.1 %
  13 Komodo 10.3 x64          : 3339    5    5 10000    62.6 %   3238   37.0 %
  14 Komodo 10.1 x64          : 3322    6    6  8000    65.0 %   3203   33.3 %
  15 Komodo 10.2 x64          : 3315    4    4 15000    54.2 %   3279   38.8 %
  16 Houdini 4 x64            : 3194    5    5 11000    44.7 %   3240   34.7 %
  17 Shredder 13 x64          : 3184    4    4 18000    34.5 %   3309   36.0 %
  18 Fizbo 1.9 x64            : 3163    6    6  9000    36.3 %   3274   30.2 %
  19 Gull 3 x64               : 3127    4    4 22000    27.7 %   3315   33.4 %
  20 Fire 4 x64               : 3115    4    4 22000    26.4 %   3315   33.2 %
  21 Critter 1.6a x64         : 3112    4    4 22000    26.1 %   3315   30.7 %
  22 Equinox 3.3 x64          : 3094    4    4 19000    24.5 %   3314   31.7 %
  23 Mars 3.41 x64            : 3092    5    5 10000    30.7 %   3256   34.5 %

Below you find a diagram of the progress of Stockfish in my tests since the end of 2016

And below that diagram, the older diagrams.

 

You can save the diagrams (as a JPG-picture (in originial size)) on your PC with mouseclick (right button) and then choose "save image"...

The Elo-ratings of older Stockfish dev-versions in the Ordo-calculation can be a little different to the Elo-"dots" in the diagram, because the results/games of new Stockfish dev-versions - when getting part of the Ordo-calculation - can change the Elo-ratings of the opponent engines and that can change the Elo-ratings of older Stockfish dev-versions (in the Ordo-calculation / ratinglist, but not in the diagram, where all Elo-"dots" are the rating of one Stockfish dev-version at the moment, when the testrun of that Stockfish dev-version was finished).


Sie sind Besucher Nr.