Stefan Pohl Computer Chess

private website for chessengine-tests


Latest Website-News (2016/12/04): Testrun of Stockfish 161127 finished. -2 Elo to Stockfish 161120. Next testrun: BrainFish 161128. Result not before Friday.

 

Long thinking-time tournament updated, too.

 

Stay tuned...


Stockfish testing

 

Playing conditions:

 

Hardware: i7-2630QM 2.0GHz Notebook, Windows 10 64bit, 4GB RAM

Fritzmark: singlecore: 3.97 / 1905 (all engines running on one core, only), average meganodes/s displayed by LittleBlitzerGUI: Houdini: 2.0 mn/s, Stockfish: 1.7 mn/s

Hash: 128MB per engine

GUI: LittleBlitzerGUI (draw at 120 moves, resign at 450cp (for 4 moves))

Tablebases: None

Openings: 10moves_SALC_500.epd (download the file at the "Download & Links"-section)

Ponder, Large Memory Pages & learning: Off

Thinking time: 70''+700ms per game/engine (average game-duration: 3.5 minutes)(standardized to the hardware-speed and the thinking time of the excellent FGRL Bullet-ratinglist). One 7000 games-testrun takes about 6 days (running on only 3 of 4 cores). The version-numbers of the Stockfish-development engines are the release-date, written backwards (year,month,day))(example: 141028 = October, 28, 2014), downloaded at chess.ultimaiq.net I always use the latest version of one day, if more than one version per day is released. And I use the version "for modern computers". (At the moment, the compiles for modern windows machines on abrok.eu are around 8% slower, so I dont use them anymore)

 

Each Stockfish-version plays 1000 games against Komodo 10.2, Houdini 5, Shredder 13, Gull 3, Fire 4, Critter 1.6a and Equinox 3.3.

 

Latest update: 2016/12/04: Stockfish 161127

 

Download the individual statistics here

 

     Program                    Elo    +    -   Games   Score   Av.Op.  Draws

   1 BrainFish 161009 numa    : 3465    8    8  7000    85.0 %   3149   24.7 %
   2 asmFish 161004 x64       : 3419    7    7  7000    81.5 %   3149   29.2 %
   3 Stockfish 161120 x64     : 3395    7    7  7000    75.1 %   3187   35.7 %
   4 Stockfish 161127 x64     : 3393    7    7  7000    74.9 %   3187   36.2 % (new)
   5 Stockfish 8 161101       : 3390    6    6  9000    75.8 %   3177   35.3 %
   6 Houdini 5 x64            : 3368    6    6  9000    66.2 %   3236   40.5 %
   7 Komodo 10.2 x64          : 3321    5    5 11000    62.3 %   3223   36.7 %
   8 Komodo 10.1 x64          : 3320    6    6  8000    65.0 %   3201   33.3 %
   9 Houdini 4 x64            : 3193    5    5 11000    44.7 %   3239   34.7 %
  10 Shredder 13 x64          : 3184    5    5 10000    41.7 %   3250   38.3 %
  11 Gull 3 x64               : 3123    5    5 14000    31.7 %   3275   35.9 %
  12 Fire 4 x64               : 3114    5    5 14000    30.6 %   3276   36.6 %
  13 Critter 1.6a x64         : 3110    5    5 14000    30.1 %   3276   33.7 %
  14 Equinox 3.3 x64          : 3093    5    5 14000    28.2 %   3278   34.0 %
  15 Mars 3.41 x64            : 3091    5    5 10000    30.7 %   3255   34.5 %

Below you find a diagram of the progress of Stockfish in my tests since the end of 2016

And below that diagram, the older diagrams.

 

You can save the diagrams (as a JPG-picture (in originial size)) on your PC with mouseclick (right button) and then choose "save image"...

The Elo-ratings of older Stockfish dev-versions in the Ordo-calculation can be a little different to the Elo-"dots" in the diagram, because the results/games of new Stockfish dev-versions - when getting part of the Ordo-calculation - can change the Elo-ratings of the opponent engines and that can change the Elo-ratings of older Stockfish dev-versions (in the Ordo-calculation / ratinglist, but not in the diagram, where all Elo-"dots" are the rating of one Stockfish dev-version at the moment, when the testrun of that Stockfish dev-version was finished).


Sie sind Besucher Nr.