CloudyGo
V5 Graphs
RESULTS
[Figure: Winrate by Model. x-axis: Model Number; y-axis: Winrate]
[Figure: Game length by Model. x-axis: Model Number; y-axis: Game length]
These aren't that useful, and we have live data from logs now.
[Figure: Bad Resign Rate by Model. x-axis: Model Number; y-axis: Bad resign rate]
[Figure: Threshold for 1%, 2% and 5% Bad Resign Rate. x-axis: Model Number; y-axis: Resign rate for X% error]
[Figure: Number of games (processed) by Model. x-axis: Model Number; y-axis: Number of self-play games CloudyGo has ingested]
[Figure: Games Per Day. x-axis: Day; y-axis: Number of Games]
[Figure: Sum of visits to top move per game by Model. x-axis: Model Number; y-axis: Sum visits]
[Figure: How badly soft-n messed up each game by Model. x-axis: Model Number; y-axis: Soft-n of all better not-played moves (see early game temperature)]
[Figure: Rating delta from last model. x-axis: Model Number; y-axis: Delta Rating]
[Figure: Win rate by difference in model number. x-axis: Model number delta; y-axis: Elo delta]