CloudyGo
Other Runs
v17 - 20x256 Squeeze-and-Excitation
v16 - 40x256 40 block!
v15 - 20x256 Q=loss
v14 - 20x256 Bigtable + Q=loss
v13 - 20x256 "Master" (SL)
v12 - 20x256 BS=2
v11 - 20x256 Q=draw
v10 - 20x256
v9 - 20x256
v7 - 20x128
v5 - 10x128
v3-9x9
leela-zero Official LZ games and eval
leela-zero-eval More eval games
leela-zero-eval-time Eval on even time
cross-eval (eval only)
KataGo KataGo games
Pages
Model list
Graphs
Model Evolution
Josekis
Figure 3
Evaluation
Model Details
Model Graphs
Newest Eval
Results
Puzzles
All Eval Graph
v12
Graphs
V12 Graphs
RESULTS
700
750
800
850
900
950
0.00
0.05
0.10
0.15
0.20
0.25
0.30
0.35
0.40
0.45
0.50
0.00
0.05
0.10
0.15
0.20
0.25
0.30
0.35
0.40
0.45
0.50
Model Number
Winrate
Winrate by Model
700
750
800
850
900
950
210
220
230
240
250
260
270
280
290
300
310
320
210
220
230
240
250
260
270
280
290
300
310
320
Model Number
Game length
Game length by Model
This aren't that useful and we have live data from logs now
Model Number
Bad resign rate
Bad Resign Rate by Model
970
972
974
976
978
980
982
984
986
988
990
992
994
996
998
1
1
Model Number
Resign rate for X% error
Threshold for 1%, 2% and 5% Bad Resign Rate
700
750
800
850
900
950
50,000
100,000
150,000
200,000
250,000
300,000
350,000
Model Number
Number of self-play games CloudyGo has ingested
Number of games (processed) by Model
Wed 19
Fri 21
Sep 23
Tue 25
Thu 27
Sat 29
October
Wed 03
Fri 05
Oct 07
Tue 09
Thu 11
Sat 13
Mon 15
Wed 17
0
100,000
200,000
300,000
400,000
500,000
600,000
700,000
800,000
900,000
1,000,000
1,100,000
0
100,000
200,000
300,000
400,000
500,000
600,000
700,000
800,000
900,000
1,000,000
1,100,000
Day
Number of Games
Games Per Day
700
750
800
850
900
950
0
50,000
100,000
150,000
200,000
250,000
300,000
350,000
400,000
450,000
500,000
0
50,000
100,000
150,000
200,000
250,000
300,000
350,000
400,000
450,000
500,000
Model Number
Sum visits
Sum of visits to top move per game by Model
700
750
800
850
900
950
6.0
6.2
6.4
6.6
6.8
7.0
7.2
7.4
7.6
7.8
8.0
8.2
8.4
8.6
8.8
Model Number
Soft-n of all better not-played moves (see early game temperature)
How badly soft-n messed up each game by Model
720
740
760
780
800
820
840
860
880
900
920
940
960
980
1,000
-800
-600
-400
-200
0
200
400
600
800
-800
-600
-400
-200
0
200
400
600
800
Model Number
Delta Rating
Rating delta from last model
-600
-400
-200
0
200
400
600
0
10
20
30
40
50
60
70
80
90
100
Model number delta
elo delta
Win rate by difference in model number