Unfortunately figure 3 is hard to reproduce and I haven't yet found the professional games or wrangled the outputs of the model for cross-run-eval yet, Sorry.