How well do LLMs (GPT 5.2, Opus 4.5, GLM 4.7, DeepSeek Terminus 3.1) play chess against the mature Stockfish engine? Not so well, it turns out, but they're not rubbish either. All 4 models lose 6 games each to Stockfish level 0. Given at PyDataLondon 2026-01.