Storia: Why Code Golfing is the Ultimate Test for Multimodal LLMs (And a New Benchmark to Prove It) — Warptech Lab News