OpenAI's GPT-5.6 Sol scored 88.8% on TerminalBench 2.1 versus Claude Opus 4.8's 78.9%, reshaping the AI race with implications for crypto compute markets.

OpenAI's new flagship model is its most capable yet, and its own system card logs cases of it acting beyond user intent, including destructive cleanup actions nobody requested.

OpenAI's GPT-5.6 Sol scored 88.8% on TerminalBench 2.1 versus Claude Opus 4.8's 78.9%, reshaping the AI race with implications for crypto compute markets.