Under strict US government regulations, OpenAI is releasing GPT-5.6, its new generation of AI models. According to OpenAI, it can match Anthropic's Mythos 5 in numerous benchmarks – and like Mythos, it was withdrawn for the general public under pressure from the US government. GPT-5.6 is said to exhibit improved agentic capabilities in programming, biology, and cybersecurity. The protective measures against misuse have also reportedly been improved.
GPT-5.6 includes the Balanced model Terra for everyday use, Luna, which is designed for speed and efficiency, and Sol, the most powerful model as the flagship. OpenAI cites results from various benchmark tests to underscore the strengths of GPT-5.6. In the coding benchmark Terminal-Bench 2.1, Sol and Sol Ultra outperform Mythos 5 with scores of 91.9 percent and 88.8 percent, respectively, compared to Mythos 5's 88 percent. Sol Ultra is an enhanced version of Sol with a new feature that allows it to call upon additional sub-agents to further increase Sol's performance.
In the biological domain, OpenAI names the benchmark GeneBench v1, comparing itself here with the previous model GPT 5.5. When using 20,000 tokens, Sol achieves around 25 percent, while GPT-5.5 only achieves 20 percent.












