XBOW tests Anthropic's Mythos Preview for offensive security

We received early access to Mythos Preview for capability testing a few weeks back. Below are the details on how we tested it, what we found, and what it means.

About three months ago, Anthropic invited us to help assess a new model they thought represented a significant shift in capability. So we put it through our security gauntlet: benchmarks, workflows, interactive use, and integrations.

Today, we can finally share details on how we tested Mythos Preview, what we found, and what it means.

Spoilers: This model is a major advance. It is substantially better than prior models at finding vulnerability candidates, especially when source code is available. It communicates with unusual technical precision, reasons well about code, and shows strong promise in complex domains such as native-code analysis and reverse engineering.

Our takeaway: Mythos Preview is a powerful tool for generating strong vulnerability leads and technically precise analysis. It is especially adept at analyzing source code with a security mindset. It's not magic, though: a model is a brain without a body.
