LLMs That Actually Pen Test: What Post-Training for Security Means for Your AI Stack

Security researchers have spent years arguing that LLMs should be more helpful with offensive security tasks. The models kept refusing. Now someone just shipped a post-trained model that does the work instead of lecturing you about responsible disclosure, and it reportedly found thousands of real zero-days. That is not a headline you can ignore if you are building any kind of AI system that touches code, infrastructure, or automated pipelines.

What Actually Happened

Two things landed close together that, read side by side, tell a clear story about where AI security tooling is going.

First, the Argus Red team shipped a CLI-accessible model that they post-trained specifically for penetration testing. The pitch is simple: instead of a general-purpose model that refuses to explain how buffer overflows work, you get one that treats offensive security as the actual task. No jailbreaks, no prompt engineering gymnastics. The model was trained to do the job.

LLMs That Actually Pen Test: What Post-Training for Security Means for Your AI Stack

What Actually Happened

Two things landed close together that, read side by side, tell a clear story about where AI security tooling is going.

LLMs That Actually Pen Test: What Post-Training for Security Means for Your AI Stack

LLMs That Actually Pen Test: What Post-Training for Security Means for Your AI Stack

Other newsrooms on this story

Related reading

LLM Security Vulnerabilities Engineers Need to Know in 2026

The Auditor's AI Workflow: How I Use LLMs Without Trusting Them

LLM Hacking, Secure Code Review, and AI Model Weight Integrity Explored

The Safety Feature That Taught an LLM to Lie

LLM Agents Are Now Finding Zero-Days: How AI is Autonomously Rewriting the…

LLM collapse: The danger of training LLMs on AI-generated data

Other newsrooms on this story

Related reading

LLM Security Vulnerabilities Engineers Need to Know in 2026

The Auditor's AI Workflow: How I Use LLMs Without Trusting Them

LLM Hacking, Secure Code Review, and AI Model Weight Integrity Explored

The Safety Feature That Taught an LLM to Lie

LLM Agents Are Now Finding Zero-Days: How AI is Autonomously Rewriting the…

LLM collapse: The danger of training LLMs on AI-generated data