How I got a threat-classification AI running on-agent in under 8ms — no GPU, no cloud


Monday, June 15, 2026


When I tell people that Watch Cortex classifies threats in under 8ms on-agent — no cloud call, no GPU, no round-trip — the first question is usually: how?

The second question is: why bother? Just send it to the cloud.

Let me answer the second one first, because it explains all the engineering decisions that follow.

Why on-agent matters

The cloud-call model for security agents has a fundamental problem: it fails when you need it most.
