How to Run Reliable Local LLM Agents on an RTX 3090: A Benchmark (5 Models, Priced in Watts)

I gave GLM-4.5-Air (106B, open weights) 12 coding tasks through opencode on my RTX 3090. It scored 0%...

domenica 28 giugno 2026 New tab

TL;DRAI

GLM-4.5-Air: 0% with opencode, 93% with LangGraph native tool-calling on RTX 3090. For on-premise agents, orchestrator design and tool-tuned models (Qwen3-Coder 30B best) outweigh raw throughput; optimize for watt-cost per correct task.

450 words~2 min read

I gave GLM-4.5-Air (106B, open weights) 12 coding tasks through opencode on my RTX 3090. It scored 0% — never edited a single file.

Same model, same GPU, same tasks, but driven by a ~150-line LangGraph agent instead: 93%.

The model was never the problem. The orchestrator was. Here's the benchmark — including the part nobody else measures, the electricity cost per correct task.

Setup

RTX 3090 (24 GB) + 128 GB RAM, models via ollama, Q4 quants, temp 0.2

How to Run Reliable Local LLM Agents on an RTX 3090: A Benchmark (5 Models, Priced in Watts)

How to Run Reliable Local LLM Agents on an RTX 3090: A Benchmark (5 Models, Priced in Watts)

Other newsrooms on this story

Related reading

GLM-5.2 open agent benchmark: 22% Less Tool Failure

Benchmarking inference at scale: coding agents

How Much Does It Actually Cost to Run a Local LLM? (€ per Million Tokens,…

I spent two weeks optimizing 96GB of VRAM for local LLMs. Paid APIs still won.

Mistral Large vs LLaMA 4 vs Phi-4: Best Open-Source LLM for Code Generation in…

I Benchmarked 3 Local LLMs on My Laptop — Here's What the Numbers Actually Show

Other newsrooms on this story

Related reading

GLM-5.2 open agent benchmark: 22% Less Tool Failure

Benchmarking inference at scale: coding agents

How Much Does It Actually Cost to Run a Local LLM? (€ per Million Tokens,…

I spent two weeks optimizing 96GB of VRAM for local LLMs. Paid APIs still won.

Mistral Large vs LLaMA 4 vs Phi-4: Best Open-Source LLM for Code Generation in…

I Benchmarked 3 Local LLMs on My Laptop — Here's What the Numbers Actually Show