Qwen Is Not Yet Ready to Power Local OpenClaw Deployments

Three weeks ago I ran a model showdown — twelve tasks, five models, one RTX 5090 — and Qwen3.5-35B-A3B won. 85.3 weighted score, 206 tok/s, fits in VRAM with room to spare. I switched it to the default and figured I was done.

I was not done.

This is what two weeks of actually living with Qwen looked like: the config work I had to do before it was usable, the incident that almost killed the experiment, and the ergonomic gap that means frontier models still own my serious work.

Making It Actually Work

The first day I switched Qwen to the default model in OpenClaw, something was wrong. Responses showed raw <think>...</think> tags in the visible output. Tool calls came back as plain text — create_workspace, just sitting there — instead of proper OpenAI-compatible tool_calls objects. The bot was trying to call tools. It just wasn't calling them.

I was not done.

Making It Actually Work

Qwen Is Not Yet Ready to Power Local OpenClaw Deployments

Qwen Is Not Yet Ready to Power Local OpenClaw Deployments

Other newsrooms on this story

Related reading

Model Showdown Round 9: Qwen 3.6 27B vs Qwen 3.6 35B-A3B vs Qwythos-9B vs…

Qwen 3.6 35B-A3B for Local AI in 2026: The 24GB VRAM Line That Gets You 120…

I Spent Two Weeks Pitting Qwen 3 Max Against DeepSeek V4

Qwen3.7 Max vs Open-Weight LLMs: Practical Migration Notes

Qwen 3.6 27B is the sweet spot for local development

Qwen3.6-35B NVFP4 runs on one H100 — A100 owners are out

Other newsrooms on this story

Related reading

Model Showdown Round 9: Qwen 3.6 27B vs Qwen 3.6 35B-A3B vs Qwythos-9B vs…

Qwen 3.6 35B-A3B for Local AI in 2026: The 24GB VRAM Line That Gets You 120…

I Spent Two Weeks Pitting Qwen 3 Max Against DeepSeek V4

Qwen3.7 Max vs Open-Weight LLMs: Practical Migration Notes

Qwen 3.6 27B is the sweet spot for local development

Qwen3.6-35B NVFP4 runs on one H100 — A100 owners are out