If a 270M Model Already Worked, Why Did I Fine-Tune a 7B One?

Over three posts I built three fine-tuned models for the same banking-intent task: a full fine-tune of a 270M model, LoRA on a 1.5B model, and QLoRA on a 7B model. They all landed at around the same accuracy.

Which raises an honest, slightly uncomfortable question: if a 270M model on my laptop already worked, why reach for a 7B model at all?

The answer most "bigger is better" content skips

For this task, you wouldn't. A good engineer picks the smallest model that clears the bar, not the biggest one available. The small model is cheaper to serve, runs in milliseconds, and is fully yours to own. Choosing the 7B here would be over-engineering.
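To make "the smallest model that clears the bar" concrete, here is a minimal sketch of that selection rule. Everything in it is an assumption for illustration: the `Candidate` type, the `smallest_that_clears` helper, the 0.85 bar, and especially the accuracy values, which are placeholders and not the measurements from these posts.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Candidate:
    """A fine-tuned model under consideration (hypothetical structure)."""
    name: str
    params_millions: int  # parameter count, used here as a rough cost proxy
    accuracy: float       # held-out accuracy in [0, 1]


def smallest_that_clears(
    candidates: Sequence[Candidate], bar: float
) -> Optional[Candidate]:
    """Return the smallest candidate whose accuracy meets the bar, or None."""
    passing = [c for c in candidates if c.accuracy >= bar]
    return min(passing, key=lambda c: c.params_millions, default=None)


# Placeholder accuracies, set equal only to mirror "around the same accuracy".
# They are NOT results reported in the posts.
candidates = [
    Candidate("270M full fine-tune", 270, 0.90),
    Candidate("1.5B LoRA", 1500, 0.90),
    Candidate("7B QLoRA", 7000, 0.90),
]

choice = smallest_that_clears(candidates, bar=0.85)
print(choice.name if choice else "no model clears the bar")
```

When every candidate clears the bar, the rule always lands on the smallest one. A bigger model only wins when the bar moves out of the small model's reach, in which case the function returns a larger candidate or `None`.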

Reaching for a bigger model isn't a flex. It's a response to a requirement the small one can't meet. Here are the four cases where small stops being enough:

Related reading

I Fine-Tuned a 270M Model on My Laptop (Full Fine-Tuning, From Scratch)

LoRA: I Trained <1% of a 1.5B Model and Matched a Full Fine-Tune

QLoRA: Fine-Tuning a 7B Model on a 16GB GPU (It Shrank to 5.4GB in Front of Me)

Fine-Tuning Llama 3.2 3B on Medical QA: Week 4 - When Lower Loss Meant a Worse…

Fine-Tuning Small Open-Source LLMs to Outperform Large Closed-Source Models by…

I A/B tested 4 LLMs on the same 500 queries. The results surprised me.
