LoRA: I Trained <1% of a 1.5B Model and Matched a Full Fine-Tune

Part 2 of a 4-part series. How LoRA works (the low-rank trick), a working PEFT config, and three real GPU walls I hit — the FP16 unscale error, an OOM, and a 2-GPU speed mystery.

domenica 21 giugno 2026 New tab

566 words~3 min read

In Part 1 I fully fine-tuned a 270M model — updating every weight. That's fine for a tiny model. It gets painful as models grow, because full fine-tuning needs gradients and optimizer state for every parameter (~4× the model size in memory).

So: what do you do when the model is too big to comfortably fine-tune all of?

The idea behind LoRA

LoRA (Low-Rank Adaptation) rests on one observation: the change fine-tuning makes to a weight matrix is "low rank" — it lives in a small subspace. You don't need to learn the full update ΔW; you can learn it as the product of two skinny matrices, B·A:

output = W·x + (B·A)·x

Other newsrooms on this story

· 1 sources

Full timeline →

huggingface.co·Jun 18, 2026 · 6 g fa
Beyond LoRA: Can you beat the most popular fine-tuning technique?

LoRA: I Trained <1% of a 1.5B Model and Matched a Full Fine-Tune

Other newsrooms on this story

LoRA: I Trained <1% of a 1.5B Model and Matched a Full Fine-Tune

Other newsrooms on this story

Related reading

QLoRA: Fine-Tuning a 7B Model on a 16GB GPU (It Shrank to 5.4GB in Front of Me)

I Fine-Tuned a 270M Model on My Laptop (Full Fine-Tuning, From Scratch)

Beyond LoRA: Can you beat the most popular fine-tuning technique?

How to use Alpaca-LoRA to fine-tune a model like ChatGPT – Replicate blog

LoRA and QLoRA fine-tuning: what they actually do under the hood

How to Fine-Tune LFM2 Using QLoRA and DPO: A Complete Step-by-Step Coding…

Related reading

QLoRA: Fine-Tuning a 7B Model on a 16GB GPU (It Shrank to 5.4GB in Front of Me)

I Fine-Tuned a 270M Model on My Laptop (Full Fine-Tuning, From Scratch)

Beyond LoRA: Can you beat the most popular fine-tuning technique?

How to use Alpaca-LoRA to fine-tune a model like ChatGPT – Replicate blog

LoRA and QLoRA fine-tuning: what they actually do under the hood

How to Fine-Tune LFM2 Using QLoRA and DPO: A Complete Step-by-Step Coding…