WARPTECHNEWS · LAB

Warptech Lab News aggregates the most relevant news from more than 700 international sources, with AI classification, concise TL;DR summaries, and clustered timelines for individual stories.

© 2026 Sparktech S.R.L. All rights reserved. Site operated and maintained by Sparktech S.R.L.

Registered office: Corso Libertà 55, 13100 Vercelli (VC), Italy · VAT no. / Tax code (P.IVA / C.F.) 02835910023 · Contact: admin@warptechlab.com

Story from 1 source

I Fine-Tuned a 270M Model on My Laptop (Full Fine-Tuning, From Scratch)

Part 1 of a 4-part series. Full fine-tuning a tiny Gemma 3 model for intent classification — the generative framing, the loss-masking trick, and why full FT is so learning-rate sensitive.

Reported by dev.to

Chronological timeline

  1. Sunday, June 21, 2026 · dev.to

    I Fine-Tuned a 270M Model on My Laptop (Full Fine-Tuning, From Scratch)

    Part 1 of a 4-part series. Full fine-tuning a tiny Gemma 3 model for intent classification — the generative framing, the loss-masking trick, and why full FT is so learning-rate…

  2. Sunday, June 21, 2026 · dev.to

    LoRA: I Trained <1% of a 1.5B Model and Matched a Full Fine-Tune

    Part 2 of a 4-part series. How LoRA works (the low-rank trick), a working PEFT config, and three real GPU walls I hit — the FP16 unscale error, an OOM, and a 2-GPU speed mystery.

  3. Sunday, June 21, 2026 · dev.to

    If a 270M Model Already Worked, Why Did I Fine-Tune a 7B One?

    Part 4 (finale) of a 4-part series. Three model sizes tied on the same task — so when does bigger actually earn its keep? And the bug no model size could fix.