Storia: The paradox of LLM self-distillation: Faster reasoning, weaker generalization - TechTalks — Warptech Lab News