Story: Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention — Warptech Lab News