Storia: Stochastic Gradient Descent (SGD's) Frequency Bias and How Adam Fixes It — Warptech Lab News