Why TPUs Aren't Popular (Even Though They're Cheaper Per Token)
TPUs and Trainium beat NVIDIA on cost-per-token and watts-per-token. So why does almost everyone still run on NVIDIA? The answer is the static-shape tax: systolic-array silicon wants tensor shapes fixed at compile time, and that constraint spreads from your code to your pipeline to your org chart.