A practical comparison of the four subword tokenization algorithms powering every major LLM, with code examples and a decision framework for picking the right one.