Author(s): malrsapps

Originally published on Towards AI.

This project started with curiosity rather than a clear plan.

I wanted to understand how language models were actually trained and whether it was possible to build and train one myself on local hardware.

At first, I considered building a small coding assistant. Later, I shifted toward a Turkish tax-focused model. Along the way, I found myself digging into datasets, tokenizers, training loops, GPU limitations, and model architecture.