Author(s): malrsapps
Originally published on Towards AI.
This project started with curiosity rather than a clear plan.
I wanted to understand how language models were actually trained and whether it was possible to build and train one myself on local hardware.
At first, I considered building a small coding assistant. Later, I shifted toward a Turkish tax-focused model. Along the way, I found myself digging into datasets, tokenizers, training loops, GPU limitations, and model architecture.











