Training Azerbaijani language models on Amazon SageMaker AI | Amazon Web Services

Thursday, May 28, 2026

This solution builds on open source tools including PyTorch, Hugging Face Transformers, and Liger Kernels. The authors would also like to thank Aiham Taleb, Arefeh Ghahvechi, Manav Choudhary, Rohit Thekkanal, Daz Akbarov, Jamila Jamilova, Ross Povelikin, Almas Moldakanov, Christelle Xu, and Ivan Khvostishkov for their contributions in making this project possible.

Azercell Telecom LLC, Azerbaijan’s leading telecommunications provider, wanted to build an Azerbaijani large language model (LLM) on Amazon SageMaker AI for telecom use cases and a customer-facing chatbot. The challenge was adapting foundation models (FMs) to a morphologically rich language with limited training data and no existing blueprint for efficient LLM training in Azerbaijani. In a six-week collaboration, Azercell worked with the AWS Generative AI Innovation Center to establish a production-ready framework on Amazon SageMaker AI. Through kernel-level optimizations on an ml.p5.48xlarge instance, the framework delivered 23% higher training throughput and 58% lower peak GPU memory usage. A custom tokenizer also improved tokens per word by 2× (roughly half as many tokens for the same word), effectively doubling the amount of Azerbaijani text that fits within the model’s context window. If you work with low-resource or morphologically complex languages, this post walks through the approach so you can evaluate similar techniques.
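To make the tokens-per-word figure concrete, the following is a minimal sketch of how such a metric can be measured and how it translates into context-window capacity. It is not the project's evaluation code. The function names (`tokens_per_word`, `words_in_context`, `chunk_tokenizer`) are hypothetical, words are assumed to be whitespace-delimited, and the two toy tokenizers only stand in for a baseline tokenizer and a custom one.

```python
from typing import Callable, Iterable, List

# Any callable that maps a string to a list of token strings.
Tokenizer = Callable[[str], List[str]]


def tokens_per_word(texts: Iterable[str], tokenize: Tokenizer) -> float:
    """Average number of tokens emitted per whitespace-delimited word."""
    total_tokens = 0
    total_words = 0
    for text in texts:
        total_words += len(text.split())
        total_tokens += len(tokenize(text))
    if total_words == 0:
        raise ValueError("input contains no words")
    return total_tokens / total_words


def words_in_context(context_tokens: int, fertility: float) -> int:
    """Approximate number of words that fit in a context window."""
    return int(context_tokens / fertility)


def chunk_tokenizer(piece_len: int) -> Tokenizer:
    """Toy tokenizer (assumption): split each word into fixed-size pieces.

    A smaller piece_len mimics a tokenizer that fragments words heavily;
    a larger piece_len mimics one with a vocabulary fitted to the language.
    """
    def tokenize(text: str) -> List[str]:
        pieces: List[str] = []
        for word in text.split():
            pieces.extend(
                word[i:i + piece_len] for i in range(0, len(word), piece_len)
            )
        return pieces
    return tokenize


if __name__ == "__main__":
    sample = ["Azərbaycan dili"]  # illustrative text, not project data
    baseline = tokens_per_word(sample, chunk_tokenizer(2))
    custom = tokens_per_word(sample, chunk_tokenizer(4))
    print(f"baseline tokens/word: {baseline:.2f}")
    print(f"custom tokens/word:   {custom:.2f}")
    # Context size of 8192 is an arbitrary example value.
    print("words in 8192 tokens (baseline):", words_in_context(8192, baseline))
    print("words in 8192 tokens (custom):  ", words_in_context(8192, custom))
```

With a real tokenizer, the same function can be called with the `tokenize` method of a Hugging Face Transformers tokenizer in place of the toy callables. Halving tokens per word doubles the value returned by `words_in_context` (up to integer rounding), which is the relationship the paragraph above describes.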
