JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks in Multi-Model AI Pipelines

JetBrains released Mellum2, open-sourcing the weights under the Apache 2.0 license. The first version of Mellum was a completion-focused 4B dense model. Mellum2 is its successor: a general-purpose model specialized in software engineering. It covers code generation and editing, debugging, multi-step reasoning, tool use and function calling, agentic coding, and conversational programming assistance.

The JetBrains team positions Mellum2 as a “focal model”: a fast, specialized component inside larger AI systems, not a standalone replacement for frontier models.

Architecture

Mellum2 uses a Mixture-of-Experts (MoE) architecture with 12B total parameters and 2.5B active parameters per token. In MoE models, only a subset of parameters runs on each token. Here, the model has 64 experts and activates 8 per token. This keeps per-token compute roughly comparable to that of a 2.5B dense model, while the larger total parameter count provides more capacity for specialization.
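To make the routing idea concrete, here is a minimal sketch of top-k expert selection using the figures quoted above (64 experts, 8 active per token). It is illustrative only: the function names (`route_token`, `moe_forward`), the toy scalar "experts", and the choice to softmax-normalize over the selected experts are assumptions, not details of Mellum2's actual router, which the article does not describe.

```python
import math
import random

NUM_EXPERTS = 64  # expert count quoted in the article
TOP_K = 8         # experts activated per token, per the article


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def route_token(router_logits, top_k=TOP_K):
    """Pick the top_k experts by router score and return (index, weight) pairs.

    Normalizing weights over only the selected experts is an assumption;
    real routers differ in how they normalize.
    """
    ranked = sorted(range(len(router_logits)),
                    key=lambda i: router_logits[i], reverse=True)
    chosen = ranked[:top_k]
    weights = softmax([router_logits[i] for i in chosen])
    return list(zip(chosen, weights))


def moe_forward(x, router_logits, experts, top_k=TOP_K):
    """Combine the outputs of the selected experts; the others never run."""
    return sum(w * experts[i](x) for i, w in route_token(router_logits, top_k))


rng = random.Random(0)
# Hypothetical toy experts: each one just scales a scalar input.
experts = [(lambda x, s=rng.uniform(0.5, 1.5): s * x) for _ in range(NUM_EXPERTS)]
# Hypothetical router scores for one token.
logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]

selected = route_token(logits)
output = moe_forward(2.0, logits, experts)

active_expert_fraction = TOP_K / NUM_EXPERTS  # 8 of 64 experts
active_param_fraction = 2.5 / 12              # 2.5B of 12B parameters
```

With these numbers, one eighth of the experts run per token, while about a fifth of all parameters are active. The gap is presumably because non-expert components such as attention layers and embeddings run for every token, though the article does not give a breakdown.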

Key architectural details:

