WARPTECHNEWS · LAB

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

A Blog post by JetBrains on Hugging Face

Monday, June 1, 2026

Mellum2 is a 12B-parameter Mixture-of-Experts model trained from scratch on natural language and code.

The model activates only 2.5B parameters per token, making it efficient for high-throughput, low-latency inference.
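To illustrate how a Mixture-of-Experts layer can leave most of its parameters idle for any given token, here is a minimal sketch of top-k expert routing. It is not Mellum2's actual implementation: this summary does not state the model's expert count or routing scheme, so the 8 experts, the top-2 selection, and the names `route_token` and `moe_layer` are all hypothetical. The only figures taken from the article are the 12B total and 2.5B active parameters, which work out to roughly 21% of the weights in use per token.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def route_token(router_logits, top_k):
    """Return (expert_index, weight) pairs for the top_k experts of one token."""
    probs = softmax(router_logits)
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]
    norm = sum(probs[i] for i in chosen)  # renormalize over the chosen experts
    return [(i, probs[i] / norm) for i in chosen]


def moe_layer(x, experts, router_logits, top_k):
    """Weighted sum of the selected experts' outputs; other experts are never called."""
    return sum(w * experts[i](x) for i, w in route_token(router_logits, top_k))


# Hypothetical toy setup (NOT Mellum2's real configuration): 8 experts, 2 active.
NUM_EXPERTS, TOP_K = 8, 2
experts = [lambda x, s=s: x * (s + 1) for s in range(NUM_EXPERTS)]  # toy "experts"
random.seed(0)
logits = [random.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]  # stand-in router scores

selected = route_token(logits, TOP_K)
output = moe_layer(1.0, experts, logits, TOP_K)

# Figures from the article: 2.5B of 12B parameters are active per token.
active_fraction = 2.5 / 12

print(f"experts used: {[i for i, _ in selected]}, output: {output:.3f}")
print(f"active share of parameters: {active_fraction:.1%}")
```

In a real model the experts are feed-forward networks and the router scores come from a learned projection of the token's hidden state; the point of the sketch is only that compute per token scales with the experts selected, not with the total parameter count.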

Mellum2 can be used for routing, RAG, summarization, sub-agents, high-throughput coding features, and private deployments.

It is released under the Apache 2.0 license.

Other newsrooms on this story

10 sources

marktechpost.com · Jun 2, 2026 · 3 days ago
JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks in Multi-Model AI Pipelines
neowin.net · Jun 2, 2026 · 3 days ago
JetBrains open-sources Mellum 2, featuring 12B total parameters
blog.jetbrains.com · Jun 1, 2026 · 4 days ago
Mellum2 Goes Open Source: A Fast Model for AI Workflows | The JetBrains AI Blog
dev.to · Jun 2, 2026 · 3 days ago
JetBrains open-sources Mellum2 to challenge third-party API limitations
mistral.ai · May 28, 2026 · 8 days ago
Cheaper, Better, Faster, Stronger | Mistral AI
allenai.org · May 28, 2026 · 8 days ago
Olmix: A framework for data mixing throughout LM development | Ai2
dev.to · Jun 4, 2026 · 1 day ago
Running Mixtral 8x7B at 21+ TPS on Pure CPU via io_uring and Predictive Caching
marktechpost.com · Jun 4, 2026 · 22 hours ago
NVIDIA AI Releases Nemotron 3 Ultra: An Open 550B Mixture-of-Experts Hybrid Mamba-Transformer for Long-Running Agents
mistral.ai · May 28, 2026 · 8 days ago
Mistral Small 3 | Mistral AI
venturebeat.com · Jun 1, 2026 · 4 days ago
MiniMax M3 debuts, eclipsing GPT-5.5 and Gemini 3.1 Pro on key benchmark performance for just 5-10% of the cost

Related reading

marktechpost.com

JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks in…

JetBrains open-sources Mellum2, a 12B MoE model with 2.5B active parameters for routing, RAG, and sub-agent pipelines.

marktechpost.com · 3 days ago

JetBrains open-sources Mellum 2, featuring 12B total parameters

JetBrains has open-sourced Mellum 2, the successor to Mellum, its code completion-focused model that was also released as open…

neowin.net · 3 days ago

blog.jetbrains.com

Mellum2 Goes Open Source: A Fast Model for AI Workflows | The JetBrains AI Blog

Trained from scratch and designed for practical deployment, Mellum2 is built for routing, Q&A, sub-agents, and private AI use in…

blog.jetbrains.com · 4 days ago

JetBrains open-sources Mellum2 to challenge third-party API limitations

JetBrains open-sources Mellum2, a fast, 12B-parameter coding model running on your own infrastructure, surpassing limitations of…

dev.to · 3 days ago

the-decoder.com

Researchers train AI model that hits near-full performance with just 12.5…

Researchers at the Allen Institute for AI and UC Berkeley have built EMO, a mixture-of-experts model whose experts specialize in…

the-decoder.com · 20 days ago

EMO: Pretraining mixture of experts for emergent modularity

A Blog post by Ai2 on Hugging Face

huggingface.co · 28 days ago