
Latest open artifacts (#21): Open model bonanza! Gemma 4, DeepSeek V4, Kimi…
An eventful month with one flagship release after another
24articoli totali nell'archivio

An eventful month with one flagship release after another



‘Distillation attacks’ is a horrible term for what is happening right now.

The complex factors that determine the single evaluation number so many focus on. Plus, how this changes in the future.

What I expect to come next and why, focused on the open-closed gap.

And yes, I hate consortia too.


Hint: it's not benchmark scores.

New orgs! New types of models! With Nemotron Super, Sarvam, Cohere Transcribe, & others

The case for why self-improvement is real but it doesn't lead to fast takeoff.

On evaluating and understanding the frontier of agents, and why I still turn to Claude.

Markets, capabilities, cope, and bewilderment in the industrialization of language models.

Interconnects interview #17 on the past, present, and future of the Nemotron project.

Launching a new open model series. Interconnects Interview #15.

Interconnects interview #16 to celebrate the release of Trinity Large.

The latest Olmo model and discussions at the frontier of open-source post training tools.

Welcome to the year of the horse!

Reacting to Anthropic's post on "distillation attacks."

The open-closed gap, distillation, innovation timescales, how open models win, specialized models, what’s missing, etc.


Tons of useful "niche" models and anticipation of big releases coming soon.