DeepSeek tests “sparse attention” to slash AI processing costs

Chinese lab's v3.2 release explores a technique that could make running AI far less costly.

martedì 30 settembre 2025 New tab

887 words~4 min read

selective memory

Chinese lab’s v3.2 release explores a technique that could make running AI far less costly.

Getty Images

Ever wonder why ChatGPT slows down during long conversations? The culprit is a fundamental mathematical challenge: Processing long sequences of text requires massive computational resources, even with the efficiency tricks that companies have already deployed. While US tech giants can afford to throw more hardware at the problem, Chinese AI company DeepSeek, which is cut off from a steady supply of some advanced AI chips by export restrictions, has extra motivation to squeeze more performance from less silicon.

DeepSeek tests “sparse attention” to slash AI processing costs

DeepSeek tests “sparse attention” to slash AI processing costs

Other newsrooms on this story

Related reading

DeepSeek releases 'sparse attention' model that cuts API costs in half |…

Faster AI, lower costs: DSpark eases bottlenecks and chip strain, says DeepSeek

US-China AI race intensifies as DeepSeek releases 'reduced' cost model

DeepSeek Launches New AI Model as Funding Rumors Spread

China's DeepSeek launches next-gen AI model. Here's what makes it different

What makes DeepSeek-V3.2 so efficient? - TechTalks

Other newsrooms on this story

Related reading

DeepSeek releases 'sparse attention' model that cuts API costs in half |…

Faster AI, lower costs: DSpark eases bottlenecks and chip strain, says DeepSeek

US-China AI race intensifies as DeepSeek releases 'reduced' cost model

DeepSeek Launches New AI Model as Funding Rumors Spread

China's DeepSeek launches next-gen AI model. Here's what makes it different

What makes DeepSeek-V3.2 so efficient? - TechTalks