New AI model called "Count Anything" does exactly what it says, and that's harder than it sounds

"Count Anything" is intended to be the first AI model capable of counting objects in any type of image, from crowds to cell samples under a microscope, using nothing more than a text prompt. In a comparative test, it cuts the error rate in half compared to previous systems. However, the approach still struggles with extremely dense objects and ambiguous terms.

sabato 13 giugno 2026 New tab

Large language models can describe images, interpret charts, and pull text from photos. Multimodality is a given for modern AI systems. But one seemingly simple task remains surprisingly hard: reliably counting objects in an image.

Getting those counts right has real consequences, whether it's a doctor reading a scan, a farmer estimating crop yields, or a city planner analyzing traffic. Until now, each of these tasks has required its own specialized system.

That's where "Count Anything" comes in. The new AI model from researchers at Tsinghua University and other institutions aims to count objects across very different types of images, whether that's heads in crowds, cars in satellite photos, cells in medical scans, or bacterial colonies in the lab.

It's a familiar problem. A system that reliably counts heads in a crowd often chokes on tightly packed cells under a microscope or tiny vehicles seen from above. The researchers want a single model that takes text input, marks every counted object in the image, and handles wildly different image types.

Two counters are better than one

New AI model called "Count Anything" does exactly what it says, and that's harder than it sounds

New AI model called "Count Anything" does exactly what it says, and that's harder than it sounds

Other newsrooms on this story

Related reading

Counting tokens is dumb. So we built a free metric for AI proficiency.

AI scores a ‘C–’ on its hardest math test yet

Button-pushing explorers: How to grasp that AI agents can do amazing things…

AI's next big leap is models that understand the world.

This Half-Gigabyte AI Model Runs Local Agents on Your Phone - Decrypt

Tool count is a vanity metric. Annotation coverage is what makes an AI agent…

Other newsrooms on this story

Related reading

Counting tokens is dumb. So we built a free metric for AI proficiency.

AI scores a ‘C–’ on its hardest math test yet

Button-pushing explorers: How to grasp that AI agents can do amazing things…

AI's next big leap is models that understand the world.

This Half-Gigabyte AI Model Runs Local Agents on Your Phone - Decrypt

Tool count is a vanity metric. Annotation coverage is what makes an AI agent…