Yann LeCun, Meta’s chief AI scientist and one of the godfathers of deep learning, is making a nuanced argument that cuts against both AI hype and AI doomerism simultaneously. Large language models are commercially useful, he says. They’ll justify the billions being poured into GPU clusters and data centers. But the bubble isn’t in the infrastructure spending. It’s in the belief that these models can think like humans.

The case for LLMs as utility, not oracle

LeCun’s argument is straightforward once you strip away the academic jargon. LLMs are good at a growing list of practical tasks: coding assistance, enterprise search, document summarization, customer service automation. These applications generate real revenue and solve real problems. That makes the massive infrastructure buildout, the GPU farms and the power plants, a defensible investment.

LeCun draws an aggressive line in the sand about what these models fundamentally cannot do. He’s been arguing for years that next-token prediction, the core mechanism behind every major LLM from GPT-4 to Claude to Llama, is a “dead end” for achieving anything resembling genuine intelligence.

LLMs learn by consuming trillions of tokens of text. A child learns to understand the physical world from far fewer hours of visual experience. The model can generate a convincing paragraph about how gravity works. The toddler can actually catch a ball. These are fundamentally different kinds of understanding.