OpenAI study suggests AI may be about to eclipse human expertise in real-world tasks

Good morning. Rarely does a 29-page scholarly paper merit the attention of top-level executives, but every business leader should be familiar with a recent study from OpenAI. It’s the best description yet of how AI can handle real-world tasks, showing which AI models are excelling, and hinting at what it all means for humans in the years ahead. The paper can be heavy going, but you can get a masterful summary from our AI Editor, Jeremy Kahn.

For leaders, three points stand out:

The study is highly realistic. It examined 44 occupations and 1,320 specialized tasks required by those occupations. One example: the final testing step in manufacturing a cable spooling truck for underground mining operations. Appropriate professionals (average experience: 14 years) vetted the tasks, all of which are elements of actual work deliverables. Previous research has almost always focused on less realistic tests. The AI results were graded by expert humans who didn’t know whether they were looking at work from AI or from an expert human professional.

The best models are already nearly as good as human industry experts. The study examined seven AI models drawn from OpenAI, Google’s Gemini, xAI’s Grok, and Anthropic’s Claude. The clear winner was Claude Opus 4.1, which came within a few percentage points of reaching parity with human industry experts. The best models also completed tasks about 100 times faster and 100 times cheaper than the industry experts, though the comparisons ignore “the human oversight, iteration, and integration steps required in real workplace settings,” OpenAI says.
