NVIDIA Achieves Leading Agentic Coding Performance on First Agentic AI Benchmark | NVIDIA Technical Blog

AI agents have fundamentally changed the complexity of inference workloads. Until now, the industry has struggled to define a standard for measuring how inference systems perform under these conditions. Artificial Analysis AgentPerf (AA-AgentPerf) offers the industry’s first multi-vendor open benchmarks profiling trajectories that are representative of real-world AI agent coding tasks.

This post explains how AA-AgentPerf sets a new standard for measuring agentic workload performance, and how NVIDIA extreme co-design helps deliver up to 20x better agentic coding performance than previous generations.

What is AA-AgentPerf?

AA-AgentPerf is a hardware benchmark created by Artificial Analysis that measures the number of concurrent AI agents an inference system can support while meeting predefined, model-specific performance service level objective (SLO) tiers. An SLO is defined as a specific threshold of output token speed and time-to-first-token (TTFT). The benchmark results are normalized per accelerator and per megawatt to enable comparison across hardware configurations.

Figure 1. The AA-AgentPerf hardware benchmark measures the throughput and efficiency of running multiple AI agents in parallel

NVIDIA Achieves Leading Agentic Coding Performance on First Agentic AI Benchmark | NVIDIA Technical Blog

Other newsrooms on this story

Related reading

NVIDIA Blackwell Leads on First Agentic AI Infrastructure Benchmark

Other newsrooms on this story

Related reading

NVIDIA Blackwell Leads on First Agentic AI Infrastructure Benchmark

NVIDIA Blackwell Leads AgentPerf, the First Agentic-AI Infra Benchmark:…

AA-AgentPerf releases initial results for DeepSeek V4 Pro benchmark, showing…

Scaling Agentic AI Factories Through Extreme Co-Design with NVIDIA BlueField |…

Artificial Analysis launches coding agent benchmarks with event in San Francisco

Inside NVIDIA Rubin GPU Architecture: Powering the Era of Agentic AI | NVIDIA…