I instrumented 95 DataLoaders in a production GraphQL API — here's what I found

DataLoader is the standard fix for GraphQL's N+1 query problem. Batch your database calls per request, cache within the request lifecycle, done.

But once DataLoader is in production, you're flying blind. Which loaders are actually called per request? Is your cache hit rate 15% or 60%? Should your batch size be 10 or 50? APM tools tell you resolver latency, but they don't understand DataLoader batching.

I built dataloader-ai to answer those questions. Then I tested it for real by instrumenting 95 DataLoader instances in Open Collective's GraphQL API.

The problem: invisible batching

Open Collective runs one of the largest open-source GraphQL APIs on the web. Their server/graphql/loaders/ directory contains 96 DataLoader instances across 20 files — loaders for collectives, expenses, transactions, members, comments, orders, and more.

DataLoader is the standard fix for GraphQL's N+1 query problem. Batch your database calls per request, cache within the request lifecycle, done.

I built dataloader-ai to answer those questions. Then I tested it for real by instrumenting 95 DataLoader instances in Open Collective's GraphQL API.

The problem: invisible batching

I instrumented 95 DataLoaders in a production GraphQL API — here's what I found

I instrumented 95 DataLoaders in a production GraphQL API — here's what I found

Other newsrooms on this story

Related reading

I Built an AI Data Chat Tool in My Portfolio App Using Gemma 4, CrewAI, DuckDB,…

The Gap Between Prototype and Production: What We Learned at Nometria

Treasure Hunting at Scale: Why Our Cache-Aside Cache Cost Us 40% in Tail…

How I Stopped Shopify's GraphQL API From Throttling My App (Parallel Query…

Benchmarking LLM Structured Outputs

Building AI Infrastructure That Actually Scales in Production

Other newsrooms on this story

Related reading

I Built an AI Data Chat Tool in My Portfolio App Using Gemma 4, CrewAI, DuckDB,…

The Gap Between Prototype and Production: What We Learned at Nometria

Treasure Hunting at Scale: Why Our Cache-Aside Cache Cost Us 40% in Tail…

How I Stopped Shopify's GraphQL API From Throttling My App (Parallel Query…

Benchmarking LLM Structured Outputs

Building AI Infrastructure That Actually Scales in Production