The Problem: Pushing Instant Notifications to Millions of Devices

Answer-first: Uber's RAMEN system maintains persistent gRPC bidirectional streams to every active driver app. When a match is made, the ride offer travels from the matching engine to the driver's phone in under 100ms — without polling. This is how millions of connections are held open simultaneously without crashing the backend.

When DISCO decides to match you with Driver John Doe, the system must:

Send the ride offer to exactly John Doe's phone (out of millions of connected phones).

Deliver it in milliseconds (not seconds).