Great Stack to Doesn't Work #9
Distributed Tracing: "Why Does This Request Take 3 Seconds?"
A survival guide for when everything goes wrong in production.
A user clicks "Place Order." The spinner spins. Three seconds pass. The order completes.
Three seconds. For a button click. The product manager asks: "Why does this take 3 seconds?" You check the API gateway. 50ms. You check the order service. 80ms. You check the payment service. 120ms. You check the inventory service. 60ms. The total is 310ms. Where's the other 2,690ms?






