Run AI model failover drills with schema-safe retries, fallback contracts, circuit breakers, golden tasks, and recovery logs before provider issues hit users.