I used to ask a simple question when routing coding tasks across models:

Which model is strong enough for this?

That question is still useful, but it is not the first one I ask anymore.

The better first question is:

How quickly can I verify the output?