For models that regularly hit the output token limit, if there were a way to pay 2X credits to get longer output, I think I would use it a lot.
A similar idea was suggested for Sonnet (https://feedback.getmerlin.in/feature-requests/p/longer-output-mode-for-claude-35-sonnet), but this would be for all compatible models.