I think this might be an unsolved problem. When GPT-5 came out, they had a "rout...

siva7 · 2026-04-17T08:16:51 1776413811

You're misunderstanding the purpose of "auto"-model-routing or things like "adaptive thinking". It's a solved problem for the companies. It solves their problems. Not yours ;)

ai_slop_hater · 2026-04-17T03:18:17 1776395897

Maybe it is an unsolved problem, but either way I am confused why Anthropic is pushing adaptive thinking so hard, making it the only option on their latest models. To combat how unreliable it is, they set thinking effort to "high" by default in the API. In Claude Code, they now set it to "xhigh" by default. The fact that you cannot even inspect the thinking blocks to try and understand its behavior doesn't help. I know they throw around instructions how to enable thinking blocks, or blocks with thinking summaries, or whatever (I am too confused by now, what it is that they allow us to see), but nothing worked for me so far.

siva7 · 2026-04-17T05:17:37 1776403057

Because with adaptive thinking they control compute, not you

solarkraft · 2026-04-17T01:33:04 1776389584

I find that GPT 5.4 is okay at it. It does think harder for harder problems and still answers quickly for simpler ones, IME.

nomel · 2026-04-17T00:42:15 1776386535

Is knowing how hard a problem is, before doing it, solved in humans?

biglost · 2026-04-17T01:09:58 1776388198

Yes, everyweek when assigning fking points to tasks on jira/s

arthurcolle · 2026-04-17T02:58:59 1776394739

As a unit this is funny, Jira points assigned per second (now possible with parallel tool calling AIs)

Gareth321 · 2026-04-17T07:50:22 1776412222

I don't think so. If the model used to analyse the complexity is dumb, it won't route correctly. They clearly don't want to start every query using the highest level of intelligence as this could undermine their obvious attempt at resource optimisation.

I faced the same issue using Open Router's intelligent routing mechanism. It was terrible, but it had a tendency to prefer the most expensive model. So 98% of all queries ended up being the most expensive model, even for simple queries.

mochomocha · 2026-04-17T03:05:40 1776395140

It makes me think of this parallel: often in combinatorial optimization ,estimating if it is hard to find a solution to a problem costs you as much as solving it.

With a small bounded compute budget, you're going to sometimes make mistakes with your router/thinking switch. Same with speculative decoding, branch predictors etc.