So the cofounder of Hugging Face made a post about Qwen 3.6 being at Claude level...

solenoid0937 · 2026-05-11T14:55:43 1778511343

It's just not there yet. I have tried all the models from April, including the Gemma 4 variants.

These are so far from Opus it's not even funny. They are not close to being in the same league. Gemma might be like a frontier model from a couple years ago, but with much worse performance in long context chats.

2ndorderthought · 2026-05-11T16:20:20 1778516420

Correct, they aren't Opus. They are Sonnet with a little hand-holding. They also run on a single GPU at 40 tps.

No one is saying a local model will give you Anthropic's business in a 5-minute download. People are saying, "hmm, maybe I should do this one locally". People are also saying, "this is surprisingly good enough for me given the trade-offs."

fg137 · 2026-05-12T11:27:52 1778585272

> "hmm, maybe I should do this one locally"

If your time is worth nothing to even triage that question.

Unless you have fanatical data privacy needs or really don't have Internet access, running local models almost certainly results in negative ROI overall.

Not to mention that you need to have decent hardware (that is getting expensive by the day) to even have this conversation in the first place.

People in this post talk as if everyone has a Mac with 24GB or 32GB RAM, when the reality is that most people use a Windows laptop with a crappy integrated GPU.

anon373839 · 2026-05-11T15:37:06 1778513826

Hm. I think there is a bit of a shifting goalpost dynamic at play here. Those April releases, even the fast MoE versions, are better than big cloud models from 18 months ago. I remember when everyone was gushing about Sonnet 3.7 and what a transformative experience development was using it. So was it useful or wasn’t it? A tool doesn’t lose its usability just because a better one comes along.

To me, these small local LLMs are highly useful (and thus “usable”) even though they don’t match the output of today’s frontier models.

2ndorderthought · 2026-05-11T16:21:10 1778516470

Completely agree. I would even shift the 18 months up a bit. I have been impressed with Qwen 3.6.

fg137 · 2026-05-12T11:33:25 1778585605

I'll believe that when Uber deploys local models for developers and asks them to prefer local models over proper Anthropic ones.