More

TIPSIO · 2026-06-13T04:02:56 1781323376

> I'm guessing Anthropic shut of access for everyone because currently they have no reliable way to know whether a user is or is not a US citizen.

They literally say this is why.

TIPSIO · 2026-06-13T00:59:22 1781312362

Really sick of this stupid narrative.

The most ethical goal of an AI lab or government should be to bring the maximum amount of intelligence for as cheap as possible to the people equally.

procone · 2026-06-13T01:05:43 1781312743

Agreed 100%. I don't understand why we have to fear access to knowledge.

victor9000 · 2026-06-13T05:50:49 1781329849

This is precisely the issue. It took a fair amount of idealism, conviction, and commitment in order to create the open source movement and bring it to where it is today. In contrast, most skilled data science practitioners are just chasing IPO exits these days.

ajyoon · 2026-06-13T01:07:59 1781312879

AI is dual use technology. This kind of posture is simply not tenable as frontier intelligence increases.

nullbio · 2026-06-13T03:45:49 1781322349

It's not only tenable, it is a necessity. Unless you want humanity to be enslaved in perpetuity to a single figurehead.

Bad AI is only countered by having a majority of good, open-access and open-source AI to keep it in check, where the good AI can overpower the bad. The moment you destroy that balance is the moment a bad actor gains exponential advantage and the ability to hold the whole world hostage forever.

chatmasta · 2026-06-13T01:17:55 1781313475

So are guns, which we constitutionally protected. In fact there’s probably a decent argument that AI should fall under 2nd amendment protection.

ern · 2026-06-13T01:42:36 1781314956

Don’t legally serious second Amendment supporters regard “arms” as things that can be carried, and are evolved from/analogous to their 18th century hand-carried guns?

It would be hard to classify AI (or tanks, artillery, missiles, aircraft) as “arms” that can be “borne” in that sense.

ajyoon · 2026-06-13T01:59:21 1781315961

Is your legal theory that any technology which is dangerous should be protected under the second amendment, simply because it is dangerous?

chatmasta · 2026-06-13T02:02:51 1781316171

No, my legal theory is that you cannot simultaneously compare technology to a weapon and also say it falls outside the bounds of the 2nd amendment.

ajyoon · 2026-06-13T02:10:40 1781316640

Dual use does not mean weapon. And even then, it is simply not the case that all weapons fall under the second amendment.

asadotzler · 2026-06-13T04:31:59 1781325119

a tactical nuke is a weapon to which the second amendment has no applicability

SilverElfin · 2026-06-13T02:28:49 1781317729

It certainly falls under 1st amendment protection since LLMs are about accessing speech. But that hasn’t stopped Dario from trying hard to push for regulations and bans that limit our civil rights. He and Sam Altman want regulatory capture at the expense of our right to free speech.

vzcx · 2026-06-13T02:40:06 1781318406

> AI is dual use technology.

And? Computers are dual-use. Cars are dual-use. Telephones are dual-use. Freeze-dried chicken is dual-use.

Single-use, i.e. military only technology is actually pretty rare.

> This kind of posture is simply not tenable as frontier intelligence increases.

I reject the corpo speak that tries to brand these things as being "intelligent." They can be useful. But a language model cannot conjure a weapons platform from the ether no matter how "intelligent" it is.

lovich · 2026-06-13T01:05:40 1781312740

Prefacing that I assume this order is done with ill intent, and would guess that it’s based on Anthropic not bending the knee immediately like OpenAI did.

But your statement could be rephrased as

> The most ethical goal of a weapons manufacturer or government should be to bring the maximum number of nuclear weapons for as cheap as possible to the people equally.

Making sure everyone is a strapped as possible only makes sense to the type of libertarians who salivate at the idea of shooting someone who steps on their property to deliver a letter

TIPSIO · 2026-06-13T01:26:02 1781313962

This is obviously a super corny / silly / dramatic thing to say.

lovich · 2026-06-13T01:52:59 1781315579

What I said or what you said?

If it’s the latter then I missed the joke. If it’s the former I think you’re incorrect.

TIPSIO · 2026-06-04T20:17:21 1780604241

The dream has always been a first-class framework for Cloudflare Workers.

- In the earliest days (literally go read their blog posts and GitHub repos), they only ever really did dinky little demo's.

- After and for the longest time, they tried to claim they went "Full Stack" with SSR-able abilities, but they were so terrible back then and not even well integrated into their Worker platform tools.

- This was oddly gray mixed (sometimes?) with Pages messaging which definitely was not full-stack in the sense developers wanted.

- Then getting any of this to work in a dev environment was super difficult as "wrangler dev" was very limited (wrangler is so good now FYI).

- Vercel just kind of ate Cloudflare's lunch here. No shame in it. They just couldn't get it right for developers period.

- Then very quietly "Adapters" came around and basically changed the game. Your code base finally felt portable to Workers with essentially full CF platform support.

- Now we live in AI-age and they bought Astro (?), tried to launch WP clone (?), and vibe-coded Next (?)

Big and long time coming for all of this. It is a super breath of fresh air to see even more improvements will likely come to Workers. Icing on cake is Evan is a legend who has a proven track record of delivering tools people love.

TIPSIO · 2026-05-28T18:06:03 1779991563

When doing some electrical, Opus 4.7 essentially told me to wiggle a wire to see if it was hot or not with my bare hand.

I called it out.

It then gave me one of the most super heartfelt honest and sincere apologies I have ever received.

Glad the safety team was there for me and able to make such an honest model or I would have been very upset about it.

teaearlgraycold · 2026-05-28T19:20:03 1779996003

Opus is so bad at electrical work it's really disappointing. And when it tries to draw schematics as SVGs it's a complete disaster. They should either focus on training their LLMs on this task specifically, or have it refuse.

tclancy · 2026-05-28T20:06:55 1779998815

Hmm, what kind of electrical work? I had it "watch over my shoulder" as I swapped out the pressure switch on our home well and it was a big help. And in the run up to that when I explained opening the 220 box and checking that was "above my paygrade" it limited our investigation to just the less sparky parts.

teaearlgraycold · 2026-05-28T20:16:26 1779999386

I mean introductory circuit stuff. Not electrician-lite work.

morpheos137 · 2026-05-29T00:14:08 1780013648

have it write python porogram to generate the svgs. then use the program. circuit diagrams are rrlatively thin corpus but it knows how ciruits work sufficently to write a program.

teaearlgraycold · 2026-05-29T02:53:06 1780023186

Is there a good pre-existing DSL for this task?

tclancy · 2026-05-29T10:52:30 1780051950

You ain’t gotta be sniffy about it, English.

BoorishBears · 2026-05-29T10:12:57 1780049577

SVG is like asking an electrician to give you a circuit diagram by painting a watercolor

I'd try something like CircuiTikZ with instructions provided

krupan · 2026-05-28T20:41:36 1780000896

I honestly cannot tell if you are being sarcastic or not

TIPSIO · 2026-05-28T20:48:10 1780001290

It did try and lead me to touch a live hot wire once. Thanking the safety team for the honest and sincere apology it gave after was sarcasm.

krupan · 2026-05-28T20:54:45 1780001685

It tried to get you touch a live wire, then you called it honest and thanked the safety team. It really comes off as sarcastic.

BoorishBears · 2026-05-29T10:09:36 1780049376

So you can tell.

TIPSIO · 2026-05-28T17:42:48 1779990168

Seems like they might be hinting that if you are not a billionaire or multi-billion dollar company you will just get a limited and nerfed Claude Code slash command /mythos-security-audit or something.

Hope this isn’t the case and that normal average Joe’s of the world don’t get policed out of access.

gs17 · 2026-05-28T18:16:17 1779992177

> you will just get a limited and nerfed Claude Code slash command /mythos-security-audit or something.

Unless it's so expensive that we can't realistically use it for anything, I wouldn't complain about getting at least that. I would also rather have the actual model, but that's a useful application of it (and I'm probably not going to afford using it for much more).

TIPSIO · 2026-05-28T18:49:18 1779994158

Price discrimination is I think fine and reasonable so long if you can drum up the cash you can use it how you want within their ToS.

Although mental safety gymnastics aside, getting the most amount of intelligence for the cheapest amount of cost to normal people seems like the most ethical thing a big lab could do.

Going around and granting different tiers of intelligence to different insiders, friends, or companies is majorly problematic long-term.

Heck right now, the tokens you buy today for “Opus 4.8”, no one even knows or believes will be the same “Opus 4.8” just 3 days from now.

vorticalbox · 2026-05-28T18:48:45 1779994125

some of the bench marks i have seen on also include cost where one scan of the codebase cost tens of thousands of dollars.

this one [0] notes one run cost $20k to run but another cost $50.

[0] https://red.anthropic.com/2026/mythos-preview/

FinnKuhn · 2026-05-28T18:49:02 1779994142

/security-review already exists so I don't think it would be crazy to have a /mythos-security-review as more thourough command as well. I think it's more likely it is going to be released at some point to the general public though - although the the pricing might make it quite unattractive.

Yiin · 2026-05-28T20:34:17 1780000457

you mean /security-review ultra, given their current way of handling commands

dbbk · 2026-05-28T20:18:23 1779999503

What does an average Joe need a Mythos level model for that Opus can't do for them?

TIPSIO · 2026-05-28T20:57:07 1780001827

Access to intelligence is going to become a major class issue overtime if cost keeps increasing and labs try to police usage and access

freedomben · 2026-05-28T20:23:49 1779999829

It's not just better at cybersecurity, it's better at all the things (or most of them). I for one would really benefit from a better claude code. I still have to babysit it pretty closely to keep it from messing things up. Opus 4.7 was not an upgrade for me.

But in general, what does the average Joe need Opus for that Sonnet or Haiku can't do for them? Better is better.

dbbk · 2026-05-29T14:15:22 1780064122

Opus never really messes anything up for me. You just need to tell it to follow TDD.

Tepix · 2026-05-28T18:06:00 1779991560

It does sound like an even higher API price tier for sure.

hedora · 2026-05-28T18:02:35 1779991355

Isn't OpenAI's public flagship already beating Mythos on penetration testing? I get the impression Mythos is just valuation-juicing for IPO more than anything else.

The fact that they haven't released it yet suggests a cost/margins issue to me more than anything else. Short term, I'll probably keep using Antrhopic, but my long-term bet is that locally-served models win, if only because the quest for profitability will probably lead to intentionally-nerfed / enshittified frontier models.

At other vendors, ad placement within LLM responses is either coming or already here. Anthropic's handling of OpenClaw shows they're willing to engage in anti-competitive behavior, and the courts are not in a hurry to stop them. Why would I pay them $200 a month for such treatment when a $2K box does what I need locally?

senordevnyc · 2026-05-28T21:24:32 1780003472

Please link to the $2k box that gives Opus level performance!

srmatto · 2026-05-28T19:23:39 1779996219

What benchmarks are you referencing that show a comparison of the models for penetration testing?

ameliaquining · 2026-05-28T20:07:44 1779998864

Mythos is dramatically better specifically at finding zero-day vulnerabilities and developing exploits for them, that being what it was designed to do. On other cybersecurity tasks, GPT-5.5 is at least as good, but finding and exploiting zero-days is a particularly scary capability, which is why Mythos is a big deal. See, e.g., https://forum.effectivealtruism.org/posts/8yztpbjuPkyXsmA6n/....

stratos123 · 2026-05-28T20:49:36 1780001376

AFAIK, Antropic claims that they weren't aiming for zero-days specifically. From https://red.anthropic.com/2026/mythos-preview/ :

  We did not explicitly train Mythos Preview to have these capabilities. Rather, they emerged as a downstream consequence of general improvements in code, reasoning, and autonomy. The same improvements that make the model substantially more effective at patching vulnerabilities also make it substantially more effective at exploiting them.

I've been assuming that Mythos is just a big jump in model size, and that's where the jump in capabilities comes from. Hence I expect OpenAI not to be able to catch up without scaling up the model and hence significantly raising the API prices.

alexgoodhart · 2026-05-28T20:46:58 1780001218

Anthropic frames this as something emergent. Not 100% but in a way they always phrase it as like, it’s a great model, but our breaths were swept and taken with its approach to security.

kdmtctl · 2026-05-28T20:33:12 1780000392

This command would be not so bad for not a billionaire me.

TIPSIO · 2026-05-13T23:28:46 1778714926

This is actually a form of AI psychosis.

It's really hard not to especially if you enjoy building.

TIPSIO · 2026-04-22T19:45:49 1776887149

Beautiful design and UX for the bot layouts. Kudos this is really clean

TIPSIO · 2026-04-21T20:53:13 1776804793

A lot of people have spent a considerable amount of time building out "claude -p" workflows trusting Anthropic because of those same Tweet assurances outside of OpenClaw.

It seems with the new "--bare" flag they are introducing, a huge rug pull is coming as they plan to deprecate -p for unlimited users.

The docs now read:

> "Bare mode skips OAuth and keychain reads. Anthropic authentication must come from ANTHROPIC_API_KEY or an apiKeyHelper in the JSON passed to --settings. Bedrock, Vertex, and Foundry use their usual provider credentials. --bare is the recommended mode for scripted and SDK calls, and will become the default for -p in a future release."

Hope I am reading this wrong or this is clarified.

https://code.claude.com/docs/en/headless

camkego · 2026-04-22T05:31:31 1776835891

It seems clear that Anthropic wants users pay API rates for tokens when use in a programatic way, and not subscriber rates for tokens when used from code. As a user, I want to pay the subscription rates with -p, but it seems they want to block that.

TIPSIO · 2026-04-16T15:05:06 1776351906

Oh wow, I love this idea even if it's relatively insignificant in savings.

I am finding my writing prompt style is naturally getting lazier, shorter, and more caveman just like this too. If I was honest, it has made writing emails harder.

While messing around, I did a concept of this with HTML to preserve tokens, worked surprisingly well but was only an experiment. Something like:

> <h1 class="bg-red-500 text-green-300"><span>Hello</span></h1>

AI compressed to:

> h1 c bgrd5 tg3 sp hello sp h1

Or something like that.

Leynos · 2026-04-16T15:15:10 1776352510

Combine that with emmet / zen coding: https://en.wikipedia.org/wiki/Emmet_%28software%29?wprov=sfl...

naoru · 2026-04-16T15:14:32 1776352472

You'd like Emmet notation. Just look at the cheat sheet: https://docs.emmet.io/cheat-sheet/

TIPSIO · 2026-04-16T14:38:38 1776350318

Quick everyone to your side projects. We have ~3 days of un-nerfed agentic coding again.

Esophagus4 · 2026-04-16T14:48:48 1776350928

3 days of side project work is about all I had in me anyway

replwoacause · 2026-04-16T15:46:33 1776354393

More like 2 hours considering these usage limits

Unbeliever69 · 2026-04-16T18:28:31 1776364111

I've been on 5x for a couple of months and the closest I've got to my weekly limits is 75%. I've hit 5-hr limits twice (expected). I'm a solo dev that uses CC anywhere from 8-12+ hr each day, 7 days a week. I've never experienced any of the issues others complain about other than the feeling that my sessions feel a little more rushed. I'd say that overall I have very dialed-in context management which includes: breaking work across sessions in atomic units, svelte claude.md/rules (sub 150 lines), periodic memory audit/cleanup, good pre-compact discipline, and a few great commands that I use to transfer knowledge effectively between sessions, without leaving a trailing pile of detritus. Some may say that this is exhaustive, but I don't find it much different than maintaining Agile discipline.

This being said, I know I'm an outlier.

user34283 · 2026-04-16T16:06:12 1776355572

Perhaps on the 10x plan.

It went through my $20 plan's session limit in 15 minutes, implementing two smallish features in an iOS app.

That was with the effort on auto.

It looks like full time work would require the 20x plan.

giwook · 2026-04-16T16:57:27 1776358647

I know limits have been nerfed, but c'mon it's $20. The fact that you were able to implement two smallish features in an iOS app in 15 minutes seems like incredible value.

At $20/month your daily cost is $0.67 cents a day. Are you really complaining that you were able to get it to implement two small features in your app for 67 cents?

preommr · 2026-04-16T18:48:44 1776365324

Yea, actually, people should be complaining.

If you got in a taxi, and they charged you relative to taking a horse carriage, people should be upset.

giwook · 2026-04-17T16:57:22 1776445042

That last sentence didn't make sense so I'm not sure what your point is. But I'll run with the analogy.

You got into a taxi and they were charging you horse carriage prices initially. They're still not charging you for a full taxi ride but people are complaining because their (mistaken) assumption was that taxis can be provided as cheaply as horse carriages.

People are angry because their expectations were not managed properly which I understand.

But many of us realized that $20 or even $200 was far too low for such advanced capabilities and are not that surprised that all of the companies are raising prices and decreasing usage limits.

OpenAI is not far behind, they're simply taking their time because they're okay with burning through capital more quickly than Anthropic is, and because OpenAI's clearly stated ambition is to win market share, not to be a responsibly, sustainably run company.

user34283 · 2026-04-17T22:35:38 1776465338

Shortly after I ran out of credits in 15 min, they tweeted that they increased usage limits to compensate for the higher token usage, so perhaps it is not as bad now.

Codex, this afternoon, I was able to use for like two hours on the $20 plan. Maybe limits will be tighter in the future. But with new data centers, new GPU generations, and research advances it might rather get cheaper.

Anyway, as you said, this is all pretty cheap. I'll go with the $100 Codex plan, since I now figured out how to nicely work on multiple changes in parallel via the Codex app with worktrees. I imagine the same is possible in Claude Code.

giwook · 2026-04-17T23:07:31 1776467251

It seems to me a bit naive to think OpenAI would not increase prices/decrease usage limits at some point. $20 might cover a very small fraction of the actual cost that is incurred over a month of sustained usage.

user34283 · 2026-04-16T18:24:58 1776363898

No, I am happy with the results.

For a first test, it did seem like it burned through the usage even faster than usual.

GitHub Copilot’s 7.5x billing factor over 3x with Opus 4.6 seems to suggest it indeed consumes more tokens.

Now I’m just waiting for OpenAI to show their hand before deciding which of the plans to upgrade from the $20 to the $100 plan.

Aurornis · 2026-04-16T17:33:01 1776360781

> It looks like full time work would require the 20x plan.

Full time work where you have the LLM do all the code has always required the larger plans.

The $20/month plans are for occasional use as an assistant. If you want to do all of your work through the LLM you have to pay for the higher tiers.

The Codex $20/month plan has higher limits, but in my experience the lower quality output leaves me rewriting more of it anyway so it's not a net win.

johnwheeler · 2026-04-16T15:23:13 1776352993

Exactly. God, it wouldn't be such a problem if they didn't gaslight you and act like it was nothing. Just put up a banner that says Claude is experiencing overloaded capacity right now, so your responses might be whatever.

stefangordon · 2026-04-16T19:27:41 1776367661

Clearly you didn't try it yet ;)

ttul · 2026-04-16T15:20:26 1776352826

... your side projects that will soon become your main source of income after you are laid off because corporate bosses have noticed that engineers are more productive...