Hacker News | fuddle's comments


I'm surprised to no longer see Opus 4.6 on Cursorbench. I think there is a subset of Claude fans that are still adamant that Opus 4.6 is the best version.

Composer 2.5 stands out here at nr. 9. This model is fast and clever.

I feel like using Fable in the name is a mistake, who knows how long that model will be around.

You could call it aiproductsexchange.com

Bold move leaving out the dash between words a la experts-exchange lol.

thatsthejoke.jpg

expertsexchange.com was a site from the before times.


I wondered if it was intentional, but thought I would double down on it in case people missed it.

Was the dashless domain really a site (or the site) at one point?


It was dashless for many years.

As dashless and pointed as https://penisland.net/ !

always have a backup plan (:

Without the sex change in them: AIProductMarket.com, AIProductHub.com, AIProductMarketplace.com, AIToolMarket.com, AIToolsHub.com

That misses the point entirely....


I don't think using the name Fable is wrong, but I think a pool of Fables should be called a Grimm, or possibly an Aesop.

Perhaps Grimoire?

"A grimoire is a textbook of magic and sorcery. Traditionally, it contains instructions for casting spells, performing divination, creating magical objects like talismans, and summoning supernatural entities such as angels or spirits."

Seems to fit.


It's how they name classes of models; presumably this implies something about the relative quantization / size of the model, not about the specific performance. E.g. Fable 5 will be better than Opus 5, better than Sonnet 5, etc. The 5 is the version number of the particular iteration / training run at this class of model.

I think they mean: I feel like using [Sonnet/Opus/Fable] in the name [URL] is a mistake, who knows how long that model will be around

But it sounds like FableFool so it has that going for it.

Even if that product disappears, OpenAI will never Anthropic forget it.

It's only logical if you are anti-immigration, which is what the US is now.


The population has always been anti-immigration. But what the population wants is barely considered.


This makes it seem like roughly an even split:

https://imgur.com/a/4W9Ub2t



TIL a new word - "boosterism"


It looks like they are using the "agentic AI era" as an excuse to restructure in order to boost margins. GAAP gross margin dropped ~5 points YoY (76% -> 71%)


Whatever the play here they can’t be angling for any external PR or internal morale boost. What if they wrote: “This is a tough economy and we have to tighten our belts.” Maybe that’s naive of me. Bad signal to investors as opposed to insignificant employees and commoners (PR)?

But contrast with this:

> The way we work at Cloudflare has fundamentally changed. We don’t just build and sell AI tools and platforms. We are our own most demanding customer. Cloudflare’s usage of AI has increased by more than 600% in the last three months alone. Employees across the company from engineering to HR to finance to marketing run thousands of AI agent sessions each day to get their work done. That means we have to be intentional in how we architect our company for the agentic AI era in order to supercharge the value we deliver to our customers and to honor our mission to help build a better Internet for everyone, everywhere.

What is this even saying? We use a lot of AI. And not just for other people... for ourselves. This means that: we need to be intentional?

What is a regular, not-investor, person supposed to glean from this? We’ve hit the automation jackpot: some of you will be fired, some of you will get more work for the same pay?[1] Along with shoving your face with euphoric buzzwords “AI era”, “supercharge the value”.

I must surmise that whatever PR and internal morale blow (?) this causes matters very little to them. They are not at all afraid of any backlash from any lowly people.

[1] Again. This paragraph isn’t saying anything beyond that they are using AI and ho-ho things are a-changing. So one has to guess.


Wonder if they used AI to write it. "We don't just [x]. We [y]" strikes again.


Gross margin doesn't include R&D, and it looks like a bunch of engineering was laid off too.


Yikes, so incremental margins are in the 50s. I think this says it all.
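
For anyone who wants to check the "in the 50s" figure, here is a rough sketch of the arithmetic. The 76% and 71% margins are from the comment above; the revenue growth rates are purely hypothetical inputs, since no growth figure is given in this thread. Incremental gross margin here means the change in gross profit divided by the change in revenue.

```python
def incremental_gross_margin(prior_margin, current_margin, revenue_growth):
    """Gross margin earned on the revenue added since the prior period.

    Revenue is normalized so the prior period equals 1.0.
    """
    if revenue_growth == 0:
        raise ValueError("incremental margin is undefined without revenue growth")
    prior_gross_profit = prior_margin * 1.0
    current_gross_profit = current_margin * (1.0 + revenue_growth)
    return (current_gross_profit - prior_gross_profit) / revenue_growth


# The margins come from the comment above; the growth rates are hypothetical.
for growth in (0.25, 0.30, 0.35):
    margin = incremental_gross_margin(0.76, 0.71, growth)
    print(f"growth {growth:.0%}: incremental gross margin {margin:.1%}")
```

Under those assumed growth rates the result stays between 50% and 60%, which is consistent with the claim; a much lower or higher growth rate would move it outside that range.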


They do link to the Livekit docs in the footnotes: https://docs.livekit.io/transport/self-hosting/kubernetes/


Any plans to publish the benchmark results?


I have plans to publish the problems, not any plans to publish how well the LLMs perform on them. The standard for publishing benchmarks is very high, and I'm really just posting vibes here. Still, I hope my experiences are useful to some people, as others' experiences have been useful to me.


> The poison fountain itself is hosted on rnsaffn.com

Would the scrapers not just add these sites to a do-not-crawl list?


I assume the poisoner community is mirroring and likely remixing the content from there. The whole effort isn’t going to work with a single point of failure like that.


And someone will come up with a service, anti-anti-ai.dev, which will charge labs money to filter out these sites.


Cool, so if you do that, they just won't scrape your site?


Also aren't models like Mythos capable of checking for poison data on their own at this point?


I don't understand why the open-source model providers don't also publish quantized versions.


They sometimes do! Qwen, Google, etc. do them!


An easy target for a drone!


At least it'd be non-flammable helium!

