this post was submitted on 18 May 2026

227 points (96.3% liked)

Fuck AI

7070 readers

1737 users here now

"We did it, Patrick! We made a technological breakthrough!"

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

AI, in this case, refers to LLMs, GPT technology, and anything listed as "AI" meant to increase market valuations.

founded 2 years ago

MODERATORS

VerbFlow@lemmy.world

MrMcGasion@lemmy.world

TootSweet@lemmy.world

BigMikeInAustin@lemmy.world

cynar@lemmy.world

drmeanfeel@lemmy.world

pavnilschanda@lemmy.world

CriticalMedicine@lemmy.world

WonderfulWanderer@lemmy.world

Communist@lemmy.ml

eatCasserole@lemmy.world

SpaceNoodle@lemmy.world

NutWrench@lemmy.world

Soup@lemmy.cafe

iAvicenna@lemmy.world

Tinks@lemmy.world

wizblizz@lemmy.world

corus_kt@lemmy.world

Prandom_returns@lemm.ee

JimSamtanko@lemm.ee

TrickDacy@lemmy.world

TheFriar@lemm.ee

ArmokGoB@lemmy.dbzer0.com

HawlSera@lemm.ee

andrew_bidlaw@sh.itjust.works

MeDuViNoX@sh.itjust.works

33550336@lemmy.world

Nougat@fedia.io

Lost_My_Mind@lemmy.world

Quill7513@slrpnk.net

glowing_hans@sopuli.xyz

e8d79@discuss.tchncs.de

ThefuzzyFurryComrade@pawb.social

227

The Time Bomb Went Off: AI's All-You-Can-Eat Era Just Ended in Real Time (www.thestateofbrand.com)

submitted 1 day ago by brianpeiris@lemmy.ca to c/fuck_ai@lemmy.world

49 comments fedilink hide all child comments

Excerpt:

The IPO Math Forces the Issue

Both OpenAI and Anthropic are on IPO timelines for the second half of 2026. OpenAI completed the largest private funding round in history in April, $122 billion at an $852 billion post-money valuation. Anthropic has reportedly surpassed $30 billion in annualized revenue. Massive numbers, both of them. Also both attached to companies that are still burning cash at extraordinary rates.

Public markets will not tolerate the gap between subscription revenue and compute cost that has defined the past three years. The moment either company files, analysts will demand unit economics that show a path to margin. Usage-based billing is the fastest way to demonstrate that path.

None of this contradicts the repricing thesis. The pricing war is the last land grab before the gate closes. Both companies are spending aggressively now to lock in users whose switching costs will make them sticky when prices rise. OpenAI offers two months free. Anthropic offers 50% more capacity. Both expire in July. What comes after July is the real pricing.

you are viewing a single comment's thread
view the rest of the comments

[–] NaibofTabr@infosec.pub 14 points 1 day ago (1 children)

How does this work when “good enough” AI like Deepseek V4, GLM and such are so dirt cheap they’re basically free for businesses? And available from tons of providers, or even self hostable?

Typically what separates enterprise-grade products and services from alternatives is a contract with an SLA... but that generally means there's some contractual requirements for the reliability and productivity of the product or service. I'm not sure that any of the overhyped chatbots are reliable enough to support such contractual obligations, or that there's a useful way to measure their productivity.

[–] brucethemoose@lemmy.world 11 points 1 day ago* (last edited 1 day ago) (1 children)

contract with an SLA

Plenty of hosters provide that. Cerebras, for example, fabs their own ASICs (seperate from Nvidia), builds them into servers, hosts a number of open-weights models themselves in friendly jurisdictions, and offers SLAs for enterprise clients; it doesn't get more "guaranteed" than that in AI Land, but there are tons of hosts to choose from.

https://www.cerebras.ai/build-with-us

The major sticking point is that the best open weights models are Chinese. This doesn't actually matter from a security standpoint anymore than buying a Chinese tire does; they're dumb weights anyone can finetune, host and run on whatever software/hardware stack one wants... But try explaining that technical distinction of "using Chinese AI" to executives responsible for entire corporations.

There are even attempts to "launder" Chinese models to make them palatable for western enterprise use. For example:

https://huggingface.co/microsoft/MAI-DS-R1

https://huggingface.co/unsloth/r1-1776

[–] NaibofTabr@infosec.pub 1 points 1 day ago (1 children)

Plenty of hosters provide that. Cerebras, for example, fabs their own ASICs (seperate from Nvidia), builds them into servers, hosts a number of open-weights models themselves in friendly jurisdictions, and offers SLAs for enterprise clients; it doesn’t get more “guaranteed” than that in AI Land, but there are tons of hosts to choose from.

This makes sense for first-party hardware businesses like Cerebras that are renting or selling their platform to developer businesses (second party) for the purpose of creating AI-based software tools which they will then sell as services to other businesses (third party), and I can see that guarantees could be written in a contract for the first-to-second-party relationship.

What I don't see is that any such guarantees can be effectively written or enforced in a second-to-third-party contract, where an AI SaaS company is selling their software service to companies that don't do their own development, and expect that the service they have contracted will produce reliable results.

[–] brucethemoose@lemmy.world 1 points 22 hours ago* (last edited 22 hours ago)

Actually, what Cerebra’s does is no different than any generic host. They provide API access to LLM weights, though most providers will do it with some standard open source serving software like VLLM or SGLang.

And they all use the same open weights LLMs. They arent the software developer.

Cerebras doesn’t train their own model. And I think this is fine for service guarantees as long as the weights do not change, hence will provide the exact same deterministic results at zero temperature (and generally perform the same when used as a service).

My experience is that a lot of “enterprise” LLM stuff is used in bulk, for results that can be “good enough” with a reasonable error rate. Like (for example) extracting info from literally millions of documents. Or as RAG/querying their own internal documentation.