The IPO Math Forces the Issue
Both OpenAI and Anthropic are on IPO timelines for the second half of 2026. OpenAI completed the largest private funding round in history in April, $122 billion at an $852 billion post-money valuation. Anthropic has reportedly surpassed $30 billion in annualized revenue. Massive numbers, both of them. Also both attached to companies that are still burning cash at extraordinary rates.
Public markets will not tolerate the gap between subscription revenue and compute cost that has defined the past three years. The moment either company files, analysts will demand unit economics that show a path to margin. Usage-based billing is the fastest way to demonstrate that path.
None of this contradicts the repricing thesis. The pricing war is the last land grab before the gate closes. Both companies are spending aggressively now to lock in users whose switching costs will make them sticky when prices rise. OpenAI offers two months free. Anthropic offers 50% more capacity. Both expire in July. What comes after July is the real pricing.
This makes sense for first-party hardware businesses like Cerebras that are renting or selling their platform to developer businesses (second party) for the purpose of creating AI-based software tools which they will then sell as services to other businesses (third party), and I can see that guarantees could be written in a contract for the first-to-second-party relationship.
What I don't see is that any such guarantees can be effectively written or enforced in a second-to-third-party contract, where an AI SaaS company is selling their software service to companies that don't do their own development, and expect that the service they have contracted will produce reliable results.
Actually, what Cerebra’s does is no different than any generic host. They provide API access to LLM weights, though most providers will do it with some standard open source serving software like VLLM or SGLang.
And they all use the same open weights LLMs. They arent the software developer.
Cerebras doesn’t train their own model. And I think this is fine for service guarantees as long as the weights do not change, hence will provide the exact same deterministic results at zero temperature (and generally perform the same when used as a service).
My experience is that a lot of “enterprise” LLM stuff is used in bulk, for results that can be “good enough” with a reasonable error rate. Like (for example) extracting info from literally millions of documents. Or as RAG/querying their own internal documentation.