Opus 4.8 made Claude smarter. Token discipline got urgent.
I need to start with a story I can’t verify and can’t stop thinking about. Axiosrelayed an AI consultant’s claim that one client spent half a billion dollars on Claude in a single month after failing to set usage limits on employee licenses. Polymarket ran with it, and the tweet has over 29 million views. Is the claim real? I’m doubtful like others. But it's going viral, and that matters more than whether it's factual, in part, because there were more claims like it this week as companies reported earnings. Almost every one of these viral cost-blowout stories is individually unverifiable, yet everyone who uses AI at scale now believes a version of it could happen to them. Together, they paint a picture of a straining bubble.
The reckoning isn’t a verdict that AI doesn’t work. It’s the high cost for using it thoughtlessly. And the emblem of the whole moment is Anthropic's new Opus 4.8, which launched late in the week. Opus 4.8 claims to be the smartest yet from Anthropic and seems like the easiest one yet to set money on fire. The tokenmaxxing era – spending tokens as a badge of being AI-forward – looks like it’s starting to end. The skill that replaces it is token discipline: the right model, in the right amount, for the right job. The workers and companies who learn this process will win. The ones who don’t will discover that the AI budget eventually comes out of something else.
At Google Cloud Next in Las Vegas, The New Stack sat down with Finout co-founder and CEO Roi Ravhon and Pathik Sharma, who leads cloud FinOps at Google Cloud, to talk about how the financial discipline that grew into its own around managing cloud costs is now quickly being rewired for the AI era.
Ahead of its annual summit, Snowflake's largest-ever infrastructure commitment signals where the company expects its workloads (and its business) to go.
Why enterprise AI needs customization Move past one-size-fits-all AI. Use multi-model strategies and FinOps governance to optimize performance and cost across your dev lifecycle.
From Silos to Governance: Securing IT/OT Data Movement
Discover why the seam between IT and OT has become one of the most exposed attack surfaces in the enterprise. Join us on June 23 to learn how to close the compliance and visibility gaps that open up every time something crosses the IT/OT boundary ungoverned.
The Kubernetes rightsizing trust gap: Why the stakes just got higher
The gap between teams stuck at manual review and teams running continuous Kubernetes optimization isn’t technical; it’s trust. Join us on June 24 to find out how to build it, step by step.
Operationalizing AI in Observability: From Debugging to Automated Remediation
Engineering teams have more data than ever, but humans are still the bottleneck, manually stitching together logs and traces to find answers. Join us live on June 30 for a practical look at how Datadog’s Bits AI moves teams from manual investigation to autonomous remediation.
TailscaleUP is the conference for engineering, security, and IT leaders shaping the future of secure connectivity across all teams, infrastructure, services, workloads, and devices. Engineering leaders, IT & network managers, CISOs, and security architects will come together to learn how to boost developer velocity, reduce risk, and improve user experiences for everyone.
WeAreDevelopers World Congress 2026: North America
WeAreDevelopers brings the world’s largest developer event to North America, fostering global growth and connection in the tech sector. Join The New Stack onsite for all the action as we host a special welcome reception, cover the biggest show moments, and film interviews on the show floor at our booth. Use this exclusive code to save 10% on your registration: thenewstack_community.
News that Mistral is building a new inference-focused data center should not surprise; everyone needs more compute, why would Mistral be any different? What’s fascinating is how the conversation around different AI primatives (models, compute, etc.) has changed over time.
"AI is a great new tool, but it’s a tool, and when I see people saying, ‘Hey, 99% of our code is written by AI,’ I literally get angry, because those same people — I can pretty much guarantee — that 100% of their code is written by compilers. But they never say that."
— Linux and Git creator Linus Torvalds. Read more →
Connect with The New Stack
The New Stack
1111 6th Ave Ste 550, PMB 50938, San Diego, CA 92101-5211