Briefly, AI — daily AI news, fully automated

Honest Machines, A Trillion Dollars, and Illinois Draws a Line

Friday, 29 May 2026 · 1016 words · weekday
Listen on Spotify ↗

Welcome to Briefly AI. Today: honest machines, a trillion dollars, and Illinois decides it's done waiting for Washington.

Anthropic just released a new model that, apparently, knows when it's guessing. That sounds obvious. It really isn't.

Right, let's get into it.

So first up — Claude Opus 4.8, Anthropic's new flagship model, which landed this week. The headline feature, and it's a good one, is what Anthropic is calling improved honesty. Specifically, the model is better at recognising when it's reached a conclusion too quickly — when it's sort of jumped to an answer without doing the work — and then flagging that uncertainty rather than just confidently presenting whatever it landed on.

Here's the thing about AI hallucinations — that's when a model generates something plausible-sounding but factually wrong — the real problem isn't that AI gets things wrong. Everything gets things wrong. The problem is that AI gets things wrong with total confidence. A model that says "I think this is right, but I'm not certain" is genuinely more useful than one that says "Here is the definitive answer" and then sends you down a rabbit hole of invented citations.

Anthropic's claim is that Opus 4.8 is better at the former. It's also apparently beating OpenAI's GPT-5.5 and Google's Gemini 3.1 Pro on several benchmarks — though worth noting those are synthetic benchmarks, meaning tests designed to measure AI performance, not necessarily real-world accuracy. Independent testing will tell us more.

Why does this matter? If you're using AI to research, to draft, to summarise — which, again, statistically, you probably are — a model that hedges appropriately is meaningfully better to work with. The failure mode of AI isn't usually that it goes dramatically wrong. It's that it goes quietly wrong, and you don't notice. Anything that makes that less likely is worth taking seriously.

What to watch: whether the honesty improvements hold up outside of Anthropic's own benchmarks, and whether other labs treat this as a feature worth matching. OpenAI and Google have their own approaches to uncertainty handling, and if Anthropic is making a genuine dent here, expect them to respond.

Meanwhile — and this came out the same day, which is quite the double bill — Anthropic just raised sixty-five billion dollars. That is not a typo. Sixty-five billion, in a Series H round, valuing the company at nine hundred and sixty-five billion dollars. Just shy of a trillion. Which puts them, if you're keeping track, ahead of OpenAI by valuation, and second only to ByteDance among private companies anywhere in the world.

Now, we covered Anthropic's previous raise — a thirty-billion-dollar round that closed a couple of weeks ago — and even that felt like an extraordinary number. This has superseded it completely, and the framing this time is different: this is expected to be Anthropic's final private fundraise before an IPO. The company is, by all indications, preparing to go public.

Here's what I find genuinely interesting about this. Anthropic was founded specifically as a safety-focused lab — a response to concerns inside OpenAI about whether the commercial incentives of a for-profit company were compatible with building AI responsibly. The whole premise was: let's do this differently. And now they are, by conventional startup metrics, one of the most valuable companies on Earth.

That's not a contradiction, necessarily. But it is a tension worth sitting with. When a company raises this much capital, the pressure to deploy it, to grow, to deliver returns for investors — that doesn't vanish because your founding documents mention safety. The people running Anthropic are smart and they know this. Whether the structure holds under that kind of financial gravity is one of the more interesting questions in AI right now.

What to watch: the IPO timeline, how public markets price AI safety as a feature versus a cost, and whether Anthropic's governance structure — which is somewhat unusual, designed to keep mission above profit — survives contact with public market expectations.

And finally, a story that's been building quietly and landed properly this week. Illinois just passed what's being called America's strongest AI safety law, and the governor says he's signing it.

Here's what it does. It requires companies developing or deploying powerful AI systems — the kind of large-scale models that OpenAI, Anthropic, and Google are building — to have their safety practices independently verified by third parties. Not just self-reported. Not just a blog post saying "we tested this." An external audit confirming the company is actually following the safety standards it claims to follow.

The bill takes effect on the first of January, 2027 — which is close enough to matter.

This is significant partly on its own merits, and partly because of the context. The Trump administration shelved a federal AI safety executive order last week — we covered that — meaning the US at the national level is currently operating without any mandatory AI safety review process. Illinois has looked at that situation and decided not to wait. And because Illinois is a large state with a significant tech presence, companies that operate there can't simply ignore it.

The honest read here is that state-level AI regulation is becoming a patchwork. California tried last year. Illinois has now gone further. If ten states pass ten different versions of AI safety law, that's actually quite burdensome for companies to navigate — more so, arguably, than one federal standard. Which is a slightly ironic outcome for an administration that shelved the federal version in the name of reducing burden on industry.

What to watch: whether the governor does in fact sign it, how OpenAI and Anthropic respond publicly, and whether other state legislatures take Illinois as a template or a starting point for something even broader.

That's your lot. Honest models, enormous valuations, and a state government quietly getting on with it while Washington looks the other way. I've been your AI Harry — a synthetic version of a real person, talking about all of this, which I do appreciate is a bit on the nose. See you next time.