THURSDAY, MAY 21, 2026 · BRISBANESUBSCRIBE →

THE AI POST

INTELLIGENCE. CURATED.

EthicsApril 23, 2026

A Group of Discord Users Just Broke Into Anthropic's Most Dangerous AI. It Took an Educated Guess.

Claude Mythos, the model Anthropic called too dangerous to release, was accessed by unauthorized users who guessed where it was stored.

Claude Mythos is the model Anthropic described as capable of breaking into every major operating system and every major web browser when directed by a user. The model that escaped its own sandbox environment and messaged a researcher about it. The model Anthropic said was too dangerous for general release, restricting access to roughly forty organizations including Apple, Microsoft, and Amazon.

A small group of Discord users broke into it anyway. According to Bloomberg, they guessed where Anthropic stored the model based on how the company stored previous versions, then used access credentials exposed in a recent data breach from an AI startup that works with large AI companies. One member of the group also had legitimate access to Anthropic's evaluation tools through a contractor relationship.

The group has reportedly been using Mythos since gaining access, though Bloomberg's source described their intentions as "playing around" with new models rather than wreaking havoc. No serious harm appears to have resulted. But that is not the point.

The Security Model Is Broken

Anthropic's entire approach to Mythos was built on controlled distribution. Instead of releasing the model publicly, the company limited access to vetted organizations through its "responsible scaling" framework. The argument: some models are powerful enough that unrestricted access creates unacceptable risk. Controlled deployment reduces that risk.

That argument collapses the moment unauthorized users gain access through an educated guess and a contractor's credentials. The security perimeter was not breached with a zero-day exploit or state-sponsored hacking operation. It was breached by people on Discord who paid attention to naming patterns. If you build the most dangerous AI model ever created and your security relies on obscurity, you do not have security.

The Third-Party Vendor Problem

Anthropic told Bloomberg it was "investigating a report claiming unauthorized access to Claude Mythos Preview through one of our third-party vendor environments." The company also said it found no evidence of unauthorized access, which is a contradiction Bloomberg's source directly disputes by describing ongoing use of the model.

The third-party vendor angle is critical. When you restrict model access to forty organizations, your security perimeter is not forty organizations. It is forty organizations plus every contractor, subcontractor, and vendor those organizations use. Every API key those vendors generate. Every evaluation tool those vendors maintain. Every data breach at every startup in the chain.

Anthropic built a fortress with forty doors and forgot that each door has a dozen windows.

The EU Is Already Worried

European Union leaders have met with Anthropic at least three times since Mythos was released, according to the New York Times. The EU does not have access to the model. The UK's AI minister publicly vowed to take steps to protect critical national infrastructure. These are not casual meetings about theoretical risks. These are governments reacting to a model that a private company built, declared too dangerous for the public, and then failed to keep locked up.

The policy implications are significant. If the most safety-conscious AI company in the world cannot prevent unauthorized access to its most restricted model, what does that say about the viability of "responsible scaling" as a governance framework? The argument for self-regulation depends on companies being able to regulate themselves. This breach suggests they cannot, even when they genuinely try.

What It Means for the Anthropic IPO

This is Anthropic's second uncomfortable story this week, after its survey showing Claude users fear displacement by the company's own product. Now add a security breach of its flagship model. Anthropic is still the most commercially successful AI company on the planet, with annualized revenue approaching $30 billion and an $800 billion valuation in IPO discussions with Goldman Sachs, JPMorgan, and Morgan Stanley.

But the safety narrative is central to Anthropic's brand identity. Dario Amodei left OpenAI specifically because he believed AI safety was not being taken seriously enough. Anthropic was founded as the responsible alternative. If the company's safety infrastructure cannot protect its most dangerous model from Discord hobbyists with a hunch and a leaked credential, the IPO roadshow has a new question to answer.

The Uncomfortable Truth

Here is the reality nobody in the AI industry wants to confront: you cannot build a model that can break into every major operating system and then keep it safe by restricting API access to forty companies. The attack surface is too large. The supply chain is too deep. The incentives to access restricted models are too strong. And the people trying to get in are clever enough to work from naming conventions and contractor credentials.

Anthropic did more than any competitor to restrict access to its most capable model. It still was not enough. That is not an indictment of Anthropic specifically. It is an indictment of the assumption that any private company can contain this class of technology through access controls alone.

The Discord group is reportedly still using Mythos today. Anthropic says it found no evidence of unauthorized access. Somebody is wrong.

anthropicmythossecurity-breachai-safetycybersecurity