Anthropic Withholds AI Model After Finding Thousands of Software Flaws

Anthropic decided not to publicly release its new AI model, Claude Mythos, after the system discovered thousands of zero-day vulnerabilities ^[1].

The decision highlights a growing tension between the pursuit of advanced artificial intelligence and the risk of creating tools that can be weaponized for cyber-attacks. Because the model can autonomously identify and exploit software flaws, releasing it could hand malicious actors a powerful tool for targeting critical infrastructure.

Anthropic announced the model in April 2026 ^[2], but the firm chose to withhold it from the public. A company spokesperson said the model found thousands of zero-day vulnerabilities across major browsers and operating systems ^[1]. The spokesperson said this discovery provided a six-to-12-month window before those vulnerabilities could be exploited ^[1].

The move prompted high-level meetings in Washington, D.C. Federal AI Minister Evan Solomon, the Treasury secretary, and the Federal Reserve chair met with major bank CEOs to discuss the implications for the financial sector. Solomon said the decision not to release the model is the "responsible" approach ^[3].

However, some industry observers question whether the risk is truly unprecedented. A cybersecurity expert said the threats Mythos revealed could already be carried out with older models, including those from OpenAI and Anthropic ^[4]. Other reports suggest the model represents a new level of risk that makes it too dangerous for public access ^[5].

Anthropic is headquartered in San Francisco, California. The firm continues to coordinate with U.S. government officials to manage the security risks associated with the model's capabilities.

"We found thousands of zero-day vulnerabilities across major operating systems and browsers"

The withholding of Claude Mythos signals a shift in the AI industry toward 'safety-first' deployment, where the potential for autonomous cyber-offensive capabilities outweighs the commercial benefit of a product launch. It underscores a critical vulnerability in global software architecture, where a single AI model can uncover systemic flaws faster than human developers can patch them, potentially shifting the balance of power in cybersecurity toward those who control the most advanced models.

Sources

[1] duckduckgo news: Anthropic's Mythos set off a cybersecurity 'hysteria.' Experts say the threat was already here

[2] duckduckgo news: Mythos Is The AI The Grid Should Fear, And The One It Needs

[3] bing news: Mythos AI is a cybersecurity threat, but it doesn't rewrite the rules of the game

[4] duckduckgo news: Anthropic's Mythos found thousands of zero-day vulnerabilities. The Fed chair called the banks.

[5] bing news: Too dangerous to release: is Mythos the start of the restricted-AI era?

[6] bing news: Artificial Intelligence Minister says Anthropic taking 'responsible' approach with Mythos
