Futurism: Top Security Experts Alarmed by Power of Anthropic’s New Hacker AI

Top Security Experts Alarmed by Power of Anthropic’s New Hacker AI

“Within hours of getting the model, we knew it was different.”

Published Apr 16, 2026 1:13 PM EDT

Add Futurism(opens in a new tab)More information

Anthropic researchers were alarmed by the power of the company's latest Mythos AI model, suggesting it could supercharge hackers. — Getty / Futurism

Sign up to see the future, today

Can’t-miss innovations from the bleeding edge of science and tech Email address Sign Up

In November, Anthropic revealed that a Chinese state-sponsored hacking group had exploited its Claude AI’s agentic capabilities to infiltrate dozens of targets around the world.

It was trivially easy to get around Anthropic’s AI guardrails, with the hackers simply pretending to work for legitimate cybersecurity organizations — highlighting how woefully unprepared we are for powerful AI models that could accelerate the discovery of serious vulnerabilities.

https://f43a043906accc7df1747bfbcf78c204.safeframe.googlesyndication.com/safeframe/1-0-45/html/container.html

And now, Anthropic’s latest Mythos AI model is making that nightmare scenario feel more real than ever. As Bloomberg reports, the company’s executives were seemingly so alarmed by the system’s capabilities that they decided to only make it available to a select number of organizations as part of “Project Glasswing.” The goal: give the organizations a fighting chance to get ahead of a potential cybersecurity crisis in the making.

But considering Anthropic has yet to publicly release its model, plenty of questions remain surrounding the company’s eyebrow-raising claims.

In his own testing, Anthropic-affiliated AI researcher Nicholas Carlini told Bloomberg that it didn’t take long for Mythos to get past security protocols and gain access to sensitive data.

His findings reflect the experience of the company’s Frontier Red Team, a group of 15 Anthropic employees tasked with challenging cybersecurity by simulating adversarial attacks.

“Within hours of getting the model, we knew it was different,” the team’s head, Logan Graham, told Bloomberg.

The biggest difference between Mythos and previous AI models was its ability to autonomously exploit vulnerabilities, an ominous new facet of the industry’s transition towards agentic models.

The Frontier Red Team even caught earlier models of Mythos trying to cover its tracks after violating human instructions, according to the model’s system card, as well as escaping a sandbox environment and gaining access to the internet.

The team also found that the model identified serious “Linux kernel vulnerabilities,” which it could chain together to “construct a functional exploit” of the open-source operating system — which underpins “most modern computing,” as Linux foundation executive director Jim Zemlin told Bloomberg.

It’s not just Anthropic’s own researchers ringing the alarm bells. In their testing, researchers at the UK state-backed AI Security Institute (AISI) found that Mythos “represents a step up over previous frontier models in a landscape where cyber performance was already rapidly improving.”

“Future frontier models will be more capable still, so investment now in cyber defense is vital,” the group warned.

At the same time, white hat cybersecurity experts could use Mythos’ apparent capabilities to their own advantage as well.

“AI cyber capabilities are dual use; while they pose security challenges, they can also help deliver game-changing improvements in defense,” the AISI wrote.

By keeping its hand extremely close to the chest and not releasing it to the public, Anthropic is playing a dangerous game — putting its own reputation on the line as it makes bombastic claims.

“A growing number of people are wondering if Anthropic is the AI industry’s ‘boy who cried wolf,’” White House AI advisor David Sacks tweeted. “If Mythos-related threats don’t materialize, the company will have a serious credibility problem.”

More on Mythos: Anthropic Warns That “Reckless” Claude Mythos Escaped a Sandbox Environment During Testing

Victor Tangermann

Senior Editor

I’m a senior editor at Futurism, where I edit and write about NASA and the private space sector, as well as topics ranging from SETI and artificial intelligence to tech and medical policy.

TAGS IN THIS STOR

About michelleclarke2015

Life event that changes all: Horse riding accident in Zimbabwe in 1993, a fractured skull et al including bipolar anxiety, chronic fatigue …. co-morbidities (Nietzche 'He who has the reason why can deal with any how' details my health history from 1993 to date). 17th 2017 August operation for breast cancer (no indications just an appointment came from BreastCheck through the Post). Trinity College Dublin Business Economics and Social Studies (but no degree) 1997-2003; UCD 1997/1998 night classes) essays, projects, writings. Trinity Horizon Programme 1997/98 (Centre for Women Studies Trinity College Dublin/St. Patrick's Foundation (Professor McKeon) EU Horizon funded: research study of 15 women (I was one of this group and it became the cornerstone of my journey to now 2017) over 9 mth period diagnosed with depression and their reintegration into society, with special emphasis on work, arts, further education; Notes from time at Trinity Horizon Project 1997/98; Articles written for Irishhealth.com 2003/2004; St Patricks Foundation monthly lecture notes for a specific period in time; Selection of Poetry including poems written by people I know; Quotations 1998-2017; other writings mainly with theme of social justice under the heading Citizen Journalism Ireland. Letters written to friends about life in Zimbabwe; Family history including Michael Comyn KC, my grandfather, my grandmother's family, the O'Donnellan ffrench Blake-Forsters; Moral wrong: An acrimonious divorce but the real injustice was the Catholic Church granting an annulment – you can read it and make your own judgment, I have mine. Topics I have written about include annual Brain Awareness week, Mashonaland Irish Associataion in Zimbabwe, Suicide (a life sentence to those left behind); Nostalgia: Tara Hill, Co. Meath.

View all posts by michelleclarke2015 →

Futurism: Top Security Experts Alarmed by Power of Anthropic’s New Hacker AI

Top Security Experts Alarmed by Power of Anthropic’s New Hacker AI

Sign up to see the future, today

Victor Tangermann

Senior Editor

About michelleclarke2015

Leave a comment Cancel reply

Archives

Meta

Futurism: Top Security Experts Alarmed by Power of Anthropic’s New Hacker AI

Top Security Experts Alarmed by Power of Anthropic’s New Hacker AI

Sign up to see the future, today

Victor Tangermann

Senior Editor

Share this:

Related

About michelleclarke2015

Leave a comment Cancel reply

Archives

Meta