technologyAI-Enhanced

May 1, 2026

OpenAI's GPT-5.5 Matches Anthropic's Mythos Preview in Cybersecurity Tests

GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

Ars Technica

·2 min read·gpt-5-5mythos-previewcybersecurity

GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

Image: Ars Technica

💡 In a Nutshell

New research from the UK's AI Security Institute reveals that OpenAI's GPT-5.5, recently launched, matches Anthropic's Mythos Preview in cybersecurity evaluations. Both models were tested on various Capture the Flag challenges, with GPT-5.5 slightly outperforming Mythos Preview in expert tasks, indicating significant advancements in AI capabilities for cybersecurity.

◆🔑 Key Points

01GPT-5.5 achieved an average score of 71.4% on expert cybersecurity tasks.
02Mythos Preview scored 68.6% on the same tasks, showing competitive performance.
03GPT-5.5 solved a complex disassembler task in just over 10 minutes.
04Both models struggled with the Cooling Tower simulation, failing to disrupt control software.
05GPT-5.5 succeeded in 3 out of 10 attempts on a data extraction attack simulation.

In-Article Ad

✎📝 Full Summary

Recent evaluations by the UK's AI Security Institute (AISI) show that OpenAI's GPT-5.5, which launched publicly last week, matches Anthropic's Mythos Preview in cybersecurity performance. The AISI tested both models on 95 Capture the Flag challenges, focusing on skills like reverse engineering and cryptography. GPT-5.5 achieved an average score of 71.4% on expert tasks, slightly surpassing Mythos Preview's 68.6%. Notably, GPT-5.5 completed a challenging disassembler task in 10 minutes and 22 seconds with a cost of $1.73 in API calls. In a simulated data extraction attack, GPT-5.5 succeeded in 3 out of 10 attempts, compared to 2 out of 10 for Mythos Preview. However, both models failed to perform well in the Cooling Tower simulation, which tests the disruption of power plant control software, a challenge that has stumped previous AI models as well.

In-Article Ad

##️⃣ Key Figures

71.4%

Average score of GPT-5.5 on expert cybersecurity tasks

68.6%

Average score of Mythos Preview on expert tasks

10 minutes and 22 seconds

Time taken by GPT-5.5 to solve a complex disassembler task

3 out of 10

Success rate of GPT-5.5 in a data extraction attack simulation

In-Article Ad

?❓ FAQ

These challenges are designed to evaluate AI models on various cybersecurity tasks, testing their capabilities in real-world scenarios.

GPT-5.5 shows significant advancements, particularly in expert-level tasks, but still struggles with certain complex simulations.

✦

Reader Poll

Advanced AnalyticsAnalytics

Do you believe AI models like GPT-5.5 will significantly enhance cybersecurity?

Yes, they will improve securityNo, they won't be effectiveOnly for specific tasksNot sure

Connecting to poll...

More about OpenAI

OpenAI to augment workers, not replace them: CEO Sam Altman

OpenAI's CEO Sam Altman Discusses AI's Role in the Workforce

The Economic Times • May 1, 2026

Amazon Web Services, Microsoft and NVIDIA will provide AI tech to Pentagon

Tech Giants Partner with Pentagon for AI Development

Engadget • May 1, 2026

Pentagon Partners With OpenAI, Google, SpaceX For Military AI Use But Leaves Out This Company

Pentagon Collaborates with Major AI Firms, Excludes Anthropic Amid Controversy

News 18 • May 1, 2026

Read the original article

Visit the source for the complete story.

Read Original

OpenAI's GPT-5.5 Matches Anthropic's Mythos Preview in Cybersecurity Tests

Reader Poll

Related Stories

Shanghai Mall Introduces Gaming Pods for Shoppers' Partners

Meta Considers Withdrawing Facebook and Instagram from New Mexico Amid Child Safety Legal Battle

US Cybersecurity Officials Propose Shorter Deadlines for Fixing IT Vulnerabilities Amid AI Threats

OpenAI's CEO Sam Altman Discusses AI's Role in the Workforce

Viral Google Maps Image Blurs Cow's Face, Sparks Online Reactions

More about OpenAI

OpenAI's CEO Sam Altman Discusses AI's Role in the Workforce

Tech Giants Partner with Pentagon for AI Development

Pentagon Collaborates with Major AI Firms, Excludes Anthropic Amid Controversy

Popular Topics

OpenAI's GPT-5.5 Matches Anthropic's Mythos Preview in Cybersecurity Tests

Reader Poll

More about OpenAI

Read the original article

Related Stories

Shanghai Mall Introduces Gaming Pods for Shoppers' Partners

Meta Considers Withdrawing Facebook and Instagram from New Mexico Amid Child Safety Legal Battle

US Cybersecurity Officials Propose Shorter Deadlines for Fixing IT Vulnerabilities Amid AI Threats

OpenAI's CEO Sam Altman Discusses AI's Role in the Workforce

Viral Google Maps Image Blurs Cow's Face, Sparks Online Reactions

Popular Topics

🔔 Never Miss a Story