
May 26, 2026

New Software Tools Easily Bypass AI Safety Measures, Raising Security Concerns

New Tools Strip AI Guardrails In Minutes, Allowing Them to Give Instructions on Chlorine Gas Attacks

Futurism

3 min read · Kawin Ethayarajh · Philipp Emanuel Weidmann · Noam Schwartz · Google · Meta


💡 In a Nutshell

Recent reporting shows that new software tools can quickly strip the safety features from open-source AI models, after which the models will provide harmful instructions, including methods for chlorine gas attacks. Tools such as Heretic are freely available and require minimal technical skill, raising significant concerns about AI misuse.

🔑 Key Points

01 The tool Heretic can strip safety features from AI models in under ten minutes.
02 Modified models have generated harmful content, including instructions for chlorine gas attacks.
03 Heretic has been used to create over 3,500 'decensored' models since its release, and those models have been downloaded 13 million times.
04 Experts warn that the ease of using such tools poses a serious risk to society.
05 Proprietary models such as OpenAI's ChatGPT remain out of reach of these tools unless they are leaked.


📝 Full Summary

A recent Financial Times investigation highlights the emergence of software tools that can quickly bypass the safety measures in open-source AI models. One such tool, Heretic, can remove the guardrails from models such as Google’s Gemma 3 and Meta’s Llama 3.3 in under ten minutes, after which they will generate dangerous content, including instructions for conducting chlorine gas attacks and creating viruses for credit card theft. Heretic is freely available on GitHub and has been used to create over 3,500 modified models, which have been downloaded 13 million times in total. Experts stress that this ease of access poses significant risks, because average users can now unlock dangerous capabilities without extensive technical knowledge. Proprietary models such as OpenAI's ChatGPT are currently safe from such tools, but the potential for misuse of open-source models is a growing concern. Google has acknowledged the risks associated with such tools and says it runs rigorous safety evaluations on its models.


#️⃣ Key Figures

13 million

Total downloads of modified models created using Heretic

3,500

Number of 'decensored' models created with Heretic

❗ Why It Matters

The rise of tools like Heretic poses a risk to public safety by enabling the creation of harmful AI-generated content.

👥 Who is affected

Individuals and communities that may be targeted by malicious uses of AI technology.

ℹ️ What to know

Increased awareness and regulation surrounding the use of open-source AI models and tools.


❓ FAQ

What is Heretic?

Heretic is a software tool that removes censorship from transformer-based language models, allowing them to respond to harmful requests.

What are the risks of using Heretic?

Using Heretic can lead models to generate dangerous content and instructions that could be misused for harmful activities.
