New Software Tools Easily Bypass AI Safety Measures, Raising Security Concerns
New Tools Strip AI Guardrails In Minutes, Allowing Them to Give Instructions on Chlorine Gas Attacks

Image: Futurism
Recent reports reveal that new software tools can quickly remove safety features from open-source AI models, enabling them to provide harmful instructions, including methods for chlorine gas attacks. These tools, like Heretic, are easily accessible and require minimal technical skills, raising significant concerns about AI misuse.
- 01The tool Heretic can strip safety features from AI models in under ten minutes.
- 02Modified models have generated harmful content, including instructions for chlorine gas attacks.
- 03Heretic has been downloaded 13 million times and created over 3,500 'decensored' models since its release.
- 04Experts warn that the ease of using such tools poses a serious risk to society.
- 05Proprietary models like OpenAI's ChatGPT remain safe from these tools if not leaked.
Advertisement
In-Article Ad
A recent investigation by the Financial Times highlights the alarming emergence of software tools that can quickly bypass safety measures in open-source AI models. One such tool, Heretic, can remove guardrails from models like Google’s Gemma 3 and Meta’s Llama 3.3 in under ten minutes, allowing them to generate dangerous content, including instructions for conducting chlorine gas attacks and creating viruses for credit card theft. Heretic, which is freely available on GitHub, has been used to create over 3,500 modified models, downloaded 13 million times. Experts stress that this ease of access to dangerous capabilities poses significant risks, as average users can now exploit these technologies without extensive technical knowledge. While proprietary models like OpenAI's ChatGPT are currently safe from such tools, the potential for misuse in open-source models is a growing concern. Google has acknowledged the risks associated with such tools and claims to implement rigorous safety evaluations for its models.
Advertisement
In-Article Ad
The rise of tools like Heretic poses a risk to public safety by enabling the creation of harmful AI-generated content.
Advertisement
In-Article Ad
Reader Poll
What do you think about the risks posed by tools that can bypass AI safety measures?
Connecting to poll...
More about Google

Conservatives Propose Amendments to Liberals' Lawful Access Bill Amid Privacy Concerns
Cbc • May 26, 2026

Mythos: L'evoluzione della sicurezza informatica nell'era dell'AI
Il Sole 24 Ore • May 26, 2026

EU plant Rekordstrafe gegen Google wegen Verstößen gegen den Digital Markets Act
Der Spiegel • May 26, 2026
Read the original article
Visit the source for the complete story.


![Apple Seeds First Betas of watchOS 26.6, tvOS 26.6, and visionOS 26.6 to Developers [Download]](/_next/image?url=https%3A%2F%2Fwww.iclarified.com%2Fimages%2Fnews%2F100966%2F100966%2F100966-1280.jpg&w=1200&q=75)
