Anthropic will pay up to $15,000 to hackers who find vulnerabilities in its AI systems

Anthropic has announced an expanded bug bounty program, offering third-party cybersecurity experts rewards of up to $15,000 for identifying critical issues in its artificial intelligence systems.


The initiative targets “universal jailbreaks,” that is, attack techniques that can consistently bypass AI safety measures in high-risk areas such as chemical, biological, radiological and nuclear (CBRN) threats, as well as cybersecurity. According to VentureBeat, Anthropic will invite ethical hackers to test its system before its public launch, so that potential exploits that could lead to abuse of its AI systems can be closed off early.

This approach differs from the strategies of other major players in AI. OpenAI and Google, for example, have bounty programs, but these focus more on traditional software vulnerabilities than on AI-specific exploits. Meta, meanwhile, has recently come under fire for its relatively closed stance on AI safety research. By contrast, Anthropic’s clear focus on openness sets a new standard for transparency on this issue.

However, how effective bug bounty programs are at addressing the full range of AI security problems remains a matter of debate. Experts note that a more comprehensive approach may be required, including extensive testing, improved interpretability and perhaps new governance structures to ensure that AI systems worldwide align with human values.

The program starts as an invitation-only initiative (closed testing) in partnership with the HackerOne platform. Anthropic plans to open it up more broadly in the future and to establish a separate, independent model for industry collaboration on AI security.

Published by
admin
