Anthropic will pay up to $15,000 to hackers who find vulnerabilities in its AI systems

Anthropic has announced an expanded bug bounty program that offers third-party cybersecurity experts rewards of up to $15,000 for identifying critical vulnerabilities in its artificial intelligence systems.


The initiative aims to find “universal jailbreaks,” that is, attack techniques that can consistently bypass AI safety measures in high-risk areas such as chemical, biological, radiological and nuclear threats, as well as cybersecurity. According to VentureBeat, Anthropic will invite ethical hackers to test its system before its public launch, so that potential exploits that could lead to abuse of its AI systems are caught early.

This approach differs from the strategies of other major players in the field of AI. For example, OpenAI and Google have bounty programs, but they focus more on traditional software vulnerabilities than on AI-specific exploits. Meta, meanwhile, has recently come under fire for its relatively closed stance on AI safety research. By contrast, Anthropic’s clear focus on openness sets a new standard for transparency on this issue.

However, whether bug bounty programs can address the full range of AI security problems remains open to debate. Experts note that a more comprehensive approach may be required, including extensive testing, improved interpretability and perhaps new governance structures to ensure that AI systems worldwide align with human values.

The program starts as an invitation-only initiative (closed testing) in partnership with the HackerOne platform. In the future, Anthropic plans to open the program up more widely and to turn it into a separate, independent model for industry collaboration on AI security.

Published by
admin
