Anthropic—Published RSP noncompliance and anti-retaliation policy for whistleblower protection

Dec 1, 2025

Anthropic published 'RSP Noncompliance and Anti-Retaliation Policy' outlining how employees can report suspected RSP violations. First frontier AI company to publicly commit to ongoing monitoring and reporting on whistleblowing system - achieving 'Level 2 Whistleblowing Transparency.'

Scoring Impact

Topic	Direction	Relevance	Contribution
Whistleblower Protection	+toward	primary	+1.00
Overall incident score =			+0.590

Score = avg(topic contributions) × significance (medium ×1) × confidence (0.59)

Evidence (1 signal)

Confirms Policy Change Dec 1, 2025 verified

Anthropic published RSP Noncompliance and Anti-Retaliation Policy in December 2025, achieving Level 1 Whistleblowing Transparency

In December 2025, Anthropic published their 'RSP Noncompliance and Anti-Retaliation Policy' (PDF) outlining how employees can report suspected RSP violations through confidential Navex reporting tool. The policy protects good-faith reporters from retaliation and extends to AI research collaborators and academic partners. Anthropic became the second leading AI company to publish such a policy (after OpenAI in October 2024) and the first frontier AI company to publicly commit to ongoing monitoring and reviews of their whistleblowing system, achieving 'Level 1 Whistleblowing Transparency.' The policy notes it can achieve 'Level 2' by publishing usage and outcome reporting. The AI Whistleblower Initiative noted Anthropic took this step 'in the absence of any regulatory or scandal-driven pressure.' METR review confirmed no RSP noncompliance reports had been received as of their 2025 pilot risk report.

Anthropic,AI Whistleblower Initiative,METR