Anthropic—Signed agreement with U.S. AI Safety Institute for pre-deployment model testing
Anthropic signed agreement with NIST's U.S. AI Safety Institute to provide access to major new models before and after public release for collaborative research on capability evaluation and safety risk mitigation. The Institute will provide feedback on potential safety improvements.
Scoring Impact
| Topic | Direction | Relevance | Contribution |
|---|---|---|---|
| AI Oversight | +toward | secondary | +0.50 |
| AI Safety | +toward | primary | +1.00 |
| Overall incident score = | +0.443 | ||
Score = avg(topic contributions) × significance (medium ×1) × confidence (0.59)
Evidence (1 signal)
Anthropic signed Memorandum of Understanding with U.S. AI Safety Institute on August 29, 2024
On August 29, 2024, the U.S. AI Safety Institute at NIST announced agreements with Anthropic and OpenAI - the 'first of their kind' between the U.S. government and tech industry. The MOU enables the Institute to receive access to major new Anthropic models before and after public release for collaborative research on capability evaluation and safety risk mitigation. Anthropic co-founder Jack Clark stated: 'Our collaboration with the U.S. AI Safety Institute leverages their wide expertise to rigorously test our models before widespread deployment.'