An update on our election safeguards
Summary
Anthropic announced efforts to improve the accuracy and fairness of the election-related information Claude provides ahead of the U.S. midterm elections and other major elections worldwide.
Key Points
- Claude is trained to maintain political neutrality and address diverse political viewpoints with equal depth and analytical rigor.
- These neutrality principles are reinforced through character training and system prompts.
- Opus 4.7 and Sonnet 4.6 scored fairness ratings of 95% and 96%, respectively, across prompts spanning the political spectrum.
- The evaluation methodology and open-source datasets have been published to encourage reproduction and iteration.
Notable Quotes & Details
- Fairness ratings: Opus 4.7 scored 95% and Sonnet 4.6 scored 96%.
Intended Audience
AI researchers, policymakers, general readers