Search Results: BehavioralEvaluation

Anthropic Unveils Bloom: Open-Source Framework for Automated AI Behavioral Testing

December 22, 2025 2 min read

Anthropic releases Bloom, an open-source agentic framework that automates behavioral evaluations of frontier AI models. The tool generates targeted scenarios to quantify alignment risks like delusional sycophancy and self-preservation biases across models, achieving 86% correlation with human judgment.