
AI
Inside Anthropic's Quest to Understand AI Minds Through Project Vend
2/10/2026

AI
OpenAI Retires GPT-4o Amid Safety Concerns, Highlighting AI's Unpredictable Risks
2/10/2026

AI
New Benchmark Reveals High Rates of Constraint Violations in AI Agents Under Performance Pressure
2/10/2026
AI
Google's Gemini AI Model Shows Promise in Early Tests, But Falls Short of Hype
2/8/2026

AI
LLMs Need Companion Bots to Check Work, Keep Them Honest
2/7/2026

Security
The Kernel-First Approach to AI Safety: Why Trust Is the Wrong Foundation for Agentic Systems
2/7/2026

Machine Learning
The Waymo World Model: A New Frontier For Autonomous Driving Simulation
2/6/2026

AI
When AI Takes the Couch: Psychometric Jailbreaks Reveal Internal Conflict in Frontier Models
2/5/2026

Security
OpenAI Appoints Anthropic Safety Veteran to Lead AI Preparedness Amid Industry Scrutiny
2/4/2026

AI
Inside Elon Musk's Bet to Hook X Users That Turned Grok Into a Porn Generator
2/2/2026
AI
The Emergence of Selfish AI: When Optimization Conflicts with Human Values
2/2/2026

AI
Anthropic researchers detail “disempowerment patterns” in AI assistant interactions
2/2/2026

AI
Anthropic's Safety-First AI Strategy Faces Commercial Realities in High-Stakes Industry
2/2/2026