
AI
AI's Mathematical Renaissance: How Reasoning Models Are Transforming Mathematics and Evaluation
2/17/2026

AI
When AI Takes the Couch: Psychometric Jailbreaks Reveal Internal Conflict in Frontier Models
2/5/2026

LLMs
FACTS Benchmark Suite Introduced to Evaluate Factual Accuracy of Large Language Models
1/12/2026

LLMs
LLM poetry and the 'greatness' question
1/11/2026